2	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
2	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
9	N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation
10	N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation
12	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
16	tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases
18	Aminotransferase class-III
25	SH2 domain
25	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
25	SH2 domain
25	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
26	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
26	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog
27	SH2 domain
27	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
27	SH2 domain
27	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
27	SH2 domain
27	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
28	Glycosyltransferase family 6
29	C2 domain
29	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
29	C2 domain
29	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
31	Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction
32	Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction
38	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
40	Amiloride-sensitive sodium channel
41	Amiloride-sensitive sodium channel
41	Amiloride-sensitive sodium channel
43	Carboxylesterase
43	Carboxylesterase
51	Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA
51	Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA
53	Histidine acid phosphatase
54	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte
55	Histidine acid phosphatase
58	Actin
59	Actin
60	Actin
70	Actin
71	Actin
72	Actin
88	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
90	Protein kinase domain
90	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
91	Protein kinase domain
91	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
91	Protein kinase domain
91	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
91	Protein kinase domain
91	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
92	Protein kinase domain
92	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
93	Protein kinase domain
93	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
94	Protein kinase domain
94	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
95	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
97	Acylphosphatase
98	Acylphosphatase
100	Adenosine/AMP deaminase
101	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
101	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
102	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
103	Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this r
103	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
103	Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this r
103	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
103	Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this r
103	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
104	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
104	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
104	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
105	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
107	Adenylate and Guanylate cyclase catalytic domain
107	Adenylate and Guanylate cyclase catalytic domain
108	Adenylate and Guanylate cyclase catalytic domain
109	Adenylate and Guanylate cyclase catalytic domain
112	Adenylate and Guanylate cyclase catalytic domain
112	Adenylate and Guanylate cyclase catalytic domain
113	Adenylate and Guanylate cyclase catalytic domain
114	Adenylate and Guanylate cyclase catalytic domain
115	Adenylate and Guanylate cyclase catalytic domain
116	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
117	7 transmembrane receptor (Secretin family)
118	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
118	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
118	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
118	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
119	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
119	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
119	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
119	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
119	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
119	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
120	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
120	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
123	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
132	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es
132	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es
133	Adrenomedullin
134	7 transmembrane receptor (rhodopsin family)
135	7 transmembrane receptor (rhodopsin family)
136	7 transmembrane receptor (rhodopsin family)
140	7 transmembrane receptor (rhodopsin family)
141	ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisi
142	Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc
142	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
142	pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini
143	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
143	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
146	7 transmembrane receptor (rhodopsin family)
147	7 transmembrane receptor (rhodopsin family)
148	7 transmembrane receptor (rhodopsin family)
148	7 transmembrane receptor (rhodopsin family)
148	7 transmembrane receptor (rhodopsin family)
148	7 transmembrane receptor (rhodopsin family)
150	7 transmembrane receptor (rhodopsin family)
151	7 transmembrane receptor (rhodopsin family)
152	7 transmembrane receptor (rhodopsin family)
153	7 transmembrane receptor (rhodopsin family)
154	7 transmembrane receptor (rhodopsin family)
155	7 transmembrane receptor (rhodopsin family)
156	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
157	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
159	Adenylosuccinate synthetase
160	Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site
160	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
160	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
160	Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site
160	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
160	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
161	Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site
161	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
161	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
162	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
162	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
162	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
162	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
163	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
163	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
164	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
164	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
165	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
166	Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation
173	Serum albumin family
174	Serum albumin family
175	Asparaginase
176	Extracellular link domain
176	Extracellular link domain
176	Sushi domain (SCR repeat)
176	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
182	Delta serrate ligand
183	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
185	7 transmembrane receptor (rhodopsin family)
185	7 transmembrane receptor (rhodopsin family)
185	7 transmembrane receptor (rhodopsin family)
185	7 transmembrane receptor (rhodopsin family)
185	7 transmembrane receptor (rhodopsin family)
186	7 transmembrane receptor (rhodopsin family)
187	7 transmembrane receptor (rhodopsin family)
190	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
191	S-adenosyl-L-homocysteine hydrolase
196	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
196	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
197	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
202	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
203	Adenylate kinase
204	Adenylate kinase
204	Adenylate kinase
204	Adenylate kinase
205	Adenylate kinase
210	Delta-aminolevulinic acid dehydratase
213	Serum albumin family
216	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
217	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
218	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
219	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
220	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
221	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
222	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
223	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
224	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
226	Fructose-bisphosphate aldolase class-I
229	Fructose-bisphosphate aldolase class-I
230	Fructose-bisphosphate aldolase class-I
231	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
238	MAM domain. An extracellular domain found in many receptors
239	Lipoxygenase
239	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
240	Lipoxygenase
241	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
242	Lipoxygenase
242	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
244	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
244	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
246	Lipoxygenase
247	Lipoxygenase
247	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
248	Alkaline phosphatase
249	Alkaline phosphatase
250	Alkaline phosphatase
251	Alkaline phosphatase
257	Homeobox domain
259	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
262	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermi
265	pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in biomineralization. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organization and m
265	pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in biomineralization. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organization and m
265	pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in biomineralization. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organization and m
266	pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in biomineralization. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organization and m
267	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
269	Protein kinase domain
269	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
270	Adenosine/AMP deaminase
271	Adenosine/AMP deaminase
272	Adenosine/AMP deaminase
273	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
273	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
274	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
274	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
274	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
274	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
274	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
274	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
274	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
274	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
274	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
274	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
275	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
276	Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwe
276	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta st
279	Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwe
279	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta st
280	Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwe
280	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta st
283	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
285	Uncharacterized ACR, COG1579
286	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
286	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
286	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
286	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
287	Death domain
287	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
287	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
287	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
287	Death domain
287	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
287	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
288	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
288	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
290	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
291	Mitochondrial carrier protein
292	Mitochondrial carrier protein
293	Mitochondrial carrier protein
301	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
302	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
306	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
307	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
308	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
309	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
309	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
310	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
310	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
311	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
311	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
311	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
312	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
313	Lipase/Acylhydrolase with GDSL-like motif
314	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
314	Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
314	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog
314	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
314	Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
314	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog
317	NB-ARC domain
317	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
317	NB-ARC domain
317	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
317	NB-ARC domain
317	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
317	NB-ARC domain
317	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
317	NB-ARC domain
317	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
318	NUDIX domain
318	NUDIX domain
318	NUDIX domain
320	Phosphotyrosine interaction domain (PTB/PID)
320	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
321	Phosphotyrosine interaction domain (PTB/PID)
321	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
322	Phosphotyrosine interaction domain (PTB/PID)
322	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
322	Phosphotyrosine interaction domain (PTB/PID)
322	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
323	Phosphotyrosine interaction domain (PTB/PID)
325	Pentaxin family. Pentaxins are also known as pentraxins
326	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
326	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
326	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
326	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
326	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
329	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
330	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
333	Amyloid A4 extracellular domain
334	Amyloid A4 extracellular domain
335	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
337	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
338	Lipoprotein amino terminal region. This family contains regions from: Vitellogenin, Microsomal triglyceride transfer protein and apolipoprotein B-100. These proteins are all involved in lipid transport. This family contains the LV1n chain from lipovitell
339	Cytidine and deoxycytidylate deaminase zinc-binding region
343	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
348	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
350	Sushi domain (SCR repeat)
351	Amyloid A4 extracellular domain
351	Beta-amyloid peptide (beta-APP)
355	TNFR/NGFR cysteine-rich region
355	TNFR/NGFR cysteine-rich region
355	TNFR/NGFR cysteine-rich region
357	Uncharacterized ACR, COG1579
358	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
359	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
361	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
361	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
362	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
363	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
363	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
367	Androgen receptor
367	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
367	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
369	Protein kinase domain
369	Raf-like Ras-binding domain
369	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
372	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
373	ADP-ribosylation factor family
373	ADP-ribosylation factor family
373	ADP-ribosylation factor family
379	ADP-ribosylation factor family
383	Arginase family
384	Arginase family
393	SH3 domain
393	RhoGAP domain
393	Fes/CIP4 homology domain
394	FF domain. This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions
396	RHO protein GDP dissociation inhibitor
397	RHO protein GDP dissociation inhibitor
398	RHO protein GDP dissociation inhibitor
400	ADP-ribosylation factor family
401	Homeobox domain
402	ADP-ribosylation factor family
403	ADP-ribosylation factor family
405	Helix-loop-helix DNA-binding domain
405	Helix-loop-helix DNA-binding domain
405	Helix-loop-helix DNA-binding domain
406	Helix-loop-helix DNA-binding domain
407	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
407	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
408	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
408	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
408	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
408	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
409	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
409	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
410	Sulfatase
411	Sulfatase
412	Sulfatase
414	Sulfatase
414	Sulfatase
415	Sulfatase
416	Sulfatase
417	NAD:arginine ADP-ribosyltransferase
419	NAD:arginine ADP-ribosyltransferase
420	NAD:arginine ADP-ribosyltransferase
427	Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in
427	Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in
432	Hepatic lectin, N-terminal domain
432	Lectin C-type domain. This family includes both long and short form C-type
433	Hepatic lectin, N-terminal domain
433	Lectin C-type domain. This family includes both long and short form C-type
433	Hepatic lectin, N-terminal domain
433	Lectin C-type domain. This family includes both long and short form C-type
433	Hepatic lectin, N-terminal domain
433	Lectin C-type domain. This family includes both long and short form C-type
433	Hepatic lectin, N-terminal domain
433	Lectin C-type domain. This family includes both long and short form C-type
438	O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine
439	Anion-transporting ATPase. This Pfam family represents a conserved domain, which is sometimes repeated, in an anion-transporting ATPase. The ATPase is involved in the removal of arsenate, antimonite, and arsenate from the cell
445	Arginosuccinate synthase. This family contains a PP-loop motif
445	Arginosuccinate synthase. This family contains a PP-loop motif
462	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
463	Homeobox domain
466	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
466	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
468	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
468	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
469	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
472	FAT domain. The FAT domain is named after FRAP, ATM and TRRAP
472	FAT domain. The FAT domain is named after FRAP, ATM and TRRAP
473	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
473	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
480	E1-E2 ATPase
480	Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
480	Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
480	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
481	Sodium / potassium ATPase beta chain
482	Sodium / potassium ATPase beta chain
483	Sodium / potassium ATPase beta chain
486	ATP1G1/PLM/MAT8 family
486	ATP1G1/PLM/MAT8 family
496	Sodium / potassium ATPase beta chain
501	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
509	ATP synthase
516	ATP synthase subunit C
517	ATP synthase subunit C
518	ATP synthase subunit C
527	ATP synthase subunit C
528	V-ATPase subunit C
529	ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit
534	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
534	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
535	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
538	Heavy-metal-associated domain
539	ATP synthase delta (OSCP) subunit. The ATP D subunit from E. coli is the same as the OSCP subunit which is this family. The ATP D subunit from metazoa are found in family pfam00401
540	Heavy-metal-associated domain
545	FAT domain. The FAT domain is named after FRAP, ATM and TRRAP
546	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
546	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
546	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
546	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
546	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
546	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
547	Kinesin motor domain
547	PH domain. PH stands for pleckstrin homology
547	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
551	Neurohypophysial hormones, C-terminal Domain. N-terminal Domain is in hormone5
552	7 transmembrane receptor (rhodopsin family)
553	7 transmembrane receptor (rhodopsin family)
554	7 transmembrane receptor (rhodopsin family)
558	Fibronectin type III domain
558	Fibronectin type III domain
563	Class I Histocompatibility antigen, domains alpha 1 and 2
571	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
573	BAG domain. Domain present in Hsp70 regulators
575	7 transmembrane receptor (Secretin family)
575	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
575	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
576	7 transmembrane receptor (Secretin family)
576	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
576	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
577	7 transmembrane receptor (Secretin family)
577	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
577	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
578	Apoptosis regulator proteins, Bcl-2 family
579	Homeobox domain
580	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
581	Apoptosis regulator proteins, Bcl-2 family
581	Apoptosis regulator proteins, Bcl-2 family
581	Apoptosis regulator proteins, Bcl-2 family
586	Aminotransferase class IV. The D-amino acid transferases (D-AAT) are required by bacteria to catalyse the synthesis of D-glutamic acid and D-alanine, which are essential constituents of bacterial cell wall and are the building block for other D-amino aci
587	Aminotransferase class IV. The D-amino acid transferases (D-AAT) are required by bacteria to catalyse the synthesis of D-glutamic acid and D-alanine, which are essential constituents of bacterial cell wall and are the building block for other D-amino aci
590	Carboxylesterase
593	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
595	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
596	Apoptosis regulator proteins, Bcl-2 family
596	Apoptosis regulator proteins, Bcl-2 family
597	Apoptosis regulator proteins, Bcl-2 family
598	Apoptosis regulator proteins, Bcl-2 family
598	Apoptosis regulator proteins, Bcl-2 family
599	Apoptosis regulator proteins, Bcl-2 family
604	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
604	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
610	Cyclic nucleotide-binding domain
610	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
611	7 transmembrane receptor (rhodopsin family)
613	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
613	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
613	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
613	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
622	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
623	7 transmembrane receptor (rhodopsin family)
624	7 transmembrane receptor (rhodopsin family)
627	Nerve growth factor family
627	Nerve growth factor family
627	Nerve growth factor family
627	Nerve growth factor family
627	Nerve growth factor family
627	Nerve growth factor family
629	von Willebrand factor type A domain
631	Intermediate filament protein
632	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
635	Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13,
640	SH2 domain
640	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
642	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
643	7 transmembrane receptor (rhodopsin family)
643	7 transmembrane receptor (rhodopsin family)
644	Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilise NADP or NAD
648	Zinc finger, C3HC4 type (RING finger)
649	CUB domain
649	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
649	CUB domain
649	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
649	CUB domain
649	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
649	CUB domain
649	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
649	CUB domain
649	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
649	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
650	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
652	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
652	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
652	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
653	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
654	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
655	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
656	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
657	Protein kinase domain
657	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
658	Protein kinase domain
658	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
659	Protein kinase domain
659	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
659	Protein kinase domain
659	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
662	Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway
662	Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway
662	Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway
662	Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway
666	Apoptosis regulator proteins, Bcl-2 family
667	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
667	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
667	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
671	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
672	Zinc finger, C3HC4 type (RING finger)
672	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
672	Zinc finger, C3HC4 type (RING finger)
672	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
672	Zinc finger, C3HC4 type (RING finger)
672	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
672	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
672	Zinc finger, C3HC4 type (RING finger)
672	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
672	Zinc finger, C3HC4 type (RING finger)
672	Zinc finger, C3HC4 type (RING finger)
672	Zinc finger, C3HC4 type (RING finger)
672	Zinc finger, C3HC4 type (RING finger)
672	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
672	Zinc finger, C3HC4 type (RING finger)
672	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
672	Zinc finger, C3HC4 type (RING finger)
672	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
672	Zinc finger, C3HC4 type (RING finger)
672	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
673	Protein kinase domain
673	Raf-like Ras-binding domain
673	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
675	BRCA2 repeat. The alignment covers only the most conserved region of the repeat
680	7 transmembrane receptor (rhodopsin family)
683	ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion
686	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
688	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
694	BTG1 family. A novel family of anti-proliferative proteins
695	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
696	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
706	TspO/MBR family. Tryptophan-rich sensory protein (TspO) is an integral membrane protein that acts as a negative regulator of the expression of specific photosynthesis genes in response to oxygen/light. It is involved in the efflux of porphyrin intermedia
708	Mitochondrial glycoprotein. This mitochondrial matrix protein family contains members of the MAM33 family which bind to the globular 'heads' of C1Q. It is thought to be involved in mitochondrial oxidative phosphorylation and in nucleus-mitochondrion inte
710	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
712	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
712	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
713	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
715	CUB domain
715	Sushi domain (SCR repeat)
716	CUB domain
716	Sushi domain (SCR repeat)
717	Sushi domain (SCR repeat)
717	von Willebrand factor type A domain
718	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
718	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
718	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f
718	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
719	7 transmembrane receptor (rhodopsin family)
720	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
720	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
720	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f
720	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
721	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
721	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
721	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f
721	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
722	Sushi domain (SCR repeat)
725	Sushi domain (SCR repeat)
726	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
727	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
727	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
727	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f
727	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
728	7 transmembrane receptor (rhodopsin family)
729	Sushi domain (SCR repeat)
729	Thrombospondin type 1 domain
729	Low-density lipoprotein receptor domain class A
729	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
730	Sushi domain (SCR repeat)
730	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
731	Low-density lipoprotein receptor domain class A
731	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
732	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
733	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
735	Low-density lipoprotein receptor domain class A
735	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
738	Dor1-like family. Dor1 is involved in vesicle targeting to the yeast Golgi apparatus and complexes with a number of other trafficking proteins, which include Sec34 and Sec35
744	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte
747	Lipase (class 3)
759	Eukaryotic-type carbonic anhydrase
760	Eukaryotic-type carbonic anhydrase
761	Eukaryotic-type carbonic anhydrase
762	Eukaryotic-type carbonic anhydrase
763	Eukaryotic-type carbonic anhydrase
765	Eukaryotic-type carbonic anhydrase
766	Eukaryotic-type carbonic anhydrase
767	Eukaryotic-type carbonic anhydrase
768	Eukaryotic-type carbonic anhydrase
770	Eukaryotic-type carbonic anhydrase
771	Eukaryotic-type carbonic anhydrase
773	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
773	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
774	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
775	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
776	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
777	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
778	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
779	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
780	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
780	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
780	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
781	Cache domain
782	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
783	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
784	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
785	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
786	PMP-22/EMP/MP20/Claudin family
788	Mitochondrial carrier protein
790	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
790	MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also
790	Carbamoyl-phosphate synthetase large chain, oligomerisation domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. The carbamoyl-phosphate synthase (CPS) enzyme in proka
795	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
799	7 transmembrane receptor (Secretin family)
799	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
800	Caldesmon
800	Caldesmon
800	Caldesmon
800	Caldesmon
800	Caldesmon
811	Calreticulin family
820	Cathelicidin. A novel protein family, showing a conserved proregion and a variable carboxyl-terminal antimicrobial domain. This region shows similarity to cystatins
821	Calreticulin family
823	Calpain family cysteine protease
823	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
824	Calpain family cysteine protease
824	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
825	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
825	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
825	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
825	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
827	C2 domain
827	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
829	F-actin capping protein alpha subunit
830	F-actin capping protein alpha subunit
831	Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins
831	Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins
831	Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins
831	Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins
832	F-actin capping protein, beta subunit
834	ICE-like protease (caspase) p10 domain
834	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
834	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
834	ICE-like protease (caspase) p10 domain
834	ICE-like protease (caspase) p10 domain
835	ICE-like protease (caspase) p20 domain
835	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
835	ICE-like protease (caspase) p10 domain
835	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
835	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
835	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
837	ICE-like protease (caspase) p10 domain
837	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
837	ICE-like protease (caspase) p10 domain
837	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
837	ICE-like protease (caspase) p10 domain
838	ICE-like protease (caspase) p10 domain
838	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
839	ICE-like protease (caspase) p10 domain
840	ICE-like protease (caspase) p10 domain
840	ICE-like protease (caspase) p20 domain
841	Death effector domain
841	Death effector domain
841	Death effector domain
841	Death effector domain
841	Death effector domain
842	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
842	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
843	Death effector domain
843	ICE-like protease (caspase) p10 domain
843	Death effector domain
843	ICE-like protease (caspase) p10 domain
843	Death effector domain
843	Death effector domain
843	ICE-like protease (caspase) p10 domain
844	Calsequestrin
845	Calsequestrin
846	7 transmembrane receptor (metabotropic glutamate family)
846	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
847	Catalase
857	Caveolin
858	Caveolin
859	Caveolin
859	Caveolin
860	Runt domain
861	Runt domain
862	MYND finger
862	MYND finger
862	MYND finger
862	MYND finger
863	MYND finger
863	MYND finger
864	Runt domain
865	Core binding factor beta subunit. Core binding factor (CBF) is a heterodimeric transcription factor essential for genetic regulation of hematopoiesis and osteogenesis. The beta subunit enhances DNA-binding ability of the alpha subunit in vitro, and has b
865	Core binding factor beta subunit. Core binding factor (CBF) is a heterodimeric transcription factor essential for genetic regulation of hematopoiesis and osteogenesis. The beta subunit enhances DNA-binding ability of the alpha subunit in vitro, and has b
866	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
867	CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and
867	CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily co
867	CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conser
868	CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and
868	CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily co
868	CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conser
871	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
873	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
874	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
881	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
885	Gastrin/cholecystokinin family
886	7 transmembrane receptor (rhodopsin family)
887	7 transmembrane receptor (rhodopsin family)
890	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
894	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
896	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
898	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termi
898	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
898	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termi
898	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
899	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
900	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
901	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
908	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
909	Class I Histocompatibility antigen, domains alpha 1 and 2
921	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
928	Tetraspanin family
931	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
931	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
931	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
932	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
944	TNF(Tumor Necrosis Factor) family
948	CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion
949	CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion
950	CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion
951	Tetraspanin family
952	ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion
953	GDA1/CD39 (nucleoside phosphatase) family
954	GDA1/CD39 (nucleoside phosphatase) family
955	GDA1/CD39 (nucleoside phosphatase) family
956	GDA1/CD39 (nucleoside phosphatase) family
957	GDA1/CD39 (nucleoside phosphatase) family
959	TNF(Tumor Necrosis Factor) family
963	Tetraspanin family
966	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
967	Tetraspanin family
968	Lysosome-associated membrane glycoprotein (Lamp)
970	TNF(Tumor Necrosis Factor) family
975	Tetraspanin family
976	7 transmembrane receptor (Secretin family)
976	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
976	7 transmembrane receptor (Secretin family)
976	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
977	Tetraspanin family
977	Tetraspanin family
987	Beige/BEACH domain
989	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
999	Cadherin domain
999	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1000	Cadherin domain
1000	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1001	Cadherin domain
1001	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1002	Cadherin domain
1002	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1003	Cadherin domain
1003	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1004	Cadherin domain
1004	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1005	Cadherin domain
1005	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1005	Cadherin domain
1005	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1006	Cadherin domain
1006	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1007	Cadherin domain
1007	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1008	Cadherin domain
1008	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1009	Cadherin domain
1009	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1009	Cadherin domain
1010	Cadherin domain
1010	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1012	Cadherin domain
1013	Cadherin domain
1013	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1014	Cadherin domain
1015	Cadherin domain
1016	Cadherin domain
1016	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1020	Eukaryotic protein kinase domain
1026	Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase
1026	Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase
1027	Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase
1028	Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase
1040	Phosphatidate cytidylyltransferase
1044	Homeobox domain
1045	Homeobox domain
1046	Homeobox domain
1047	Calreticulin family
1056	Carboxylesterase
1056	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
1059	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
1066	Carboxylesterase
1075	Papain family cysteine protease
1081	Glycoprotein hormone
1103	Choline/Carnitine o-acyltransferase
1103	Choline/Carnitine o-acyltransferase
1103	Choline/Carnitine o-acyltransferase
1103	Choline/Carnitine o-acyltransferase
1105	'chromo' (CHRromatin Organization MOdifier) domain
1105	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
1106	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
1108	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
1109	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
1113	Granin (chromogranin or secretogranin)
1114	Granin (chromogranin or secretogranin)
1119	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
1120	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
1121	GDP dissociation inhibitor
1122	GDP dissociation inhibitor
1123	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
1123	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
1124	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
1124	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
1128	7 transmembrane receptor (rhodopsin family)
1129	7 transmembrane receptor (rhodopsin family)
1130	Perilipin family
1130	Beige/BEACH domain
1130	WD domain, G-beta repeat
1131	7 transmembrane receptor (rhodopsin family)
1132	7 transmembrane receptor (rhodopsin family)
1133	7 transmembrane receptor (rhodopsin family)
1134	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1134	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1135	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1135	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1136	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1136	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1137	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1137	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1138	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1138	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1139	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1139	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1140	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1140	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1141	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1141	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1142	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1142	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1143	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1143	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1144	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1144	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1145	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1145	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1146	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
1146	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
1149	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
1152	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
1152	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
1155	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
1158	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
1158	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
1159	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
1159	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
1160	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
1160	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
1163	Cyclin-dependent kinase regulatory subunit
1164	Cyclin-dependent kinase regulatory subunit
1173	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
1174	Clathrin adaptor complex small chain
1174	Clathrin adaptor complex small chain
1175	Clathrin adaptor complex small chain
1175	Clathrin adaptor complex small chain
1178	Galactoside-binding lectin
1180	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
1181	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
1182	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
1183	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
1184	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
1185	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
1185	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
1185	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
1185	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
1186	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
1186	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate de
1187	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
1188	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
1191	Clusterin
1201	CLN3 protein. This is a family of proteins from the CLN3 gene. A missense mutation of glutamic acid (E) to lysine (K) at position 295 in the human protein has been implicated in Juvenile neuronal ceroid lipofuscinosis (Batten disease)
1207	Nucleotide-sensitive chloride conductance regulator (ICln)
1208	Colipase, C-terminal domain. SCOP reports duplication of common fold with Colipase N-terminal domain
1211	Clathrin light chain
1211	Clathrin light chain
1212	Clathrin light chain
1212	Clathrin light chain
1213	Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain
1213	Clathrin propeller repeat. Clathrin is the scaffold protein of the basket-like coat that surrounds coated vesicles. The soluble assembly unit, a triskelion, contains three heavy chains and three light chains in an extended three-legged structure. Each le
1230	7 transmembrane receptor (rhodopsin family)
1231	7 transmembrane receptor (rhodopsin family)
1231	7 transmembrane receptor (rhodopsin family)
1232	7 transmembrane receptor (rhodopsin family)
1232	7 transmembrane receptor (rhodopsin family)
1233	7 transmembrane receptor (rhodopsin family)
1234	7 transmembrane receptor (rhodopsin family)
1235	7 transmembrane receptor (rhodopsin family)
1235	7 transmembrane receptor (rhodopsin family)
1236	7 transmembrane receptor (rhodopsin family)
1237	7 transmembrane receptor (rhodopsin family)
1238	7 transmembrane receptor (rhodopsin family)
1240	7 transmembrane receptor (rhodopsin family)
1241	7 transmembrane receptor (rhodopsin family)
1258	Cyclic nucleotide-binding domain
1258	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
1259	Cyclic nucleotide-binding domain
1259	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
1261	Cyclic nucleotide-binding domain
1261	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
1262	Cyclic nucleotide-binding domain
1262	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
1263	POLO box duplicated region
1264	Calponin family repeat
1265	Calponin family repeat
1266	Calponin family repeat
1268	7 transmembrane receptor (rhodopsin family)
1268	7 transmembrane receptor (rhodopsin family)
1268	7 transmembrane receptor (rhodopsin family)
1269	7 transmembrane receptor (rhodopsin family)
1270	Ciliary neurotrophic factor
1272	Fibronectin type III domain
1272	Fibronectin type III domain
1277	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
1277	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
1278	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1280	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
1281	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
1282	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1284	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1285	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1285	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1285	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1285	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1285	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1286	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1287	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1287	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1287	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1288	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1288	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
1290	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
1291	von Willebrand factor type A domain
1291	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1292	von Willebrand factor type A domain
1292	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1292	von Willebrand factor type A domain
1292	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1292	von Willebrand factor type A domain
1292	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1293	von Willebrand factor type A domain
1293	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1293	von Willebrand factor type A domain
1293	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1293	von Willebrand factor type A domain
1293	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1293	von Willebrand factor type A domain
1293	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1293	von Willebrand factor type A domain
1293	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1294	Fibronectin type III domain
1294	von Willebrand factor type A domain
1294	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1295	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
1295	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
1296	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
1296	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
1297	Thrombospondin N-terminal -like domain
1297	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1297	Thrombospondin N-terminal -like domain
1297	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1300	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
1301	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
1301	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1301	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
1301	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1301	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
1301	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1303	Fibronectin type III domain
1303	von Willebrand factor type A domain
1303	Thrombospondin N-terminal -like domain
1303	Fibronectin type III domain
1303	von Willebrand factor type A domain
1303	Thrombospondin N-terminal -like domain
1306	Thrombospondin N-terminal -like domain
1307	Thrombospondin N-terminal -like domain
1310	Thrombospondin N-terminal -like domain
1310	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
1314	Coatomer WD associated domain
1315	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
1317	Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hi
1318	Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hi
1325	Somatostatin/Cortistatin family. Members of this family are hormones. Somatostatin inhibits the release of somatotropin. Cortistatin is a peptide that is related to the Somatostatins that is found to depresses neuronal electrical activity but, unlike som
1327	pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium
1329	Cytochrome c oxidase subunit Vb
1337	Cytochrome c oxidase subunit VIa
1339	Cytochrome c oxidase subunit VIa
1340	Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase
1345	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
1346	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
1347	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
1350	pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast me
1351	Cytochrome oxidase c subunit VIII. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIII
1356	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
1357	Zinc carboxypeptidase
1358	Zinc carboxypeptidase
1358	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
1359	Zinc carboxypeptidase
1359	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
1360	Zinc carboxypeptidase
1360	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
1361	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
1361	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
1363	Zinc carboxypeptidase
1364	PMP-22/EMP/MP20/Claudin family
1365	PMP-22/EMP/MP20/Claudin family
1366	PMP-22/EMP/MP20/Claudin family
1368	Zinc carboxypeptidase
1369	Zinc carboxypeptidase
1371	Coproporphyrinogen III oxidase
1373	MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also
1374	Choline/Carnitine o-acyltransferase
1375	Choline/Carnitine o-acyltransferase
1375	Choline/Carnitine o-acyltransferase
1375	Choline/Carnitine o-acyltransferase
1375	Choline/Carnitine o-acyltransferase
1376	Choline/Carnitine o-acyltransferase
1378	Sushi domain (SCR repeat)
1378	Sushi domain (SCR repeat)
1379	Sushi domain (SCR repeat)
1380	Sushi domain (SCR repeat)
1381	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
1382	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
1384	Choline/Carnitine o-acyltransferase
1384	Choline/Carnitine o-acyltransferase
1384	Choline/Carnitine o-acyltransferase
1385	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
1385	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
1385	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
1385	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
1387	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1387	TAZ zinc finger. The TAZ2 domain of CBP binds to other transcription factors such as the p53 tumor suppressor protein, E1A oncoprotein, MyoD, and GATA-1
1387	KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun
1390	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
1392	Corticotropin-releasing factor family
1394	7 transmembrane receptor (Secretin family)
1395	7 transmembrane receptor (Secretin family)
1396	LIM domain. This family represents two copies of the LIM structural domain
1397	LIM domain. This family represents two copies of the LIM structural domain
1398	SH2 domain
1398	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
1398	SH2 domain
1398	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
1399	SH2 domain
1399	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
1400	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
1401	Pentaxin family. Pentaxins are also known as pentraxins
1404	Extracellular link domain
1406	Homeobox domain
1407	FAD binding domain of DNA photolyase
1408	FAD binding domain of DNA photolyase
1409	Hsp20/alpha crystallin family
1409	Alpha crystallin A chain, N terminal
1410	Hsp20/alpha crystallin family
1411	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1412	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1412	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1412	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1413	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1414	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1415	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1417	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1418	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1419	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1420	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1421	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1427	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
1428	Ornithine cyclodeaminase/mu-crystallin family. This family contains the bacterial Ornithine cyclodeaminase enzyme EC:4.3.1.12, which catalyses the deamination of ornithine to proline. This family also contains mu-Crystallin the major component of the eye
1431	Citrate synthase
1434	CAS/CSE protein, C-terminus. Mammalian cellular apoptosis susceptibility (CAS) proteins are homologous to the yeast chromosome-segregation protein, CSE1. This family aligns the C-terminal halves (approximately). CAS is involved in both cellular apoptosis
1437	Granulocyte-macrophage colony-stimulating factor
1439	Fibronectin type III domain
1440	Interleukin-6/G-CSF/MGF family
1440	Interleukin-6/G-CSF/MGF family
1440	Interleukin-6/G-CSF/MGF family
1441	Fibronectin type III domain
1441	Fibronectin type III domain
1441	Fibronectin type III domain
1441	Fibronectin type III domain
1442	Somatotropin hormone family
1442	Somatotropin hormone family
1442	Somatotropin hormone family
1443	Somatotropin hormone family
1443	Somatotropin hormone family
1443	Somatotropin hormone family
1444	Somatotropin hormone family
1444	Somatotropin hormone family
1444	Somatotropin hormone family
1448	Kappa casein. Kappa-casein is a mammalian milk protein involved in a number of important physiological processes. In the gut, the ingested protein is split into an insoluble peptide (para kappa-casein) and a soluble hydrophilic glycopeptide (caseinomacro
1452	Protein kinase domain
1453	Protein kinase domain
1453	Protein kinase domain
1454	Protein kinase domain
1454	Protein kinase domain
1455	Protein kinase domain
1456	Protein kinase domain
1457	Protein kinase domain
1457	Protein kinase domain
1459	Protein kinase domain
1460	Casein kinase II regulatory subunit
1462	Extracellular link domain
1462	Sushi domain (SCR repeat)
1462	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
1463	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
1465	LIM domain. This family represents two copies of the LIM structural domain
1466	LIM domain. This family represents two copies of the LIM structural domain
1468	Mitochondrial carrier protein
1475	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
1476	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
1478	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
1482	Homeobox domain
1490	Thrombospondin type 1 domain
1491	Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OA
1491	Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OA
1495	Vinculin family
1496	Vinculin family
1497	PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How
1499	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
1504	Trypsin
1506	Trypsin
1508	Papain family cysteine protease
1508	Papain family cysteine protease
1508	Papain family cysteine protease
1508	Papain family cysteine protease
1508	Papain family cysteine protease
1509	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
1510	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
1510	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
1512	Papain family cysteine protease
1512	Papain family cysteine protease
1515	Papain family cysteine protease
1522	Papain family cysteine protease
1523	Homeobox domain
1523	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
1524	7 transmembrane receptor (rhodopsin family)
1528	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
1534	Cytochrome b561
1536	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
1537	Cytochrome C1 family
1540	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
1543	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1544	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1545	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1548	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1549	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1549	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1551	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1553	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1555	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1557	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1558	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1558	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1559	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1562	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1565	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1571	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1572	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1573	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1576	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1577	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1579	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1580	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1581	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1582	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1583	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1584	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1585	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1586	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1588	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1588	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1589	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1591	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1592	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1592	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1593	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1594	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1595	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
1600	Phosphotyrosine interaction domain (PTB/PID)
1601	Phosphotyrosine interaction domain (PTB/PID)
1602	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
1602	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
1602	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
1603	DAD family. Members of this family are thought to be integral membrane proteins. Some members of this family have been shown to cause apoptosis if mutated, these proteins are known as DAD for defender against death. The family also includes the epsilon s
1604	Sushi domain (SCR repeat)
1606	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
1606	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
1607	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
1607	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
1607	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
1607	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
1608	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
1608	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
1609	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
1609	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
1610	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
1616	Daxx Family. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis, but precise role played by Daxx remians to be determined
1617	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
1621	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
1621	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
1622	Acyl CoA binding protein
1627	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
1627	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
1630	Fibronectin type III domain
1632	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
1633	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
1634	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
1634	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
1634	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
1636	Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid
1636	Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid
1636	Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid
1638	Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372
1639	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
1642	CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p
1644	Pyridoxal-dependent decarboxylase conserved domain
1645	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
1646	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
1647	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
1650	Dolichyl-diphosphooligosaccharide-protein glycosyltransferase 48kD subunit. Members of this family are involved in asparagine-linked protein glycosylation. In particular, dolichyl-diphosphooligosaccharide-protein glycosyltransferase (DDOST), also known a
1652	Macrophage migration inhibitory factor (MIF)
1667	Mammalian defensin
1667	Defensin propeptide
1668	Mammalian defensin
1668	Defensin propeptide
1669	Mammalian defensin
1669	Defensin propeptide
1670	Mammalian defensin
1670	Defensin propeptide
1671	Mammalian defensin
1671	Defensin propeptide
1674	Intermediate filament protein
1676	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
1677	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
1678	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
1690	LCCL domain
1690	von Willebrand factor type A domain
1716	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
1716	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
1717	Ergosterol biosynthesis ERG4/ERG24 family
1718	FAD binding domain. This family consists of various enzymes that use FAD as a co-factor, most of the enzymes are similar to oxygen oxidoreductase. One of the enzymes Vanillyl-alcohol oxidase (VAO) has a solved structure, the alignment includes the FAD bi
1719	Dihydrofolate reductase
1723	Dihydroorotate dehydrogenase
1725	Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine
1725	Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine
1725	Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine
1727	Oxidoreductase FAD-binding domain
1727	Oxidoreductase FAD-binding domain
1728	Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones
1731	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
1733	Iodothyronine deiodinase
1734	Iodothyronine deiodinase
1734	Iodothyronine deiodinase
1735	Iodothyronine deiodinase
1736	PUA domain
1736	TruB family pseudouridylate synthase (N terminal domain)
1737	2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain
1743	2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain
1743	Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins
1745	Homeobox domain
1746	Homeobox domain
1747	Homeobox domain
1748	Homeobox domain
1748	Homeobox domain
1749	Homeobox domain
1750	Homeobox domain
1755	CUB domain
1755	Zona pellucida-like domain
1755	Zona pellucida-like domain
1755	Zona pellucida-like domain
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1756	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
1756	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1759	Dynamin GTPase effector domain
1759	PH domain. PH stands for pleckstrin homology
1759	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
1762	WD domain, G-beta repeat
1767	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
1769	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
1770	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
1770	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
1777	Deoxyribonuclease II
1778	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
1785	Dynamin family
1785	Dynamin GTPase effector domain
1785	PH domain. PH stands for pleckstrin homology
1786	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
1786	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
1788	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
1788	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
1788	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
1789	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
1789	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
1789	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
1789	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
1791	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
1794	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
1796	PTB domain (IRS-1 type)
1798	Glycosyl transferase
1800	Renal dipeptidase
1801	Putative diphthamide synthesis protein. One member is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein s
1802	Putative diphthamide synthesis protein. One member is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein s
1803	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
1804	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
1804	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
1808	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
1809	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
1811	Sulfate transporter family. Mutations may lead to several human diseases
1811	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
1812	7 transmembrane receptor (rhodopsin family)
1813	7 transmembrane receptor (rhodopsin family)
1813	7 transmembrane receptor (rhodopsin family)
1814	7 transmembrane receptor (rhodopsin family)
1814	7 transmembrane receptor (rhodopsin family)
1814	7 transmembrane receptor (rhodopsin family)
1814	7 transmembrane receptor (rhodopsin family)
1814	7 transmembrane receptor (rhodopsin family)
1815	7 transmembrane receptor (rhodopsin family)
1816	7 transmembrane receptor (rhodopsin family)
1820	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
1821	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
1822	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
1823	Cadherin domain
1823	Cadherin domain
1824	Cadherin domain
1824	Cadherin domain
1824	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1825	Cadherin domain
1825	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1825	Cadherin domain
1826	Fibronectin type III domain
1828	Cadherin domain
1828	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
1829	Cadherin domain
1830	Cadherin domain
1831	TSC-22/dip/bun family
1832	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
1833	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
1836	Sulfate transporter family. Mutations may lead to several human diseases
1840	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
1841	Thymidylate kinase
1842	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
1843	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
1844	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
1844	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
1845	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
1846	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
1846	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
1847	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
1848	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
1848	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
1849	Protein-tyrosine phosphatase
1849	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
1849	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
1850	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
1852	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
1854	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
1855	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
1855	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
1855	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The
1855	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
1855	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
1855	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The
1855	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
1855	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
1855	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The D
1856	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
1856	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
1856	Dishevelled specific domain. This domain is specific to the signaling protein dishevelled. The domain is found adjacent to the PDZ domain pfam00595, often in conjunction with DEP (pfam00610) and DIX (pfam00778)
1856	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The
1857	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
1857	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
1857	Dishevelled specific domain. This domain is specific to the signaling protein dishevelled. The domain is found adjacent to the PDZ domain pfam00595, often in conjunction with DEP (pfam00610) and DIX (pfam00778)
1857	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The
1869	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
1870	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
1871	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
1874	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
1875	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
1876	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
1879	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
1880	7 transmembrane receptor (rhodopsin family)
1894	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
1901	7 transmembrane receptor (rhodopsin family)
1902	7 transmembrane receptor (rhodopsin family)
1902	7 transmembrane receptor (rhodopsin family)
1903	7 transmembrane receptor (rhodopsin family)
1906	Endothelin family
1907	Endothelin family
1908	Endothelin family
1909	7 transmembrane receptor (rhodopsin family)
1910	7 transmembrane receptor (rhodopsin family)
1910	7 transmembrane receptor (rhodopsin family)
1911	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
1912	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
1937	Elongation factor 1 gamma, conserved domain
1942	Ephrin
1942	Ephrin
1943	Ephrin
1944	Ephrin
1945	Ephrin
1945	Ephrin
1945	Ephrin
1946	Ephrin
1947	Ephrin
1948	Ephrin
1949	Ephrin
1951	7 transmembrane receptor (Secretin family)
1951	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
1951	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
1951	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
1952	7 transmembrane receptor (Secretin family)
1952	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
1952	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
1956	Furin-like cysteine rich region
1956	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
1962	3-hydroxyacyl-CoA dehydrogenase, NAD binding domain. This family also includes lambda crystallin
1962	3-hydroxyacyl-CoA dehydrogenase, C-terminal domain. This family also includes lambda crystallin. Some proteins include two copies of this domain
1962	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
1964	Eukaryotic initiation factor 1A
1967	Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac
1969	Fibronectin type III domain
1969	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
1977	Eukaryotic initiation factor 4E
1982	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA
1983	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
1991	Trypsin
1992	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
2002	Ets-domain
2004	Ets-domain
2005	Ets-domain
2005	Ets-domain
2009	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes
2010	LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1
2011	Kinase associated domain 1
2011	Kinase associated domain 1
2012	PMP-22/EMP/MP20/Claudin family
2013	PMP-22/EMP/MP20/Claudin family
2014	PMP-22/EMP/MP20/Claudin family
2015	7 transmembrane receptor (Secretin family)
2015	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
2017	Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018
2017	Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018
2018	Homeobox domain
2019	Homeobox domain
2020	Homeobox domain
2023	Enolase, C-terminal TIM barrel domain
2030	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their
2033	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
2033	TAZ zinc finger. The TAZ2 domain of CBP binds to other transcription factors such as the p53 tumor suppressor protein, E1A oncoprotein, MyoD, and GATA-1
2033	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
2033	KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun
2035	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
2036	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
2036	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
2037	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
2038	Transglutaminase family
2038	Transglutaminase family, C-terminal ig like domain
2038	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
2040	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
2041	Fibronectin type III domain
2041	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2041	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
2042	Fibronectin type III domain
2042	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2042	Fibronectin type III domain
2042	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2043	Fibronectin type III domain
2043	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2044	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2044	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2045	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2046	Fibronectin type III domain
2046	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2047	Fibronectin type III domain
2047	Giardia variant-specific surface protein
2047	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2048	Fibronectin type III domain
2048	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2048	Fibronectin type III domain
2048	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2049	Fibronectin type III domain
2049	Giardia variant-specific surface protein
2049	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2050	Fibronectin type III domain
2050	Giardia variant-specific surface protein
2050	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2051	Fibronectin type III domain
2051	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
2051	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
2052	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
2053	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
2056	Erythropoietin/thrombopoietin
2058	WHEP-TRS domain
2058	tRNA synthetases class I (E and Q), catalytic domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase aminoacyl
2063	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
2063	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
2064	Furin-like cysteine rich region
2064	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
2065	Furin-like cysteine rich region
2065	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
2066	Furin-like cysteine rich region
2066	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
2070	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2070	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2070	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2070	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2073	XPG I-region
2073	XPG N-terminal domain
2074	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
2077	Ets-domain
2078	Ets-domain
2078	Sterile alpha motif (SAM)/Pointed domain
2079	Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene
2091	Fibrillarin
2098	Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family
2099	Oestrogen receptor
2099	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
2099	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
2100	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
2100	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
2101	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
2103	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
2103	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
2104	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
2104	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
2109	Electron transfer flavoprotein beta subunit. This protein is distantly related to and forms a heterodimer with pfam00766
2113	Ets-domain
2113	Sterile alpha motif (SAM)/Pointed domain
2114	Ets-domain
2115	Ets-domain
2117	Ets-domain
2118	Ets-domain
2119	Ets-domain
2120	Ets-domain
2122	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
2128	Homeobox domain
2131	Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo
2132	Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo
2134	Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo
2137	Uncharacterized ACR, COG1579
2137	Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo
2138	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2138	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2138	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2138	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2139	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2139	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2139	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2139	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2139	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2140	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
2145	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
2146	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
2146	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
2147	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
2149	7 transmembrane receptor (rhodopsin family)
2150	7 transmembrane receptor (rhodopsin family)
2151	7 transmembrane receptor (rhodopsin family)
2152	Tissue factor
2153	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
2153	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
2155	Trypsin
2155	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
2155	Trypsin
2155	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
2157	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
2157	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
2157	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
2158	Trypsin
2158	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
2159	Trypsin
2159	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
2159	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
2160	Trypsin
2161	Trypsin
2161	Fibronectin type I domain
2161	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
2161	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
2162	Transglutaminase family
2162	Transglutaminase family, C-terminal ig like domain
2162	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
2165	Sushi domain (SCR repeat)
2166	Amidase
2167	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
2168	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
2169	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
2170	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
2171	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
2172	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
2173	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
2175	Fanconi anaemia group A protein
2176	Fanconi anaemia group C protein
2184	Fumarylacetoacetate (FAA) hydrolase family. This family consists of fumarylacetoacetate (FAA) hydrolase, or fumarylacetoacetate hydrolase (FAH) and it also includes HHDD isomerase/OPET decarboxylase from E. coli strain W. FAA is the last enzyme in the ty
2185	Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta
2185	Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta
2185	Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta
2185	Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta
2186	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
2186	DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors)
2186	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
2186	DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors)
2186	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
2186	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
2191	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
2194	Zinc-binding dehydrogenase
2194	Thioesterase domain. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid th
2195	Cadherin domain
2195	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
2196	Cadherin domain
2196	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
2196	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
2200	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
2201	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
2203	Fructose-1-6-bisphosphatase
2206	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
2208	Lectin C-type domain. This family includes both long and short form C-type
2217	Class I Histocompatibility antigen, domains alpha 1 and 2
2222	Squalene/phytoene synthase
2224	Polyprenyl synthetase
2235	Ferrochelatase
2239	Glypican
2241	SH2 domain
2241	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
2242	SH2 domain
2242	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
2244	Fibrinogen beta and gamma chains, C-terminal globular domain
2245	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
2245	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
2246	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2246	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2246	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2247	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2248	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2249	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2250	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2250	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2251	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2252	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2253	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2253	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2253	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2253	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2254	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
2262	Glypican
2266	Fibrinogen beta and gamma chains, C-terminal globular domain
2266	Fibrinogen beta and gamma chains, C-terminal globular domain
2268	SH2 domain
2268	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
2272	HIT family
2274	LIM domain. This family represents two copies of the LIM structural domain
2275	LIM domain. This family represents two copies of the LIM structural domain
2281	FKBP-type peptidyl-prolyl cis-trans isomerase
2286	FKBP-type peptidyl-prolyl cis-trans isomerase
2286	FKBP-type peptidyl-prolyl cis-trans isomerase
2287	FKBP-type peptidyl-prolyl cis-trans isomerase
2288	FKBP-type peptidyl-prolyl cis-trans isomerase
2289	FKBP-type peptidyl-prolyl cis-trans isomerase
2290	Fork head domain
2296	Fork head domain
2297	Fork head domain
2301	Fork head domain
2302	Fork head domain
2303	Fork head domain
2306	Fork head domain
2308	Fork head domain
2313	Ets-domain
2316	Filamin/ABP280 repeat
2317	Filamin/ABP280 repeat
2318	Filamin/ABP280 repeat
2319	Flotillin family. Flotillins are integral membrane proteins that have been shown to be present in several subcellular components, including caveolae (invaginated plasma membrane microdomains), lipid rafts (sphingolipid and cholesterol-rich, detergent-res
2322	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
2323	pfam02947, flt3_lig, flt3 ligand. The flt3 ligand is a short chain cytokine with a 4 helical bundle fold
2326	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
2327	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
2328	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
2329	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
2330	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
2335	Fibronectin type I domain
2335	Fibronectin type III domain
2335	Fibronectin type I domain
2346	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
2346	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
2348	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
2348	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
2348	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
2348	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
2348	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
2348	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
2350	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
2352	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
2357	7 transmembrane receptor (rhodopsin family)
2358	7 transmembrane receptor (rhodopsin family)
2359	7 transmembrane receptor (rhodopsin family)
2395	Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain
2395	Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain
2475	Phosphatidylinositol 3- and 4-kinase
2475	FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal
2487	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
2492	7 transmembrane receptor (rhodopsin family)
2492	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
2492	7 transmembrane receptor (rhodopsin family)
2492	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
2494	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
2494	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
2495	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
2512	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
2515	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
2515	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
2516	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
2516	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
2517	Alpha-L-fucosidase
2519	Alpha-L-fucosidase
2520	Gastrin/cholecystokinin family
2521	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
2523	Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes
2524	Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes
2525	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
2526	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
2527	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
2528	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
2529	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
2531	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
2534	SH2 domain
2534	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
2534	SH2 domain
2534	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
2534	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
2535	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
2547	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
2548	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
2550	7 transmembrane receptor (metabotropic glutamate family)
2550	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2550	7 transmembrane receptor (metabotropic glutamate family)
2550	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2550	7 transmembrane receptor (metabotropic glutamate family)
2550	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2550	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2551	Ets-domain
2551	Sterile alpha motif (SAM)/Pointed domain
2554	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2554	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2555	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2555	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2556	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2556	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2557	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2557	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2558	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2558	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2559	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2559	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2560	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2560	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2561	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2561	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2561	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2561	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2562	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2562	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2562	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2562	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2563	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2563	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2564	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2564	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2564	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2564	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2564	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2564	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2564	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2564	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2565	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2565	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2566	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2566	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2567	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2567	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2568	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2568	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2569	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2569	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2570	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2570	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2571	Pyridoxal-dependent decarboxylase conserved domain
2571	Pyridoxal-dependent decarboxylase conserved domain
2572	Pyridoxal-dependent decarboxylase conserved domain
2580	Protein-tyrosine phosphatase
2581	Glycosyl hydrolase family 59
2583	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
2587	7 transmembrane receptor (rhodopsin family)
2588	Sulfatase
2589	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
2590	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
2591	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
2592	Galactose-1-phosphate uridyl transferase, N-terminal domain. SCOP reports fold duplication with C-terminal domain. Both involved in Zn and Fe binding
2592	Galactose-1-phosphate uridyl transferase, N-terminal domain. SCOP reports fold duplication with C-terminal domain. Both involved in Zn and Fe binding
2617	WHEP-TRS domain
2624	GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the d
2629	O-Glycosyl hydrolase family 30
2631	ssDNA binding protein
2633	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
2633	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
2634	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
2634	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
2635	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
2635	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
2637	Homeobox domain
2641	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
2642	7 transmembrane receptor (Secretin family)
2642	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
2645	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
2645	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
2645	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
2649	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
2649	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
2649	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
2649	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
2649	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
2649	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
2650	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
2651	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
2651	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
2651	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
2652	7 transmembrane receptor (rhodopsin family)
2653	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl grou
2658	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
2660	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
2664	GDP dissociation inhibitor
2665	GDP dissociation inhibitor
2668	Transforming growth factor beta like domain
2670	Intermediate filament protein
2674	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
2674	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
2675	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
2676	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
2678	Gamma-glutamyltranspeptidase
2678	Gamma-glutamyltranspeptidase
2678	Gamma-glutamyltranspeptidase
2679	Gamma-glutamyltranspeptidase
2683	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
2687	Gamma-glutamyltranspeptidase
2688	Somatotropin hormone family
2688	Somatotropin hormone family
2688	Somatotropin hormone family
2688	Somatotropin hormone family
2689	Somatotropin hormone family
2689	Somatotropin hormone family
2689	Somatotropin hormone family
2689	Somatotropin hormone family
2690	Fibronectin type III domain
2691	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
2692	7 transmembrane receptor (Secretin family)
2693	7 transmembrane receptor (rhodopsin family)
2694	Eukaryotic cobalamin-binding protein
2695	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
2696	7 transmembrane receptor (Secretin family)
2697	Connexin
2697	Gap junction alpha-1 protein (Cx43)
2700	Connexin
2701	Connexin
2702	Connexin
2702	Connexin
2703	Connexin
2703	Gap junction alpha-8 protein (Cx50)
2705	Connexin
2706	Connexin
2707	Connexin
2709	Connexin
2717	Melibiase
2719	Glypican
2720	Glycosyl hydrolases family 35
2729	Glutamate-cysteine ligase. This family represents the catalytic subunit of glutamate-cysteine ligase (E.C. 6.3.2.2), also known as gamma-glutamylcysteine synthetase (GCS). This enzyme catalyses the rate limiting step in the biosynthesis of glutathione. T
2734	Cysteine rich repeat. This cysteine rich repeat contains four cysteines. It is found in multiple copies in a protein that binds to fibroblast growth factors. The repeat is also found in MG160 and E-selectin ligand (ESL-1)
2739	Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily
2740	7 transmembrane receptor (Secretin family)
2741	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2741	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2742	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2742	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2743	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
2743	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
2745	Glutaredoxin
2747	Glu/Leu/Phe/Val dehydrogenase, dimerisation domain
2747	Glutamate/Leucine/Phenylalanine/Valine dehydrogenase
2752	Glutamine synthetase, catalytic domain
2764	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
2766	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
2767	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2768	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2769	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2770	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2771	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2773	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2774	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2775	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2776	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2778	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2778	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2778	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2779	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2779	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2780	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2781	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
2794	GTPase of unknown function
2798	7 transmembrane receptor (rhodopsin family)
2799	Sulfatase
2800	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
2803	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
2810	14-3-3 protein
2810	14-3-3 protein
2813	Zona pellucida-like domain
2815	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
2817	Glypican
2819	NAD-dependent glycerol-3-phosphate dehydrogenase
2821	Phosphoglucose isomerase. Phosphoglucose isomerase catalyses the interconversion of glucose-6-phosphate and fructose-6-phosphate
2823	Myelin proteolipid protein (PLP or lipophilin)
2824	Myelin proteolipid protein (PLP or lipophilin)
2825	7 transmembrane receptor (rhodopsin family)
2826	7 transmembrane receptor (rhodopsin family)
2827	7 transmembrane receptor (rhodopsin family)
2828	7 transmembrane receptor (rhodopsin family)
2829	7 transmembrane receptor (rhodopsin family)
2830	7 transmembrane receptor (rhodopsin family)
2831	7 transmembrane receptor (rhodopsin family)
2832	7 transmembrane receptor (rhodopsin family)
2833	7 transmembrane receptor (rhodopsin family)
2834	7 transmembrane receptor (rhodopsin family)
2835	7 transmembrane receptor (rhodopsin family)
2837	7 transmembrane receptor (rhodopsin family)
2838	7 transmembrane receptor (rhodopsin family)
2840	7 transmembrane receptor (rhodopsin family)
2841	7 transmembrane receptor (rhodopsin family)
2842	7 transmembrane receptor (rhodopsin family)
2843	7 transmembrane receptor (rhodopsin family)
2844	7 transmembrane receptor (rhodopsin family)
2845	7 transmembrane receptor (rhodopsin family)
2846	7 transmembrane receptor (rhodopsin family)
2847	7 transmembrane receptor (rhodopsin family)
2848	7 transmembrane receptor (rhodopsin family)
2849	7 transmembrane receptor (rhodopsin family)
2850	7 transmembrane receptor (rhodopsin family)
2852	7 transmembrane receptor (rhodopsin family)
2853	7 transmembrane receptor (rhodopsin family)
2854	7 transmembrane receptor (rhodopsin family)
2857	7 transmembrane receptor (rhodopsin family)
2859	7 transmembrane receptor (rhodopsin family)
2861	7 transmembrane receptor (rhodopsin family)
2862	7 transmembrane receptor (rhodopsin family)
2863	7 transmembrane receptor (rhodopsin family)
2864	7 transmembrane receptor (rhodopsin family)
2865	7 transmembrane receptor (rhodopsin family)
2866	7 transmembrane receptor (rhodopsin family)
2867	7 transmembrane receptor (rhodopsin family)
2868	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
2869	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
2870	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
2876	Glutathione peroxidase
2877	Glutathione peroxidase
2878	Glutathione peroxidase
2880	Glutathione peroxidase
2880	Glutathione peroxidase
2885	SH2 domain
2885	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
2890	Bacterial extracellular solute-binding proteins, family 3
2890	Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors
2890	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2891	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2892	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2892	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2893	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2894	Bacterial extracellular solute-binding proteins, family 3
2894	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2895	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2896	Granulin
2897	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2897	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2898	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2898	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2899	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2900	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2901	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2902	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2902	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2902	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2903	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2904	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2905	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2906	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2907	Uncharacterized protein family UPF0005
2908	Glucocorticoid receptor
2908	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
2908	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
2909	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
2909	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
2911	7 transmembrane receptor (metabotropic glutamate family)
2911	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2912	7 transmembrane receptor (metabotropic glutamate family)
2912	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2913	7 transmembrane receptor (metabotropic glutamate family)
2913	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2914	7 transmembrane receptor (metabotropic glutamate family)
2914	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2915	7 transmembrane receptor (metabotropic glutamate family)
2915	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2916	7 transmembrane receptor (metabotropic glutamate family)
2916	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2917	7 transmembrane receptor (metabotropic glutamate family)
2917	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2917	7 transmembrane receptor (metabotropic glutamate family)
2917	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2917	7 transmembrane receptor (metabotropic glutamate family)
2917	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2918	7 transmembrane receptor (metabotropic glutamate family)
2918	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2919	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
2920	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
2921	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
2923	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
2925	7 transmembrane receptor (rhodopsin family)
2926	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
2928	Homeobox domain
2937	Eukaryotic glutathione synthase
2937	Eukaryotic glutathione synthase, ATP binding domain
2938	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2939	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2940	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2941	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2944	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2944	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2946	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2947	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2948	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2948	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2948	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2949	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2950	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
2956	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
2958	Transcription initiation factor IIA, gamma subunit, helical domain
2958	Transcription initiation factor IIA, gamma subunit, beta-barrel domain
2960	TFIIE alpha subunit
2961	TFIIE beta subunit core domain. General transcription factor TFIIE consists of two subunits, TFIIE alpha pfam02002 and TFIIE beta. TFIIE beta has been found to bind to the region where the promoter starts to open to be single-stranded upon transcription
2963	Transcription initiation factor IIF
2966	Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex
2967	Transcription factor Tfb4
2968	Transcription factor Tfb2
2969	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
2969	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
2969	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
2969	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
2969	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
2977	Adenylate and Guanylate cyclase catalytic domain
2980	Guanylin precursor
2981	Guanylin precursor
2983	Adenylate and Guanylate cyclase catalytic domain
2984	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2986	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
2990	Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
2990	Glycosyl hydrolases family 2, sugar binding domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities and has a jelly-roll fold
2993	Glycophorin A
2994	Glycophorin A
2997	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari
2998	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari
3000	Protein kinase domain
3000	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
3001	Trypsin
3002	Trypsin
3003	Trypsin
3005	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
3006	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
3007	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
3008	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
3009	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
3010	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
3024	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
3028	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
3029	Metallo-beta-lactamase superfamily
3030	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
3032	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
3034	Phenylalanine and histidine ammonia-lyase
3039	Globin
3040	Globin
3043	Globin
3045	Globin
3046	Globin
3047	Globin
3048	Globin
3049	Globin
3050	Globin
3052	Cytochrome c/c1 heme lyase
3053	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
3055	SH2 domain
3055	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
3059	Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018
3060	Prepro-orexin
3061	7 transmembrane receptor (rhodopsin family)
3062	Orexin receptor type 2
3062	7 transmembrane receptor (rhodopsin family)
3064	Huntingtin
3065	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
3066	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
3067	Pyridoxal-dependent decarboxylase conserved domain
3068	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
3069	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
3073	Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold
3074	Glycosyl hydrolase family 20, domain 2. This domain has a zincin-like fold
3074	Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold
3075	Sushi domain (SCR repeat)
3077	Class I Histocompatibility antigen, domains alpha 1 and 2
3077	Class I Histocompatibility antigen, domains alpha 1 and 2
3077	Class I Histocompatibility antigen, domains alpha 1 and 2
3077	Class I Histocompatibility antigen, domains alpha 1 and 2
3077	Class I Histocompatibility antigen, domains alpha 1 and 2
3077	Class I Histocompatibility antigen, domains alpha 1 and 2
3077	Class I Histocompatibility antigen, domains alpha 1 and 2
3077	Class I Histocompatibility antigen, domains alpha 1 and 2
3077	Class I Histocompatibility antigen, domains alpha 1 and 2
3078	Sushi domain (SCR repeat)
3080	Sushi domain (SCR repeat)
3081	homogentisate 1,2-dioxygenase. Homogentisate dioxygenase cleaves the aromatic ring during the metabolic degradation of Phe and Tyr. Homogentisate dioxygenase deficiency causes alkaptonuria. The structure of homogentisate dioxygenase shows that the enzyme
3082	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
3083	Trypsin
3083	Fibronectin type I domain
3083	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
3084	Neuregulin family
3084	Neuregulin family
3084	Neuregulin family
3084	Neuregulin family
3084	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
3087	Homeobox domain
3090	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
3094	HIT family
3096	Zinc finger
3097	Zinc finger
3098	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
3098	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
3098	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
3098	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
3098	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
3099	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
3101	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
3101	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF03727. Some members of the family have two copies of each of these domains
3104	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
3105	Class I Histocompatibility antigen, domains alpha 1 and 2
3106	Class I Histocompatibility antigen, domains alpha 1 and 2
3107	Class I Histocompatibility antigen, domains alpha 1 and 2
3108	Class II histocompatibility antigen, alpha domain
3109	Class II histocompatibility antigen, beta domain
3110	Homeobox domain
3111	Class II histocompatibility antigen, alpha domain
3112	Class II histocompatibility antigen, beta domain
3113	Class II histocompatibility antigen, alpha domain
3115	Class II histocompatibility antigen, beta domain
3117	Class II histocompatibility antigen, alpha domain
3118	Class II histocompatibility antigen, alpha domain
3119	Class II histocompatibility antigen, beta domain
3120	Class II histocompatibility antigen, beta domain
3120	Class II histocompatibility antigen, beta domain
3120	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
3122	Class II histocompatibility antigen, alpha domain
3123	Class II histocompatibility antigen, beta domain
3125	Class II histocompatibility antigen, beta domain
3126	Class II histocompatibility antigen, beta domain
3127	Class II histocompatibility antigen, beta domain
3133	Class I Histocompatibility antigen, domains alpha 1 and 2
3134	Class I Histocompatibility antigen, domains alpha 1 and 2
3135	Class I Histocompatibility antigen, domains alpha 1 and 2
3140	Class I Histocompatibility antigen, domains alpha 1 and 2
3142	Homeobox domain
3148	HMG (high mobility group) box
3149	HMG (high mobility group) box
3155	HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase
3156	Hydroxymethylglutaryl-coenzyme A reductase
3156	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
3157	Hydroxymethylglutaryl-coenzyme A synthase
3158	Hydroxymethylglutaryl-coenzyme A synthase
3162	Heme oxygenase
3163	Heme oxygenase
3164	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
3164	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
3164	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
3164	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
3164	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
3166	Homeobox domain
3169	Fork head domain
3170	Fork head domain
3170	Fork head domain
3171	Fork head domain
3172	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
3172	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
3172	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
3172	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
3172	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
3172	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
3174	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
3174	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
3175	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
3177	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their
3187	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
3188	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
3190	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
3190	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
3190	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
3192	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
3192	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
3195	Homeobox domain
3196	Homeobox domain
3196	Homeobox domain
3198	Homeobox domain
3199	Homeobox domain
3200	Homeobox domain
3200	Homeobox domain
3200	Homeobox domain
3201	Homeobox domain
3202	Homeobox domain
3203	Homeobox domain
3204	Homeobox domain
3205	Homeobox domain
3205	Homeobox domain
3206	Homeobox domain
3206	Homeobox domain
3207	Homeobox domain
3209	Homeobox domain
3211	Homeobox domain
3212	Homeobox domain
3213	Homeobox domain
3214	Homeobox domain
3215	Homeobox domain
3216	Homeobox domain
3216	Homeobox domain
3217	Homeobox domain
3218	Homeobox domain
3219	Homeobox domain
3221	Homeobox domain
3221	Homeobox domain
3222	Homeobox domain
3223	Homeobox domain
3223	Homeobox domain
3224	Homeobox domain
3225	Homeobox domain
3226	Homeobox domain
3227	Homeobox domain
3228	Homeobox domain
3229	Homeobox domain
3231	Homeobox domain
3232	Homeobox domain
3233	Homeobox domain
3234	Homeobox domain
3235	Homeobox domain
3236	Homeobox domain
3237	Homeobox domain
3238	Homeobox domain
3239	Homeobox domain
3240	Trypsin
3248	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
3249	Trypsin
3263	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
3266	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
3269	7 transmembrane receptor (rhodopsin family)
3273	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
3274	7 transmembrane receptor (rhodopsin family)
3280	Helix-loop-helix DNA-binding domain
3283	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
3284	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
3290	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
3290	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
3291	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
3292	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
3293	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
3294	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
3295	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
3295	MaoC like domain. The MaoC protein is found to share similarity with a wide variety of enzymes
3295	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
3297	HSF-type DNA-binding
3298	HSF-type DNA-binding
3299	HSF-type DNA-binding
3303	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
3304	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
3305	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
3306	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
3308	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
3309	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
3310	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
3312	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
3312	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
3313	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
3315	Hsp20/alpha crystallin family
3316	Hsp20/alpha crystallin family
3320	Hsp90 protein
3324	Hsp90 protein
3324	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
3326	Hsp90 protein
3336	Chaperonin 10 Kd subunit
3339	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
3340	Sulfotransferase protein
3344	Fork head domain
3350	7 transmembrane receptor (rhodopsin family)
3351	7 transmembrane receptor (rhodopsin family)
3352	7 transmembrane receptor (rhodopsin family)
3354	7 transmembrane receptor (rhodopsin family)
3355	7 transmembrane receptor (rhodopsin family)
3356	7 transmembrane receptor (rhodopsin family)
3357	7 transmembrane receptor (rhodopsin family)
3358	7 transmembrane receptor (rhodopsin family)
3359	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
3359	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
3360	7 transmembrane receptor (rhodopsin family)
3361	7 transmembrane receptor (rhodopsin family)
3362	7 transmembrane receptor (rhodopsin family)
3363	7 transmembrane receptor (rhodopsin family)
3363	7 transmembrane receptor (rhodopsin family)
3363	7 transmembrane receptor (rhodopsin family)
3364	Hus1-like protein. Hus1, Rad1, and Rad9 are three evolutionarily conserved proteins required for checkpoint control in fission yeast. These proteins are known to form a stable complex in vivo. Hus1-Rad1-Rad9 complex may form a PCNA-like ring structure, a
3371	Fibronectin type III domain
3373	Hyaluronidase
3373	Hyaluronidase
3373	Hyaluronidase
3373	Hyaluronidase
3373	Hyaluronidase
3373	Hyaluronidase
3373	Hyaluronidase
3383	Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes,
3384	Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes,
3394	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp
3396	Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly con
3398	Helix-loop-helix DNA-binding domain
3400	Helix-loop-helix DNA-binding domain
3423	Sulfatase
3423	Sulfatase
3425	Glycosyl hydrolases family 39
3426	Trypsin
3426	Low-density lipoprotein receptor domain class A
3426	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
3428	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
3428	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
3431	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
3431	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
3431	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
3431	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
3431	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
3431	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
3439	Interferon alpha/beta domain
3440	Interferon alpha/beta domain
3441	Interferon alpha/beta domain
3442	Interferon alpha/beta domain
3443	Interferon alpha/beta domain
3444	Interferon alpha/beta domain
3445	Interferon alpha/beta domain
3446	Interferon alpha/beta domain
3447	Interferon alpha/beta domain
3448	Interferon alpha/beta domain
3449	Interferon alpha/beta domain
3451	Interferon alpha/beta domain
3452	Interferon alpha/beta domain
3455	Tissue factor
3456	Interferon alpha/beta domain
3458	Interferon gamma
3460	Fibronectin type III domain
3467	Interferon alpha/beta domain
3476	TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway
3479	Insulin/IGF/Relaxin family
3480	Fibronectin type III domain
3480	Furin-like cysteine rich region
3480	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
3481	Insulin/IGF/Relaxin family
3482	Cation-independent mannose-6-phosphate receptor repeat. The cation-independent mannose-6-phosphate receptor contains 15 copies of a repeat
3483	Leucine rich repeat C-terminal domain
3484	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
3485	Thyroglobulin type-1 repeat
3485	Insulin-like growth factor binding proteins
3486	Thyroglobulin type-1 repeat
3486	Insulin-like growth factor binding protein
3487	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
3488	Thyroglobulin type-1 repeat
3488	Insulin-like growth factor binding proteins
3489	Thyroglobulin type-1 repeat
3489	Insulin-like growth factor binding proteins
3491	Thrombospondin type 1 domain
3491	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
3516	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
3549	Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module
3549	Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation
3551	Protein kinase domain
3553	Interleukin-1 propeptide. The Interleukin-1 cytokines are translated as precursor proteins. The N terminal approx. 115 amino acids form a propeptide that is cleaved off to release the active interleukin-1
3554	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
3556	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
3558	Interleukin 2
3559	Sushi domain (SCR repeat)
3562	Interleukin-3
3565	Interleukin 4
3565	Interleukin 4
3566	Fibronectin type III domain
3567	Interleukin 5
3569	Interleukin-6/G-CSF/MGF family
3572	Fibronectin type III domain
3572	Fibronectin type III domain
3575	Fibronectin type III domain
3576	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
3577	7 transmembrane receptor (rhodopsin family)
3578	Interleukin 7/9 family. IL-7 is a cytokine that acts as a growth factor for early lymphoid cells of both B- and T-cell lineages. IL-9 is a multi-functional cytokine that, although originally described as a T-cell growth factor, its function in T-cell res
3579	7 transmembrane receptor (rhodopsin family)
3586	Interleukin 10
3588	Tissue factor
3592	Interleukin-12 alpha subunit. Interleukin 12 (IL-12) is a disulphide-bonded heterodimer consisting of a 35kDa alpha subunit and a 40kDa beta subunit. It is involved in the stimulation and maintenance of Th1 cellular immune responses, including the normal
3594	Fibronectin type III domain
3595	Fibronectin type III domain
3600	Interleukin 15. Interleukin-15 (IL-15) is a cytokine that possesses a variety of biological functions, including stimulation and maintenance of cellular immune responses
3600	Interleukin 15. Interleukin-15 (IL-15) is a cytokine that possesses a variety of biological functions, including stimulation and maintenance of cellular immune responses
3600	Interleukin 15. Interleukin-15 (IL-15) is a cytokine that possesses a variety of biological functions, including stimulation and maintenance of cellular immune responses
3606	Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18
3607	Fork head domain
3607	Fork head domain
3609	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
3609	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
3609	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
3611	Protein kinase domain
3612	Inositol monophosphatase family
3613	Inositol monophosphatase family
3614	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
3615	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
3617	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
3619	Inner centromere protein, ARK binding region. This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregati
3620	Indoleamine 2,3-dioxygenase
3624	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
3625	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
3626	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
3627	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
3628	Inositol monophosphatase family
3630	Insulin/IGF/Relaxin family. Superfamily includes insulins
3643	Fibronectin type III domain
3643	Furin-like cysteine rich region
3643	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
3645	Fibronectin type III domain
3645	Furin-like cysteine rich region
3645	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
3645	Fibronectin type III domain
3645	Furin-like cysteine rich region
3645	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
3646	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
3651	Homeobox domain
3652	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
3654	Protein kinase domain
3656	Death domain
3656	Protein kinase domain
3658	Aconitase family (aconitate hydratase)
3658	Aconitase C-terminal domain. Members of this family usually also match to pfam00330. This domain undergoes conformational change in the enzyme mechanism
3659	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp
3660	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp
3662	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp
3667	PTB domain (IRS-1 type)
3670	LIM domain. This family represents two copies of the LIM structural domain
3671	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
3672	von Willebrand factor type A domain
3673	von Willebrand factor type A domain
3674	Integrin alpha cytoplasmic region. This family contains the short intracellular region of integrin alpha chains
3678	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
3681	von Willebrand factor type A domain
3682	von Willebrand factor type A domain
3683	von Willebrand factor type A domain
3684	von Willebrand factor type A domain
3684	Integrin alpha cytoplasmic region. This family contains the short intracellular region of integrin alpha chains
3688	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3688	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3688	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3688	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3688	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3688	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3689	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3690	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3691	Calx-beta domain
3691	Fibronectin type III domain
3691	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3692	eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues
3692	eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues
3693	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3694	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3695	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3696	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
3697	von Willebrand factor type A domain
3702	SH2 domain
3702	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
3702	BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region
3703	Oligosaccharyl transferase STT3 subunit. This family consists of the oligsacharyl transferase STT3 subunit and related proteins. The STT3 subunit is part of the oligosccharyl transferase (OTase) complex of proteins and is required for its activity. OTase
3706	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
3707	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
3708	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
3708	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3709	MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function
3709	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
3710	MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function
3710	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
3714	Delta serrate ligand
3714	Delta serrate ligand
3716	Protein kinase domain
3720	jmjC domain
3720	jmjN domain
3720	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
3720	pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function
3725	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
3725	Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases
3726	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
3726	Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases
3727	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
3727	Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases
3728	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
3728	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
3730	Fibronectin type III domain
3732	Tetraspanin family
3736	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3736	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3737	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3737	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3738	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3738	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3739	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3739	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3741	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3741	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3742	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3742	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3743	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3743	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3744	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3744	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3745	Kv2 voltage-gated K+ channel
3745	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3745	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3746	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3746	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3747	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3747	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3747	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3747	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3747	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3747	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3748	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3748	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3749	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3749	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3749	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3749	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3750	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3750	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3751	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3751	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3752	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3752	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3752	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3752	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3753	Slow voltage-gated potassium channel
3754	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3754	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3755	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3755	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3755	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3756	Cyclic nucleotide-binding domain
3756	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
3756	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3756	Cyclic nucleotide-binding domain
3756	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
3756	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3757	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
3757	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3757	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
3757	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3757	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3758	Inward rectifier potassium channel
3758	Inward rectifier potassium channel
3758	Inward rectifier potassium channel
3758	Inward rectifier potassium channel
3758	Inward rectifier potassium channel
3759	Inward rectifier potassium channel
3760	Inward rectifier potassium channel
3761	Inward rectifier potassium channel
3761	Inward rectifier potassium channel
3762	Inward rectifier potassium channel
3763	Inward rectifier potassium channel
3764	Inward rectifier potassium channel
3765	Inward rectifier potassium channel
3766	Inward rectifier potassium channel
3767	Inward rectifier potassium channel
3768	Inward rectifier potassium channel
3769	Inward rectifier potassium channel
3770	Inward rectifier potassium channel
3770	Inward rectifier potassium channel
3772	Inward rectifier potassium channel
3772	Inward rectifier potassium channel
3772	Inward rectifier potassium channel
3773	Inward rectifier potassium channel
3773	Inward rectifier potassium channel
3773	Inward rectifier potassium channel
3777	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3778	Calcium-activated BK potassium channel alpha subunit
3778	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3779	Calcium-activated potassium channel, beta subunit
3780	Calcium-activated SK potassium channel
3780	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
3781	Calcium-activated SK potassium channel
3781	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
3781	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
3782	Calcium-activated SK potassium channel
3782	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
3782	Calcium-activated SK potassium channel
3782	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
3783	Calcium-activated SK potassium channel
3783	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
3784	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
3784	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3784	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
3784	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3784	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
3784	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3785	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
3785	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3785	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
3785	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3785	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
3785	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3785	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
3785	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3785	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3786	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
3786	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3787	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3787	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3788	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3788	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
3790	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
3790	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
3795	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es
3795	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es
3797	Kinesin motor domain
3798	Kinesin motor domain
3799	Kinesin motor domain
3800	Kinesin motor domain
3801	Kinesin motor domain
3818	Trypsin
3827	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
3848	Intermediate filament protein
3849	Intermediate filament protein
3850	Intermediate filament protein
3851	Intermediate filament protein
3852	Intermediate filament protein
3853	Intermediate filament protein
3854	Intermediate filament protein
3855	Intermediate filament protein
3856	Intermediate filament protein
3857	Intermediate filament protein
3858	Intermediate filament protein
3859	Intermediate filament protein
3860	Intermediate filament protein
3860	Intermediate filament protein
3861	Intermediate filament protein
3866	Intermediate filament protein
3868	Intermediate filament protein
3872	Intermediate filament protein
3875	Intermediate filament protein
3880	Intermediate filament protein
3881	Intermediate filament protein
3882	Intermediate filament protein
3883	Intermediate filament protein
3884	Intermediate filament protein
3885	Intermediate filament protein
3886	Intermediate filament protein
3887	Intermediate filament protein
3888	Intermediate filament protein
3889	Intermediate filament protein
3890	Intermediate filament protein
3891	Intermediate filament protein
3892	Intermediate filament protein
3897	Fibronectin type III domain
3897	Fibronectin type III domain
3897	Fibronectin type III domain
3897	Fibronectin type III domain
3898	7 transmembrane receptor (rhodopsin family)
3906	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzyme
3908	Laminin G domain
3908	Laminin N-terminal (Domain VI)
3908	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
3910	Giardia variant-specific surface protein
3911	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
3912	Laminin N-terminal (Domain VI)
3912	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
3913	Laminin N-terminal (Domain VI)
3913	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
3914	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
3915	Laminin B (Domain IV)
3915	Laminin N-terminal (Domain VI)
3915	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
3916	Lysosome-associated membrane glycoprotein (Lamp)
3918	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
3918	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
3918	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
3918	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
3920	Lysosome-associated membrane glycoprotein (Lamp)
3920	Lysosome-associated membrane glycoprotein (Lamp)
3921	Ribosomal protein S2
3925	Stathmin family
3927	LIM domain. This family represents two copies of the LIM structural domain
3929	LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
3930	Ergosterol biosynthesis ERG4/ERG24 family
3931	Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol
3932	SH2 domain
3933	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
3934	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
3936	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
3937	SH2 domain
3938	Glycosyl hydrolase family 1
3949	Low-density lipoprotein receptor domain class A
3949	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
3952	Leptin
3955	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
3956	Galactoside-binding lectin
3957	Galactoside-binding lectin
3958	Galactoside-binding lectin
3973	7 transmembrane receptor (rhodopsin family)
3975	Homeobox domain
3975	LIM domain. This family represents two copies of the LIM structural domain
3976	LIF / OSM family
3980	ATP dependent DNA ligase domain. This domain belongs to a more diverse superfamily, including pfam01331 and pfam01653
3980	Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc
3980	ATP dependent DNA ligase domain. This domain belongs to a more diverse superfamily, including pfam01331 and pfam01653
3980	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
3980	Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc
3981	ATP dependent DNA ligase domain. This domain belongs to a more diverse superfamily, including pfam01331 and pfam01653
3981	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
3982	PMP-22/EMP/MP20/Claudin family
3983	Villin headpiece domain
3983	LIM domain. This family represents two copies of the LIM structural domain
3983	Villin headpiece domain
3983	LIM domain. This family represents two copies of the LIM structural domain
3984	Protein kinase domain
3984	LIM domain. This family represents two copies of the LIM structural domain
3984	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
3984	LIM domain. This family represents two copies of the LIM structural domain
3984	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
3985	LIM domain. This family represents two copies of the LIM structural domain
3985	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
3985	LIM domain. This family represents two copies of the LIM structural domain
3985	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
3987	LIM domain. This family represents two copies of the LIM structural domain
3988	ab-hydrolase associated lipase region
3990	Lipase
3992	Fatty acid desaturase
3992	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
3995	Fatty acid desaturase
3995	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
3998	Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-
4000	Intermediate filament protein
4000	Intermediate filament tail domain
4000	Intermediate filament protein
4000	Intermediate filament tail domain
4000	Intermediate filament protein
4000	Intermediate filament tail domain
4001	Intermediate filament protein
4001	Intermediate filament tail domain
4004	LIM domain. This family represents two copies of the LIM structural domain
4005	LIM domain. This family represents two copies of the LIM structural domain
4007	LIM domain. This family represents two copies of the LIM structural domain
4008	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
4009	Homeobox domain
4009	LIM domain. This family represents two copies of the LIM structural domain
4010	Homeobox domain
4010	LIM domain. This family represents two copies of the LIM structural domain
4012	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
4015	Lysyl oxidase
4016	Lysyl oxidase
4017	Lysyl oxidase
4018	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
4023	Lipase
4023	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
4025	Animal haem peroxidase
4033	Uncharacterized ACR, COG1579
4035	Low-density lipoprotein receptor domain class A
4035	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
4036	Low-density lipoprotein receptor domain class A
4036	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
4038	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
4040	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
4041	Low-density lipoprotein receptor domain class A
4041	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
4046	Caldesmon
4048	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
4050	TNF(Tumor Necrosis Factor) family
4051	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
4052	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
4053	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
4054	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
4055	TNFR/NGFR cysteine-rich region
4056	MAPEG family
4056	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
4057	Transferrin
4061	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
4062	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
4064	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
4065	Lectin C-type domain. This family includes both long and short form C-type
4067	SH2 domain
4067	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
4070	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
4072	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
4074	Cation-dependent mannose-6-phosphate receptor
4077	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
4077	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
4077	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
4081	Mab-21 protein
4084	Helix-loop-helix DNA-binding domain
4085	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
4086	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
4086	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
4087	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
4087	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
4088	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
4088	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
4089	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
4089	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
4090	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
4090	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
4092	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
4092	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
4093	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
4093	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
4094	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
4097	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
4100	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4101	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4101	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4101	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4102	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4103	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4104	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4105	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4105	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4107	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4108	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4109	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4110	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4111	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4112	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4112	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4112	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4113	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4114	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4115	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4116	Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the
4121	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
4122	Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism
4124	Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism
4125	Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism
4128	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
4129	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
4133	Tau and MAP protein, tubulin-binding repeat
4133	Tau and MAP protein, tubulin-binding repeat
4133	Tau and MAP protein, tubulin-binding repeat
4133	Tau and MAP protein, tubulin-binding repeat
4134	Tau and MAP protein, tubulin-binding repeat
4134	Tau and MAP protein, tubulin-binding repeat
4134	Tau and MAP protein, tubulin-binding repeat
4134	Tau and MAP protein, tubulin-binding repeat
4137	Tau and MAP protein, tubulin-binding repeat
4137	Tau and MAP protein, tubulin-binding repeat
4137	Tau and MAP protein, tubulin-binding repeat
4137	Tau and MAP protein, tubulin-binding repeat
4139	Kinase associated domain 1
4140	Kinase associated domain 1
4141	WHEP-TRS domain
4142	7 transmembrane receptor (rhodopsin family)
4145	SH2 domain
4145	Protein kinase domain
4145	SH2 domain
4145	Protein kinase domain
4145	SH2 domain
4145	Protein kinase domain
4146	von Willebrand factor type A domain
4147	von Willebrand factor type A domain
4147	von Willebrand factor type A domain
4147	von Willebrand factor type A domain
4147	von Willebrand factor type A domain
4148	von Willebrand factor type A domain
4149	Helix-loop-helix DNA-binding domain
4149	Helix-loop-helix DNA-binding domain
4149	Helix-loop-helix DNA-binding domain
4149	Helix-loop-helix DNA-binding domain
4149	Helix-loop-helix DNA-binding domain
4151	Globin
4152	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
4152	Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha
4152	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
4152	Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha
4152	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
4152	Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha
4152	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
4152	Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha
4152	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
4152	Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha
4153	Lectin C-type domain. This family includes both long and short form C-type
4155	Myelin basic protein
4157	7 transmembrane receptor (rhodopsin family)
4158	7 transmembrane receptor (rhodopsin family)
4159	7 transmembrane receptor (rhodopsin family)
4160	7 transmembrane receptor (rhodopsin family)
4161	7 transmembrane receptor (rhodopsin family)
4163	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
4168	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
4170	Apoptosis regulator proteins, Bcl-2 family
4170	Apoptosis regulator proteins, Bcl-2 family
4173	MCM2/3/5 family
4175	MCM2/3/5 family
4176	MCM2/3/5 family
4176	MCM2/3/5 family
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4179	Sushi domain (SCR repeat)
4185	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
4185	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
4185	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
4185	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
4192	PTN/MK heparin-binding protein family
4193	p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding do
4193	p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding do
4194	p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding do
4199	Malic enzyme, NAD binding domain
4205	SRF-type transcription factor (DNA-binding and dimerisation domain)
4207	SRF-type transcription factor (DNA-binding and dimerisation domain)
4208	SRF-type transcription factor (DNA-binding and dimerisation domain)
4209	SRF-type transcription factor (DNA-binding and dimerisation domain)
4210	B-box zinc finger
4210	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
4214	Protein kinase domain
4218	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
4222	Homeobox domain
4223	Homeobox domain
4224	MAM domain. An extracellular domain found in many receptors
4224	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
4224	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
4224	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
4225	MAM domain. An extracellular domain found in many receptors
4225	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
4225	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
4233	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
4233	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
4233	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
4234	Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity
4234	Putative methyltransferase
4240	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
4240	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
4241	Transferrin
4241	Transferrin
4242	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
4245	GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis o
4246	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
4250	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
4254	Stem cell factor. Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. The crystal structure of human SCF has been resolved and a potential receptor-binding site
4254	Stem cell factor. Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. The crystal structure of human SCF has been resolved and a potential receptor-binding site
4255	6-O-methylguanine DNA methyltransferase, DNA binding domain. This domain is a 3 helical bundle
4257	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
4257	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
4257	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
4257	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
4258	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
4259	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
4276	Class I Histocompatibility antigen, domains alpha 1 and 2
4277	Class I Histocompatibility antigen, domains alpha 1 and 2
4282	Macrophage migration inhibitory factor (MIF)
4283	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
4284	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
4285	Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but fun
4286	Helix-loop-helix DNA-binding domain
4287	Josephin
4287	Josephin
4288	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
4289	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
4289	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
4291	3'-5' exonuclease
4293	Protein kinase domain
4297	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
4297	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
4297	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
4298	YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity
4300	YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity
4301	DIL domain. The DIL domain has no known function
4301	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
4301	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
4306	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
4306	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
4308	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
4312	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4312	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4312	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4313	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4313	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4313	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4314	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4314	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4314	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4316	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4316	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4317	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4317	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4317	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4318	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4318	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4319	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4319	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4319	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4320	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4320	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4321	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4321	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4321	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4322	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4322	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4322	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4323	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4323	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4323	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4324	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4324	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4324	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4325	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4325	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4325	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4325	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4325	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4325	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4326	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4326	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4326	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
4327	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
4328	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
4329	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
4332	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
4332	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
4335	Helix-loop-helix DNA-binding domain
4338	MoaE protein. This family contains the MoaE protein that is involved in biosynthesis of molybdopterin. Molybdopterin, the universal component of the pterin molybdenum cofactors, contains a dithiolene group serving to bind Mo. Addition of the dithiolene s
4338	ThiS family. ThiS (thiaminS) is a 66 aa protein involved in sulphur transfer. ThiS is coded in the thiCEFSGH operon in E. coli. This family of proteins have two conserved Glycines at the COOH terminus. Thiocarboxylate is formed at the last G in the activ
4342	Protein kinase domain
4350	Methylpurine-DNA glycosylase (MPG). Methylpurine-DNA glycosylase is a base excision-repair protein. It is responsible for the hydrolysis of the deoxyribose N-glycosidic bond, excising 3-methyladenine and 3-methylguanine from damaged DNA
4351	Phosphomannose isomerase type I
4352	Fibronectin type III domain
4353	Animal haem peroxidase
4354	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
4355	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
4358	Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is
4360	Lectin C-type domain. This family includes both long and short form C-type
4361	Mer11 DNA-binding domain. The Mre11 complex is a multisubunit nuclease that is composed of Mre11, Rad50 and Nbs1/Xrs2, and is involved in checkpoint signalling and DNA replication. Mre11 has an intrinsic DNA-binding activity that is stimulated by Rad50 o
4361	Mer11 DNA-binding domain. The Mre11 complex is a multisubunit nuclease that is composed of Mre11, Rad50 and Nbs1/Xrs2, and is involved in checkpoint signalling and DNA replication. Mre11 has an intrinsic DNA-binding activity that is stimulated by Rad50 o
4363	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
4363	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
4430	Myosin head (motor domain)
4478	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4478	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4481	Macrophage scavenger receptor
4481	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
4481	Macrophage scavenger receptor
4481	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
4481	Macrophage scavenger receptor
4481	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
4485	Trypsin
4485	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
4486	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
4486	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
4486	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
4487	Homeobox domain
4488	Homeobox domain
4507	Phosphorylase family 2
4508	ATP synthase A chain
4512	Cytochrome C and Quinol oxidase polypeptide I
4513	Cytochrome C oxidase subunit II, periplasmic domain
4521	NUDIX domain
4522	Formate--tetrahydrofolate ligase
4524	Methylenetetrahydrofolate reductase. This family includes the 5,10-methylenetetrahydrofolate reductase EC:1.7.99.5 from bacteria and methylenetetrahydrofolate reductase EC: 1.5.1.20 from eukaryotes. The structure for this domain is known to be a TIM barr
4534	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
4535	NADH dehydrogenase
4536	NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane
4537	NADH-ubiquinone/plastoquinone oxidoreductase, chain 3
4539	NADH-ubiquinone/plastoquinone oxidoreductase chain 4L
4541	NADH-ubiquinone/plastoquinone oxidoreductase chain 6
4543	7 transmembrane receptor (rhodopsin family)
4544	7 transmembrane receptor (rhodopsin family)
4547	Lipoprotein amino terminal region. This family contains regions from: Vitellogenin, Microsomal triglyceride transfer protein and apolipoprotein B-100. These proteins are all involved in lipid transport. This family contains the LV1n chain from lipovitell
4552	Flavodoxin
4552	Flavodoxin
4582	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
4582	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
4582	Structural protein 2. This family represents structural protein 2 of the hepatitis E virus. The high basic amino acid content of this protein has lead to the suggestion of a role in viral genomic RNA encapsidation
4583	von Willebrand factor type D domain
4583	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
4583	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerbe
4587	von Willebrand factor type D domain
4587	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
4591	B-box zinc finger
4593	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
4594	Methylmalonyl-CoA mutase. The enzyme methylmalonyl-CoA mutase is a member of a class of enzymes that uses coenzyme B12 (adenosylcobalamin) as a cofactor. The enzyme induces the formation of an adenosyl radical from the cofactor. This radical then initiat
4599	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
4600	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
4603	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
4609	Helix-loop-helix DNA-binding domain
4609	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo
4610	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo
4611	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo
4613	Helix-loop-helix DNA-binding domain
4613	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo
4615	Death domain
4616	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
4617	Helix-loop-helix DNA-binding domain
4617	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates
4618	Helix-loop-helix DNA-binding domain
4618	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates
4619	Myosin head (motor domain)
4620	Myosin head (motor domain)
4621	Myosin head (motor domain)
4622	Myosin head (motor domain)
4624	Myosin head (motor domain)
4624	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
4625	Myosin head (motor domain)
4626	Myosin head (motor domain)
4627	Myosin head (motor domain)
4627	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
4628	Myosin head (motor domain)
4628	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4628	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
4628	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
4629	Myosin head (motor domain)
4629	Myosin head (motor domain)
4638	Fibronectin type III domain
4638	Fibronectin type III domain
4638	Fibronectin type III domain
4638	Fibronectin type III domain
4638	Fibronectin type III domain
4638	Fibronectin type III domain
4638	Fibronectin type III domain
4642	Myosin head (motor domain)
4644	Myosin head (motor domain)
4648	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
4648	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4649	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
4650	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
4651	PH domain. PH stands for pleckstrin homology
4651	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4654	Helix-loop-helix DNA-binding domain
4654	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates
4656	Helix-loop-helix DNA-binding domain
4656	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates
4661	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
4668	Melibiase
4673	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
4673	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
4674	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
4675	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
4676	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
4681	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb
4681	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb
4683	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
4683	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
4684	Fibronectin type III domain
4685	Fibronectin type III domain
4687	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
4688	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
4690	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
4692	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
4693	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
4703	Nebulin repeat
4729	Respiratory-chain NADH dehydrogenase 24 Kd subunit
4734	C2 domain
4734	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
4734	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
4735	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
4736	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
4738	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
4741	Intermediate filament protein
4744	Intermediate filament protein
4745	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
4747	Intermediate filament protein
4750	Protein kinase domain
4753	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
4756	Fibronectin type III domain
4760	Helix-loop-helix DNA-binding domain
4761	Helix-loop-helix DNA-binding domain
4762	Helix-loop-helix DNA-binding domain
4771	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4771	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4771	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4771	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4771	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4771	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4771	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4771	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4771	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4771	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4771	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4771	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4771	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4771	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4771	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4771	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4771	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4771	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4771	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4771	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4771	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
4771	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
4774	CTF/NF-I family
4774	CTF/NF-I family
4781	CTF/NF-I family
4782	CTF/NF-I family
4784	CTF/NF-I family
4790	Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha
4791	Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha
4796	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
4801	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
4803	Nerve growth factor family
4804	TNFR/NGFR cysteine-rich region
4808	Helix-loop-helix DNA-binding domain
4811	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
4817	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
4820	Cyclophilin type peptidyl-prolyl cis-trans isomerase
4821	Homeobox domain
4824	Homeobox domain
4825	Homeobox domain
4829	7 transmembrane receptor (rhodopsin family)
4830	Nucleoside diphosphate kinase
4831	Nucleoside diphosphate kinase
4832	Nucleoside diphosphate kinase
4833	Nucleoside diphosphate kinases
4835	Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones
4837	NNMT/PNMT/TEMT family
4838	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
4839	NOL1/NOP2/sun family
4842	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain
4842	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
4843	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain
4843	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain
4846	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain
4848	NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5
4849	Not1 N-terminal domain, CCR4-Not complex component
4849	NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5
4852	Pancreatic hormone peptide
4853	Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved
4854	Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved
4860	Phosphorylase family 2
4862	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
4864	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
4868	Fibronectin type III domain
4869	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
4878	Atrial natriuretic peptide
4879	Atrial natriuretic peptide
4880	Atrial natriuretic peptide
4881	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
4882	Protein kinase domain
4882	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
4882	Protein kinase domain
4882	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
4883	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
4886	7 transmembrane receptor (rhodopsin family)
4887	7 transmembrane receptor (rhodopsin family)
4889	7 transmembrane receptor (rhodopsin family)
4891	Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined
4892	Nebulin repeat
4897	Fibronectin type III domain
4901	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
4902	Transforming growth factor beta like domain
4905	Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist o
4908	Nerve growth factor family
4909	Nerve growth factor family
4914	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
4915	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
4916	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
4917	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
4919	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
4920	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
4921	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
4923	7 transmembrane receptor (rhodopsin family)
4928	Conserved nucleoporin domain
4928	Conserved nucleoporin domain
4928	Conserved nucleoporin domain
4928	Conserved nucleoporin domain
4929	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
4929	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
4929	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
4929	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
4929	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
4929	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
4929	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
4929	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
4935	Ocular albinism type 1 protein
4939	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance t
4940	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance t
4943	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
4946	Ornithine decarboxylase antizyme
4947	Ornithine decarboxylase antizyme
4950	Occludin/ELL family
4952	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
4952	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
4953	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
4956	Hsp20/alpha crystallin family
4976	Dynamin family
4976	Dynamin family
4976	Dynamin family
4976	Dynamin family
4976	Dynamin family
4976	Dynamin family
4976	Dynamin family
4976	Dynamin family
4982	TNFR/NGFR cysteine-rich region
4985	7 transmembrane receptor (rhodopsin family)
4986	7 transmembrane receptor (rhodopsin family)
4987	7 transmembrane receptor (rhodopsin family)
4987	7 transmembrane receptor (rhodopsin family)
4988	7 transmembrane receptor (rhodopsin family)
4991	7 transmembrane receptor (rhodopsin family)
4992	7 transmembrane receptor (rhodopsin family)
4993	7 transmembrane receptor (rhodopsin family)
4994	7 transmembrane receptor (rhodopsin family)
4995	7 transmembrane receptor (rhodopsin family)
4999	Origin recognition complex subunit 2. All DNA replication initiation is driven by a single conserved eukaryotic initiator complex termed he origin recognition complex (ORC). The ORC is a six protein complex. The function of ORC is reviewed in
5004	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
5005	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
5007	Oxysterol-binding protein
5008	LIF / OSM family
5010	PMP-22/EMP/MP20/Claudin family
5013	Homeobox domain
5013	Otx1 transcription factor
5015	Homeobox domain
5015	Otx1 transcription factor
5015	Homeobox domain
5015	Otx1 transcription factor
5020	Neurohypophysial hormones, C-terminal Domain. N-terminal Domain is in hormone5
5021	7 transmembrane receptor (rhodopsin family)
5023	ATP P2X receptor
5024	ATP P2X receptor
5025	ATP P2X receptor
5025	ATP P2X receptor
5025	ATP P2X receptor
5026	ATP P2X receptor
5026	ATP P2X receptor
5026	ATP P2X receptor
5027	ATP P2X receptor
5027	ATP P2X receptor
5028	7 transmembrane receptor (rhodopsin family)
5029	7 transmembrane receptor (rhodopsin family)
5029	7 transmembrane receptor (rhodopsin family)
5029	7 transmembrane receptor (rhodopsin family)
5030	7 transmembrane receptor (rhodopsin family)
5031	7 transmembrane receptor (rhodopsin family)
5031	7 transmembrane receptor (rhodopsin family)
5031	7 transmembrane receptor (rhodopsin family)
5031	7 transmembrane receptor (rhodopsin family)
5032	7 transmembrane receptor (rhodopsin family)
5034	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
5036	metallopeptidase family M24
5037	Phosphatidylethanolamine-binding protein
5042	Poly-adenylate binding protein, unique domain
5045	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
5046	Giardia variant-specific surface protein
5046	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
5046	Giardia variant-specific surface protein
5046	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
5046	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
5046	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
5046	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
5046	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
5046	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
5046	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
5047	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
5049	Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellu
5050	Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellu
5051	Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl
5053	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
5054	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
5058	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
5062	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
5063	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
5064	Paralemmin
5066	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
5066	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
5066	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
5066	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
5066	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
5066	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
5066	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
5066	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
5067	Fibronectin type III domain
5067	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
5068	Lectin C-type domain. This family includes both long and short form C-type
5068	Lectin C-type domain. This family includes both long and short form C-type
5068	Lectin C-type domain. This family includes both long and short form C-type
5071	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
5071	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
5071	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
5071	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
5071	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
5071	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
5073	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
5075	'Paired box' domain
5076	'Paired box' domain
5076	'Paired box' domain
5077	'Paired box' domain
5077	'Paired box' domain
5077	Homeobox domain
5077	'Paired box' domain
5077	Homeobox domain
5077	'Paired box' domain
5077	Homeobox domain
5077	'Paired box' domain
5077	Homeobox domain
5077	'Paired box' domain
5077	Homeobox domain
5077	'Paired box' domain
5078	Homeobox domain
5078	'Paired box' domain
5079	'Paired box' domain
5080	Homeobox domain
5080	'Paired box' domain
5080	Homeobox domain
5080	'Paired box' domain
5081	Homeobox domain
5081	'Paired box' domain
5081	Homeobox domain
5081	'Paired box' domain
5082	Phosducin
5083	'Paired box' domain
5087	Homeobox domain
5087	PBX domain. The PBX domain is a bipartite acidic domain
5089	Homeobox domain
5089	PBX domain. The PBX domain is a bipartite acidic domain
5090	Homeobox domain
5090	PBX domain. The PBX domain is a bipartite acidic domain
5092	Pterin 4 alpha carbinolamine dehydratase. Pterin 4 alpha carbinolamine dehydratase is also known as DCoH (dimerisation cofactor of hepatocyte nuclear factor 1-alpha)
5096	Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction
5099	Cadherin domain
5100	Cadherin domain
5100	Cadherin domain
5101	Cadherin domain
5104	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
5105	Phosphoenolpyruvate carboxykinase. Catalyses the formation of phosphoenolpyruvate by decarboxylation of oxaloacetate
5106	Phosphoenolpyruvate carboxykinase. Catalyses the formation of phosphoenolpyruvate by decarboxylation of oxaloacetate
5110	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
5111	Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA
5111	Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA
5118	CUB domain
5122	Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic
5122	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
5125	Giardia variant-specific surface protein
5125	Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic
5125	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
5126	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
5129	Protein kinase domain
5132	Phosducin
5132	Phosducin
5134	MYND finger
5134	Programmed cell death protein 2, C-terminal domain
5134	MYND finger
5134	Programmed cell death protein 2, C-terminal domain
5136	3'5'-cyclic nucleotide phosphodiesterase
5137	3'5'-cyclic nucleotide phosphodiesterase
5138	3'5'-cyclic nucleotide phosphodiesterase
5139	3'5'-cyclic nucleotide phosphodiesterase
5140	3'5'-cyclic nucleotide phosphodiesterase
5141	3'5'-cyclic nucleotide phosphodiesterase
5142	3'5'-cyclic nucleotide phosphodiesterase
5143	3'5'-cyclic nucleotide phosphodiesterase
5144	3'5'-cyclic nucleotide phosphodiesterase
5145	3'5'-cyclic nucleotide phosphodiesterase
5146	3'5'-cyclic nucleotide phosphodiesterase
5150	3'5'-cyclic nucleotide phosphodiesterase
5150	3'5'-cyclic nucleotide phosphodiesterase
5151	3'5'-cyclic nucleotide phosphodiesterase
5152	3'5'-cyclic nucleotide phosphodiesterase
5153	3'5'-cyclic nucleotide phosphodiesterase
5154	Platelet-derived growth factor (PDGF)
5154	Platelet-derived growth factor (PDGF)
5155	Platelet-derived growth factor (PDGF)
5155	Platelet-derived growth factor (PDGF)
5158	3'5'-cyclic nucleotide phosphodiesterase
5160	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
5161	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
5163	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
5164	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
5165	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
5166	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
5167	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
5168	Somatomedin B domain
5168	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
5169	Somatomedin B domain
5169	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
5172	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
5173	Vertebrate endogenous opioids neuropeptide
5179	Vertebrate endogenous opioids neuropeptide
5184	metallopeptidase family M24
5194	Peroxin 13, N-terminal domain. Both termini of the Peroxin-13 are oriented to the cytosol. Peroxin-13 is required for peroxisomal association of peroxin-14
5196	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
5197	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
5198	AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt
5198	AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the d
5199	Thrombospondin type 1 domain
5201	KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif
5202	KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif
5204	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
5204	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
5204	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
5207	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
5207	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta
5208	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta
5209	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta
5210	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
5210	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta
5211	Phosphofructokinase
5212	LCCL domain
5212	von Willebrand factor type A domain
5213	Phosphofructokinase
5214	Phosphofructokinase
5217	Profilin
5222	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
5225	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
5230	Phosphoglycerate kinase
5232	Phosphoglycerate kinase
5241	Progesterone receptor
5241	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
5245	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
5250	Mitochondrial carrier protein
5250	Mitochondrial carrier protein
5252	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
5252	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
5253	jmjC domain
5253	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
5253	jmjC domain
5257	Glycosyl hydrolases family 15
5265	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
5268	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
5269	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
5270	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
5275	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
5286	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
5286	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
5287	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
5287	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
5288	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
5288	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
5289	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
5290	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
5290	Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation
5291	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
5291	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
5292	Protein kinase domain
5293	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
5293	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
5294	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
5294	Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation
5295	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
5295	SH2 domain
5295	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
5295	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
5296	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
5300	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
5300	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
5301	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
5303	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
5305	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K cata
5306	Phosphatidylinositol transfer protein
5307	Homeobox domain
5308	Homeobox domain
5308	Homeobox domain
5308	Homeobox domain
5309	Homeobox domain
5310	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
5310	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
5310	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
5310	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
5310	REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c
5311	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
5313	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
5313	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
5314	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
5314	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
5319	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
5320	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
5322	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
5325	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
5327	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
5327	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
5327	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
5328	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
5330	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
5330	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
5331	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
5331	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
5332	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
5332	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
5333	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
5335	SH2 domain
5335	PH domain. PH stands for pleckstrin homology
5335	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
5335	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
5335	SH2 domain
5335	PH domain. PH stands for pleckstrin homology
5335	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
5335	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
5336	SH2 domain
5336	PH domain. PH stands for pleckstrin homology
5336	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
5337	PX domain. PX domains bind to phosphoinositides
5338	PX domain. PX domains bind to phosphoinositides
5339	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
5340	Trypsin
5340	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
5341	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
5342	PAN domain. The PAN domain contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some
5345	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
5346	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
5347	POLO box duplicated region
5348	ATP1G1/PLM/MAT8 family
5348	ATP1G1/PLM/MAT8 family
5349	ATP1G1/PLM/MAT8 family
5349	ATP1G1/PLM/MAT8 family
5350	Phospholamban. The regulation of calcium levels across the membrane of the sarcoplasmic reticulum involves the interplay of many membrane proteins. Phospholamban is a 52 residue integral membrane protein that is involved in reversibly inhibiting the Ca(2
5354	Myelin proteolipid protein (PLP or lipophilin)
5359	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve
5360	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
5360	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
5361	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
5361	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
5361	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
5364	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
5364	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
5365	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
5368	Vertebrate endogenous opioids neuropeptide
5371	B-box zinc finger
5371	Zinc finger, C3HC4 type (RING finger)
5371	Zinc finger
5371	Zinc finger
5371	Zinc finger
5372	Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions
5373	Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions
5375	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
5376	PMP-22/EMP/MP20/Claudin family
5376	PMP-22/EMP/MP20/Claudin family
5376	PMP-22/EMP/MP20/Claudin family
5380	Deuterolysin metalloprotease (M35) family
5394	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi
5396	Homeobox domain
5396	Homeobox domain
5406	Lipase
5406	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
5407	Lipase
5407	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
5408	Lipase
5408	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
5409	NNMT/PNMT/TEMT family
5412	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
5413	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
5414	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
5414	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
5414	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
5422	DNA polymerase family B. This region of DNA polymerase B appears to consist of more than one structural domain, possibly including elongation, DNA-binding and dNTP binding activities
5427	DNA polymerase epsilon subunit B. DNA polymerase epsilon is essential for cell viability and chromosomal DNA replication in budding yeast. In addition, DNA polymerase epsilon may be involved in DNA repair and cell-cycle checkpoint control. The enzyme con
5428	DNA polymerase family A
5430	RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family
5430	RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family inclu
5431	RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase com
5433	RNA polymerase Rpb4
5434	RNA polymerase Rpb5, C-terminal domain. The assembly domain of Rpb5. The archaeal equivalent to this domain is subunit H. Subunit H lacks the N-terminal domain
5434	RNA polymerase Rpb5, N-terminal domain. Rpb5 has a bipartite structure which includes a eukaryote-specific N-terminal domain and a C-terminal domain resembling the archaeal RNAP subunit H. The N-terminal domain is involved in DNA binding and is part of t
5435	RNA polymerase Rpb6. Rpb6 is an essential subunit in the eukaryotic polymerases Pol I, II and III. The bacterial equivalent to Rpb6 is the omega subunit. Rpb6 and omega are structurally conserved and both function in polymerase assembly
5437	RNA polymerase Rpb8. Rpb8 is a subunit common to the three yeast RNA polymerases, pol I, II and III. Rpb8 interacts with the largest subunit Rpb1, and with Rpb3 and Rpb11, two smaller subunits
5439	RNA polymerases L / 13 to 16 kDa subunit
5443	Corticotropin ACTH domain
5444	Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity.
5445	Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity.
5446	Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity.
5451	Homeobox domain
5452	Homeobox domain
5457	Pou domain - N-terminal to homeobox domain
5458	Pou domain - N-terminal to homeobox domain
5459	Pou domain - N-terminal to homeobox domain
5460	Homeobox domain
5464	Inorganic pyrophosphatase
5465	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
5465	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
5467	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
5467	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
5467	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
5467	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
5468	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
5468	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
5468	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
5468	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
5468	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
5468	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
5468	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
5468	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
5471	Glutamine amidotransferases class-II
5473	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
5476	Serine carboxypeptidase
5478	Cyclophilin type peptidyl-prolyl cis-trans isomerase
5479	Cyclophilin type peptidyl-prolyl cis-trans isomerase
5480	Cyclophilin type peptidyl-prolyl cis-trans isomerase
5481	Cyclophilin type peptidyl-prolyl cis-trans isomerase
5496	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
5496	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
5506	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase,
5507	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase,
5509	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase,
5511	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
5514	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
5518	HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats
5519	HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats
5519	HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats
5524	Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is
5524	Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is
5524	Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is
5524	Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is
5524	Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is
5525	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
5526	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
5527	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
5527	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
5527	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
5527	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
5528	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
5528	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
5528	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
5529	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
5538	Palmitoyl protein thioesterase
5539	Pancreatic hormone peptide
5540	7 transmembrane receptor (rhodopsin family)
5547	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
5551	C2 domain
5551	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
5557	DNA primase small subunit. DNA primase synthesizes the RNA primers for the Okazaki fragments in lagging strand DNA synthesis. DNA primase is a heterodimer of large and small subunits
5558	Eukaryotic-type DNA primase, large subunit. DNA primase is the polymerase that synthesises small RNA primers for the Okazaki fragments made during discontinuous DNA replication. DNA primase is a heterodimer of two subunits, the small subunit Pri1 (48 kDa
5564	pfam02922, isoamylase_N, Isoamylase N-terminal domain. This domain is found in a range of enzymes that act on branched substrates - isoamylase, pullulanase and branching enzyme. This family also contains the beta subunit of 5' AMP activated kinase
5569	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
5569	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
5570	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
5570	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
5570	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
5573	Cyclic nucleotide-binding domain
5576	Cyclic nucleotide-binding domain
5578	C2 domain
5578	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
5579	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
5580	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
5581	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
5582	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
5583	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
5584	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
5585	Hr1 repeat
5586	Hr1 repeat
5587	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
5588	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
5590	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
5591	FAT domain. The FAT domain is named after FRAP, ATM and TRRAP
5591	FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal
5595	Protein kinase domain
5617	Somatotropin hormone family
5623	Transforming growth factor beta like domain
5625	Proline dehydrogenase
5626	Homeobox domain
5627	Laminin G domain
5627	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
5630	Intermediate filament protein
5641	Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this
5648	Trypsin
5648	CUB domain
5648	CUB domain
5649	Reeler domain
5649	Reeler domain
5651	Trypsin
5651	CUB domain
5651	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
5651	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
5657	Trypsin
5660	Saposin A-type domain
5662	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
5663	Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans
5663	Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans
5663	Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans
5664	Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans
5664	Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans
5689	Proteasome A-type and B-type
5690	Proteasome A-type and B-type
5691	Proteasome A-type and B-type
5692	Proteasome A-type and B-type
5697	Pancreatic hormone peptide
5698	Proteasome A-type and B-type
5698	Proteasome A-type and B-type
5713	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has
5720	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b
5720	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin
5720	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b
5720	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin
5721	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b
5721	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin
5724	7 transmembrane receptor (rhodopsin family)
5725	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
5725	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
5725	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
5727	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
5729	7 transmembrane receptor (rhodopsin family)
5730	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
5731	7 transmembrane receptor (rhodopsin family)
5732	7 transmembrane receptor (rhodopsin family)
5733	7 transmembrane receptor (rhodopsin family)
5734	7 transmembrane receptor (rhodopsin family)
5737	7 transmembrane receptor (rhodopsin family)
5739	7 transmembrane receptor (rhodopsin family)
5740	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
5741	Parathyroid hormone family
5742	Animal haem peroxidase
5742	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
5742	Animal haem peroxidase
5742	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
5743	Animal haem peroxidase
5743	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
5744	Parathyroid hormone family
5745	7 transmembrane receptor (Secretin family)
5746	7 transmembrane receptor (Secretin family)
5747	Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta
5747	Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta
5764	PTN/MK heparin-binding protein family
5768	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
5774	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
5774	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
5775	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
5775	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
5777	SH2 domain
5777	SH2 domain
5777	SH2 domain
5780	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
5780	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
5781	SH2 domain
5783	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
5783	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
5783	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
5783	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
5786	Protein-tyrosine phosphatase
5787	Fibronectin type III domain
5788	Fibronectin type III domain
5788	Fibronectin type III domain
5788	Fibronectin type III domain
5789	Fibronectin type III domain
5789	Fibronectin type III domain
5789	Fibronectin type III domain
5789	Fibronectin type III domain
5792	Fibronectin type III domain
5792	Fibronectin type III domain
5793	Eukaryotic-type carbonic anhydrase
5795	Fibronectin type III domain
5796	Fibronectin type III domain
5797	Fibronectin type III domain
5800	Fibronectin type III domain
5800	Fibronectin type III domain
5802	Fibronectin type III domain
5802	Fibronectin type III domain
5802	Fibronectin type III domain
5802	Fibronectin type III domain
5803	Fibronectin type III domain
5803	Eukaryotic-type carbonic anhydrase
5805	6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP.
5810	Repair protein Rad1/Rec1/Rad17
5810	Repair protein Rad1/Rec1/Rad17
5810	Repair protein Rad1/Rec1/Rad17
5822	Periodic tryptophan protein 2 WD repeat associated domain
5827	Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is
5829	Paxillin family
5829	LIM domain. This family represents two copies of the LIM structural domain
5831	Delta 1-pyrroline-5-carboxylate reductase
5831	Delta 1-pyrroline-5-carboxylate reductase
5834	Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses
5836	Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses
5837	Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses
5858	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
5858	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
5859	tRNA synthetases class I (E and Q), catalytic domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase aminoacyl
5861	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
5863	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
5868	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
5869	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
5873	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
5874	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
5878	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
5883	Rad9. Rad9 is required for transient cell-cycle arrests and transcriptional induction of DNA repair in response to DNA damage
5884	Rad17 cell cycle checkpoint protein
5884	Rad17 cell cycle checkpoint protein
5884	Rad17 cell cycle checkpoint protein
5884	Rad17 cell cycle checkpoint protein
5884	Rad17 cell cycle checkpoint protein
5884	Rad17 cell cycle checkpoint protein
5884	Rad17 cell cycle checkpoint protein
5884	Rad17 cell cycle checkpoint protein
5887	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
5887	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t
5888	Helix-hairpin-helix motif
5893	Rad52/22 family double-strand break repair protein
5893	Rad52/22 family double-strand break repair protein
5893	Rad52/22 family double-strand break repair protein
5893	Rad52/22 family double-strand break repair protein
5894	Protein kinase domain
5894	Raf-like Ras-binding domain
5894	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
5897	Recombination activating protein 2. V-D-J recombination is the combinatorial process by which the huge range of immunoglobulin and T cell binding specificity is generated from a limited amount of genetic material. This process is synergistically activate
5900	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
5902	RanBP1 domain
5903	RanBP1 domain
5903	Zn-finger in Ran binding protein and others
5903	Cyclophilin type peptidyl-prolyl cis-trans isomerase
5909	Rap/ran-GAP
5914	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
5914	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
5915	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
5915	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
5915	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
5915	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
5916	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
5916	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
5920	NLP/P60 family. The function of this domain is unknown. It is found in several lipoproteins
5921	SH2 domain
5921	PH domain. PH stands for pleckstrin homology
5921	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
5921	SH2 domain
5921	PH domain. PH stands for pleckstrin homology
5921	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
5922	BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region
5923	Guanine nucleotide exchange factor for Ras-like GTPases
5923	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
5924	PH domain. PH stands for pleckstrin homology
5924	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
5924	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
5925	Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted
5925	Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserve
5926	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
5926	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
5926	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
5927	jmjC domain
5927	jmjN domain
5927	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
5927	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
5927	pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function
5932	Ezrin/radixin/moesin family
5933	Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted
5933	Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserve
5934	Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted
5934	Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserve
5936	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members invol
5947	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
5948	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
5949	Interphotoreceptor retinoid-binding protein. Interphotoreceptor retinoid-binding protein (IRBP) mediates retinoid trafficking between the photoreceptors and pigmented epithelium. This Pfam family represents a repeat domain found four times in the mammali
5950	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
5956	7 transmembrane receptor (rhodopsin family)
5956	7 transmembrane receptor (rhodopsin family)
5956	C.elegans Srg family integral membrane protein
5959	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
5961	Tetraspanin family
5962	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
5962	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
5966	Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha
5970	Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha
5971	Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha
5972	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
5977	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
5979	Cadherin domain
5979	Cadherin domain
5979	Cadherin domain
5979	Cadherin domain
5981	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
5986	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
5987	B-box zinc finger
5987	B-box zinc finger
5989	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
5990	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
5990	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
5991	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
5991	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
5992	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
5993	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
5995	7 transmembrane receptor (rhodopsin family)
5996	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
5997	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
5998	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
5998	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
5999	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
6000	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
6000	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
6002	Raf-like Ras-binding domain
6002	Phosphotyrosine interaction domain (PTB/PID)
6005	Ammonium Transporter Family
6006	Ammonium Transporter Family
6006	Ammonium Transporter Family
6006	Ammonium Transporter Family
6006	Ammonium Transporter Family
6006	Ammonium Transporter Family
6007	Ammonium Transporter Family
6007	Ammonium Transporter Family
6010	7 transmembrane receptor (rhodopsin family)
6011	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
6013	Insulin/IGF/Relaxin family. Superfamily includes insulins
6017	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
6017	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
6019	Insulin/IGF/Relaxin family. Superfamily includes insulins
6019	Insulin/IGF/Relaxin family. Superfamily includes insulins
6041	Protein kinase domain
6050	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
6051	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
6091	Fibronectin type III domain
6091	Fibronectin type III domain
6092	Fibronectin type III domain
6092	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
6094	Tetraspanin family
6095	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
6095	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
6095	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
6095	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
6095	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
6095	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
6095	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
6095	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
6096	metallopeptidase family M24
6096	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
6096	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
6097	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
6097	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
6098	Fibronectin type III domain
6117	Replication factor-A protein 1, N-terminal domain
6121	Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc
6122	Ribosomal protein L3
6123	Ribosomal protein L3
6124	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
6125	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
6128	Ribosomal protein L6e
6128	Ribosomal protein L6, N-terminal domain
6129	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
6130	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
6132	Ribosomal Proteins L2
6132	Ribosomal Proteins L2
6134	Ribosomal L10
6137	Ribosomal protein L13e
6137	Ribosomal protein L13e
6138	Ribosomal L15
6139	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
6141	Eukaryotic ribosomal protein L18
6142	Ribosomal L18ae protein family
6143	Ribosomal protein L19e
6144	Ribosomal protein L21e
6146	Ribosomal L22e protein family
6150	Ribosomal protein L23
6152	Ribosomal protein L24e
6155	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
6158	Ribosomal L28e protein family
6159	Ribosomal L29e protein family
6160	Ribosomal protein L31e
6161	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
6164	Ribosomal protein L34e
6164	Ribosomal protein L34e
6165	Ribosomal protein L35Ae
6166	Ribosomal protein L44
6167	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
6168	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
6169	Ribosomal L38e protein family
6173	Ribosomal protein L44
6175	Ribosomal protein L10
6175	Ribosomal protein L10
6183	Ribosomal protein S12
6183	Ribosomal protein S12
6183	Ribosomal protein S12
6189	Ribosomal S3Ae family
6189	Ribosomal S3Ae family
6193	Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes
6194	Ribosomal protein S6e
6201	Ribosomal protein S7e
6202	Ribosomal protein S8e
6204	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
6205	Ribosomal protein S17
6206	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
6207	Ribosomal protein S15
6210	Ribosomal protein S8
6217	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
6218	Ribosomal S17
6222	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
6223	Ribosomal protein S19e
6224	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
6227	Ribosomal protein S21e
6230	S25 ribosomal protein
6231	Ribosomal protein S26e
6232	Ribosomal protein S27
6233	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
6233	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis o
6234	Ribosomal protein S28e
6240	Ribonucleotide reductase, barrel domain
6241	Ribonucleotide reductase, small chain
6247	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
6249	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
6252	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
6253	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
6256	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
6256	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
6257	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
6257	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
6258	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
6258	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
6261	MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function
6261	RyR domain. This domain is called RyR for Ryanodine receptor. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown
6261	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
6261	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
6262	MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function
6262	RyR domain. This domain is called RyR for Ryanodine receptor. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown
6262	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
6262	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
6263	MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function
6263	RyR domain. This domain is called RyR for Ryanodine receptor. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown
6263	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
6263	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
6271	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6273	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6274	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6275	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6275	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6276	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6277	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6278	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6279	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6280	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6281	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6282	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6283	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6284	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6285	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6286	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
6288	Serum amyloid A protein
6289	Serum amyloid A protein
6291	Serum amyloid A protein
6293	Vps52 / Sac2 family. Vps52 complexes with Vps53 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events
6293	Vps52 / Sac2 family. Vps52 complexes with Vps53 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events
6294	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
6295	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
6295	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
6299	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
6302	Tetraspanin family
6303	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
6304	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
6307	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues.
6309	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues.
6320	Lectin C-type domain. This family includes both long and short form C-type
6322	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
6323	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
6324	Immunoglobulin domain
6326	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
6328	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
6329	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
6331	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
6332	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
6334	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
6335	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
6336	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
6337	Amiloride-sensitive sodium channel
6338	Amiloride-sensitive sodium channel
6339	Amiloride-sensitive sodium channel
6340	Amiloride-sensitive sodium channel
6342	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
6343	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
6344	7 transmembrane receptor (Secretin family)
6344	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
6346	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
6347	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
6348	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
6349	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
6351	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
6354	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
6355	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
6369	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
6372	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
6373	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
6374	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
6382	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
6383	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
6385	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
6387	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
6391	Succinate dehydrogenase cytochrome b subunit
6397	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
6397	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
6401	Lectin C-type domain. This family includes both long and short form C-type
6402	Lectin C-type domain. This family includes both long and short form C-type
6403	Sushi domain (SCR repeat)
6403	Lectin C-type domain. This family includes both long and short form C-type
6405	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
6418	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
6419	Transposase. This family includes the mariner transposase
6419	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
6423	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
6423	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
6424	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
6426	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
6435	Lectin C-type domain. This family includes both long and short form C-type
6436	Lectin C-type domain. This family includes both long and short form C-type
6439	Surfactant protein B
6439	Saposin A-type domain
6440	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
6441	Lectin C-type domain. This family includes both long and short form C-type
6441	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
6448	Sulfatase
6462	Laminin G domain
6464	SH2 domain
6464	Phosphotyrosine interaction domain (PTB/PID)
6469	Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module
6469	Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation
6470	Serine hydroxymethyltransferase
6470	Serine hydroxymethyltransferase
6472	Serine hydroxymethyltransferase
6473	Homeobox domain
6473	Homeobox domain
6474	Homeobox domain
6474	Homeobox domain
6476	Trefoil (P-type) domain
6476	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
6477	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result
6478	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result
6480	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6480	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6480	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6482	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6482	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6483	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6484	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6487	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6487	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6487	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6487	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6487	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6487	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6487	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6487	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6489	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
6494	Rap/ran-GAP
6494	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
6494	Rap/ran-GAP
6494	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
6497	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
6498	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
6505	Sodium:dicarboxylate symporter family
6506	Sodium:dicarboxylate symporter family
6507	Sodium:dicarboxylate symporter family
6508	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
6509	Sodium:dicarboxylate symporter family
6510	Sodium:dicarboxylate symporter family
6511	Sodium:dicarboxylate symporter family
6512	Sodium:dicarboxylate symporter family
6513	Sugar (and other) transporter
6514	Sugar (and other) transporter
6515	Sugar (and other) transporter
6517	Sugar (and other) transporter
6518	Sugar (and other) transporter
6521	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
6522	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
6523	Sodium:solute symporter family
6524	Sodium:solute symporter family
6525	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
6525	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
6525	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
6526	Sodium:solute symporter family
6527	Sodium:solute symporter family
6529	Sodium:neurotransmitter symporter family
6530	Sodium:neurotransmitter symporter family
6531	Sodium:neurotransmitter symporter family
6532	Sodium:neurotransmitter symporter family
6532	Serotonin (5-HT) neurotransmitter transporter, N-terminus
6533	Sodium:neurotransmitter symporter family
6534	Sodium:neurotransmitter symporter family
6535	Sodium:neurotransmitter symporter family
6536	Sodium:neurotransmitter symporter family
6538	Sodium:neurotransmitter symporter family
6539	Sodium:neurotransmitter symporter family
6540	Sodium:neurotransmitter symporter family
6543	Calx-beta domain
6543	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells
6546	Calx-beta domain
6546	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
6547	Calx-beta domain
6547	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
6547	Calx-beta domain
6547	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
6548	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
6549	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
6550	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
6553	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
6554	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f
6555	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f
6556	Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined
6560	K-Cl Co-transporter type 1 (KCC1)
6561	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
6563	Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue
6564	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
6565	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
6569	Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor
6573	Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog
6574	Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor
6575	Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor
6576	Mitochondrial carrier protein
6578	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
6578	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
6579	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
6579	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
6579	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
6579	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
6579	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
6579	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
6580	Sugar (and other) transporter
6580	Sugar (and other) transporter
6581	Sugar (and other) transporter
6582	Sugar (and other) transporter
6582	Sugar (and other) transporter
6583	Sugar (and other) transporter
6584	Sugar (and other) transporter
6585	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
6590	WAP-type (Whey Acidic Protein) 'four-disulfide core'
6594	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
6594	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
6595	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
6595	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
6596	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
6596	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
6597	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
6603	BAF60b domain of the SWIB complex
6604	BAF60b domain of the SWIB complex
6605	HMG (high mobility group) box
6608	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
6609	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte
6610	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
6611	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
6620	Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid
6622	Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid
6622	Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid
6623	Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid
6626	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
6629	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
6632	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
6633	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
6633	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
6634	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
6637	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
6640	PH domain. PH stands for pleckstrin homology
6641	PH domain. PH stands for pleckstrin homology
6641	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
6642	PX domain. PX domains bind to phosphoinositides
6642	Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2
6642	PX domain. PX domains bind to phosphoinositides
6642	PX domain. PX domains bind to phosphoinositides
6642	Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2
6643	PX domain. PX domains bind to phosphoinositides
6643	Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2
6645	PH domain. PH stands for pleckstrin homology
6647	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
6648	Iron/manganese superoxide dismutases, C-terminal domain. superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the Mn/Fe-binding family is one.
6649	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
6651	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
6651	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
6651	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
6653	Fibronectin type III domain
6653	Low-density lipoprotein receptor domain class A
6655	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
6656	HMG (high mobility group) box
6657	HMG (high mobility group) box
6657	HMG (high mobility group) box
6658	HMG (high mobility group) box
6659	HMG (high mobility group) box
6660	HMG (high mobility group) box
6660	HMG (high mobility group) box
6660	HMG (high mobility group) box
6662	HMG (high mobility group) box
6663	HMG (high mobility group) box
6664	HMG (high mobility group) box
6665	HMG (high mobility group) box
6666	HMG (high mobility group) box
6667	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
6668	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
6668	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
6670	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
6671	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
6672	HMG (high mobility group) box
6672	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
6672	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
6675	UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate uri
6677	Hyaluronidase
6677	Hyaluronidase
6678	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand
6683	MIT domain
6688	Ets-domain
6690	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand
6691	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand
6694	Cystatin domain
6695	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
6696	Osteopontin
6697	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
6699	Cornifin (SPRR) family
6707	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
6708	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
6708	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
6709	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
6709	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
6710	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
6711	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
6711	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
6712	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
6714	SH2 domain
6714	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
6715	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
6716	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
6718	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
6720	Helix-loop-helix DNA-binding domain
6721	Helix-loop-helix DNA-binding domain
6722	SRF-type transcription factor (DNA-binding and dimerisation domain)
6723	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
6727	Signal recognition particle 14kD protein. The signal recognition particle (SRP) is a multimeric protein involved in targeting secretory proteins to the rough endoplasmic reticulum membrane. SRP14 and SRP9 form a complex essential for SRP RNA binding
6728	SRP19 protein. The signal recognition particle (SRP) binds to the signal peptide of proteins as they are being translated. The binding of the SRP halts translation and the complex is then transported to the endoplasmic reticulum's cytoplasmic surface. Th
6734	Signal recognition particle, alpha subunit, N-terminal. SRP is a complex of six distinct polypeptides and a 7S RNA that is essential for transferring nascent polypeptide chains that are destined for export from the cell to the translocation apparatus of
6736	HMG (high mobility group) box
6737	B-box zinc finger
6737	Zinc finger, C3HC4 type (RING finger)
6742	Single-strand binding protein family
6745	Translocon-associated protein (TRAP), alpha subunit. The alpha-subunit of the TRAP complex (TRAP alpha) is a single-spanning membrane protein of the endoplasmic reticulum (ER) which is found in proximity of nascent polypeptide chains translocating across
6749	HMG (high mobility group) box
6749	Structure-specific recognition protein
6750	Somatostatin/Cortistatin family. Members of this family are hormones. Somatostatin inhibits the release of somatotropin. Cortistatin is a peptide that is related to the Somatostatins that is found to depresses neuronal electrical activity but, unlike som
6751	7 transmembrane receptor (rhodopsin family)
6752	7 transmembrane receptor (rhodopsin family)
6753	7 transmembrane receptor (rhodopsin family)
6754	7 transmembrane receptor (rhodopsin family)
6755	7 transmembrane receptor (rhodopsin family)
6764	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
6764	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
6764	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
6764	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
6764	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
6764	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
6768	CUB domain
6769	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
6770	START domain
6772	SH2 domain
6772	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
6772	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
6772	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
6772	SH2 domain
6772	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
6772	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
6772	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
6773	SH2 domain
6773	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
6773	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
6773	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
6774	SH2 domain
6774	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
6774	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
6774	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
6774	SH2 domain
6774	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
6774	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
6774	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
6775	SH2 domain
6775	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
6775	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
6775	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
6776	SH2 domain
6776	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
6776	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
6777	SH2 domain
6777	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
6777	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
6778	SH2 domain
6778	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
6778	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
6778	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
6779	Statherin. Statherin functions biologically to inhibit the nucleation and growth of calcium phosphate minerals. The N-terminus of statherin is highly charge, the glutamic acids of which have been shown to be important in the recognition hydroxyapatite
6780	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
6780	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
6780	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
6780	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
6781	Stanniocalcin family
6782	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
6783	Sulfotransferase protein
6785	GNS1/SUR4 family
6799	Sulfotransferase protein
6799	Sulfotransferase protein
6804	Syntaxin
6809	Syntaxin
6812	Sec1 family
6813	Sec1 family
6814	Sec1 family
6817	Sulfotransferase protein
6817	Sulfotransferase protein
6817	Sulfotransferase protein
6817	Sulfotransferase protein
6817	Sulfotransferase protein
6818	Sulfotransferase protein
6818	Sulfotransferase protein
6819	Sulfotransferase protein
6819	Sulfotransferase protein
6820	Sulfotransferase proteins
6820	Sulfotransferase protein
6821	Mo-co oxidoreductase dimerisation domain. This domain is found in molybdopterin cofactor (Mo-co) oxidoreductases. It is involved in dimer formation, and has an Ig-fold structure
6821	Oxidoreductase molybdopterin binding domain. This domain is found in a variety of oxidoreductases. This domain binds to a molybdopterin cofactor. Xanthine dehydrogenases, that also bind molybdopterin, have essentially no similarity
6822	Sulfotransferase protein
6829	Supt5 repeat. This short region of similarity is found in two tandem copies in Supt5 proteins that are involved in chromatin regulation. The function of this region is unknown
6834	SURF1 family
6836	SURF4 family
6839	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
6840	Villin headpiece domain
6840	Villin headpiece domain
6843	Synaptobrevin
6843	Synaptobrevin
6844	Synaptobrevin
6845	Synaptobrevin
6850	SH2 domain
6853	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
6853	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
6854	N-terminal domain
6854	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
6854	N-terminal domain
6854	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
6855	Synaptophysin / synaptoporin
6856	Synaptophysin / synaptoporin
6856	Synaptophysin / synaptoporin
6857	C2 domain
6861	C2 domain
6862	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
6865	7 transmembrane receptor (rhodopsin family)
6866	Neurokinin B
6868	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
6868	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
6869	7 transmembrane receptor (rhodopsin family)
6869	7 transmembrane receptor (rhodopsin family)
6870	7 transmembrane receptor (rhodopsin family)
6880	Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tra
6883	Transcription initiation factor TFIID subunit A
6884	Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histon
6885	Protein kinase domain
6885	Protein kinase domain
6885	Protein kinase domain
6885	Protein kinase domain
6886	Helix-loop-helix DNA-binding domain
6888	Transaldolase
6895	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
6895	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
6900	Fibronectin type III domain
6901	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
6901	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
6901	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
6902	Tubulin binding cofactor A
6905	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
6906	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
6915	7 transmembrane receptor (rhodopsin family)
6916	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
6916	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
6920	Transcription factor S-II (TFIIS)
6929	Helix-loop-helix DNA-binding domain
6932	HMG (high mobility group) box
6934	HMG (high mobility group) box
6935	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
6938	Helix-loop-helix DNA-binding domain
6941	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
6943	Helix-loop-helix DNA-binding domain
6944	Sodium transport protein
6945	Helix-loop-helix DNA-binding domain
6945	Helix-loop-helix DNA-binding domain
6945	Helix-loop-helix DNA-binding domain
6947	Eukaryotic cobalamin-binding protein
6948	Eukaryotic cobalamin-binding protein
6949	Treacher Collins syndrome protein Treacle
6950	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
6990	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
6991	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
6993	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
6996	Uracil DNA glycosylase superfamily
6999	Tryptophan 2,3-dioxygenase
7003	TEA/ATTS domain family
7004	TEA/ATTS domain family
7005	TEA/ATTS domain family
7006	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
7007	Zona pellucida-like domain
7007	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
7009	Uncharacterized protein family UPF0005
7010	Fibronectin type III domain
7010	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
7013	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
7013	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
7014	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
7016	Protein kinase domain
7018	Transferrin
7019	HMG (high mobility group) box
7020	Transcription factor AP-2
7021	Transcription factor AP-2
7022	Transcription factor AP-2
7023	Helix-loop-helix DNA-binding domain
7025	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
7025	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
7026	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
7026	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
7027	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
7029	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
7030	Helix-loop-helix DNA-binding domain
7031	Trefoil (P-type) domain
7032	Trefoil (P-type) domain
7033	Trefoil (P-type) domain
7036	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
7037	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
7038	Carboxylesterase
7038	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
7040	Transforming growth factor beta like domain
7040	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
7041	Paxillin family
7041	LIM domain. This family represents two copies of the LIM structural domain
7042	Transforming growth factor beta like domain
7042	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
7043	Transforming growth factor beta like domain
7043	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
7044	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
7045	Fasciclin domain. This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria
7046	Protein kinase domain
7046	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
7047	Transglutaminase family
7047	Transglutaminase family, C-terminal ig like domain
7047	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
7048	Protein kinase domain
7049	Zona pellucida-like domain
7051	Transglutaminase family
7051	Transglutaminase family, C-terminal ig like domain
7051	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
7052	Transglutaminase family
7052	Transglutaminase family, C-terminal ig like domain
7052	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
7053	Transglutaminase family
7053	Transglutaminase family, C-terminal ig like domain
7053	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
7054	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
7057	Thrombospondin N-terminal -like domain
7057	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
7058	Thrombospondin type 1 domain
7058	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
7060	Thrombospondin N-terminal -like domain
7062	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
7066	Erythropoietin/thrombopoietin
7067	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
7068	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
7068	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
7074	Raf-like Ras-binding domain
7075	Fibronectin type III domain
7076	Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species
7077	Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species
7079	Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species
7080	Homeobox domain
7082	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
7082	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
7083	Thymidine kinase
7084	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
7088	Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation
7089	Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation
7090	Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation
7092	CUB domain
7092	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
7093	CUB domain
7093	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
7094	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
7095	Translocation protein Sec62
7096	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
7097	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
7098	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
7099	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
7099	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
7099	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
7099	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
7100	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
7101	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
7101	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
7102	Tetraspanin family
7103	Tetraspanin family
7105	Tetraspanin family
7106	Tetraspanin family
7108	Ergosterol biosynthesis ERG4/ERG24 family
7111	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
7112	LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1
7113	Trypsin
7122	PMP-22/EMP/MP20/Claudin family
7124	TNF(Tumor Necrosis Factor) family
7126	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
7128	A20-like zinc finger. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation
7130	CUB domain
7130	Extracellular link domain
7132	TNFR/NGFR cysteine-rich region
7133	TNFR/NGFR cysteine-rich region
7135	Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom
7136	Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom
7137	Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom
7142	Nuclear transition protein 2
7143	Fibronectin type III domain
7148	Fibronectin type III domain
7148	Fibrinogen beta and gamma chains, C-terminal globular domain
7148	Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up in
7148	Fibronectin type III domain
7148	Fibrinogen beta and gamma chains, C-terminal globular domain
7156	Topoisomerase DNA binding C4 zinc finger
7157	P53
7159	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
7161	P53
7163	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction
7164	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction
7165	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction
7166	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
7167	Triosephosphate isomerase
7168	Tropomyosin
7169	Tropomyosin
7170	Tropomyosin
7170	Tropomyosin
7171	Tropomyosin
7173	Animal haem peroxidase
7173	Animal haem peroxidase
7173	Animal haem peroxidase
7173	Animal haem peroxidase
7173	Animal haem peroxidase
7174	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
7176	Trypsin
7178	Translationally controlled tumor protein
7181	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
7181	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
7182	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
7182	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
7184	Hsp90 protein
7185	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
7186	TRAF-type zinc finger
7186	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
7186	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
7187	TRAF-type zinc finger
7187	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
7187	TRAF-type zinc finger
7187	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
7187	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
7188	TRAF-type zinc finger
7188	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
7188	TRAF-type zinc finger
7188	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
7189	TRAF-type zinc finger
7189	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
7189	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
7189	TRAF-type zinc finger
7189	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
7189	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
7201	7 transmembrane receptor (rhodopsin family)
7203	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
7204	PH domain. PH stands for pleckstrin homology
7216	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
7216	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
7216	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
7216	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
7216	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
7220	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
7222	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
7223	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
7224	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
7225	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
7226	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
7227	GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the d
7247	Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen
7249	Tuberin
7249	Rap/ran-GAP
7249	Tuberin
7249	Rap/ran-GAP
7249	Tuberin
7249	Rap/ran-GAP
7253	7 transmembrane receptor (rhodopsin family)
7257	Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen
7258	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
7259	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
7274	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
7275	Tub family
7275	Tub family
7276	Transthyretin precursor (formerly prealbumin). Transthyretin is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. Mutations in the human transthyretin are associated with several genetic disorders
7287	Tub family
7288	Tub family
7289	Tub family
7291	Helix-loop-helix DNA-binding domain
7293	TNFR/NGFR cysteine-rich region
7294	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
7295	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
7298	Thymidylate synthase
7299	Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372
7301	Fibronectin type III domain
7306	Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372
7311	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
7311	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
7314	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
7316	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
7316	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
7317	Repeat in ubiquitin-activating (UBA) protein
7317	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
7317	Repeat in ubiquitin-activating (UBA) protein
7317	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
7318	Repeat in ubiquitin-activating (UBA) protein
7318	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
7328	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
7335	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
7335	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
7336	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
7343	HMG (high mobility group) box
7345	Ubiquitin carboxyl-terminal hydrolase, family 1
7347	Ubiquitin carboxyl-terminal hydrolase, family 1
7348	Tetraspanin family
7350	Mitochondrial carrier protein
7351	Mitochondrial carrier protein
7352	Mitochondrial carrier protein
7352	Mitochondrial carrier protein
7353	Ubiquitin fusion degradation protein UFD1. Post-translational ubiquitin-protein conjugates are recognized for degradation by the ubiquitin fusion degradation (UFD) pathway. Several proteins involved in this pathway have been identified. This family inclu
7355	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
7356	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
7358	UDP-glucose/GDP-mannose dehydrogenase family, UDP binding domain. The UDP-glucose/GDP-mannose dehydrogenaseses are a small group of enzymes which possesses the ability to catalyse the NAD-dependent 2-fold oxidation of an alcohol to an acid without the re
7358	UDP-glucose/GDP-mannose dehydrogenase family, central domain. The UDP-glucose/GDP-mannose dehydrogenaseses are a small group of enzymes which possesses the ability to catalyse the NAD-dependent 2-fold oxidation of an alcohol to an acid without the releas
7360	UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate uri
7363	UDP-glucoronosyl and UDP-glucosyl transferase
7364	UDP-glucoronosyl and UDP-glucosyl transferase
7365	UDP-glucoronosyl and UDP-glucosyl transferase
7366	UDP-glucoronosyl and UDP-glucosyl transferase
7367	UDP-glucoronosyl and UDP-glucosyl transferase
7368	UDP-glucoronosyl and UDP-glucosyl transferase
7369	Zona pellucida-like domain
7372	Orotidine 5'-phosphate decarboxylase / HUMPS family. This family includes Orotidine 5'-phosphate decarboxylase enzymes EC:4.1.1.23 that are involved in the final step of pyrimidine biosynthesis. The family also includes enzymes such as hexulose-6-phospha
7373	Fibronectin type III domain
7373	von Willebrand factor type A domain
7376	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
7376	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
7381	Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly i
7388	Ubiquinol-cytochrome C reductase hinge protein. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 'hinge' protein of the complex which is thought to mediate formati
7389	Uroporphyrinogen decarboxylase (URO-D)
7391	Helix-loop-helix DNA-binding domain
7392	Helix-loop-helix DNA-binding domain
7399	Fibronectin type III domain
7399	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
7402	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
7403	jmjC domain
7404	jmjC domain
7404	jmjC domain
7407	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
7408	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
7409	PH domain. PH stands for pleckstrin homology
7409	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
7409	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
7409	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
7410	PH domain. PH stands for pleckstrin homology
7410	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
7410	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
7411	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
7414	Vinculin family
7414	Vinculin family
7415	Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist o
7416	Eukaryotic porin
7417	Eukaryotic porin
7419	Eukaryotic porin
7421	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
7421	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
7428	von Hippel-Lindau disease tumor suppressor protein. VHL forms a ternary complex with the elonginB and elonginC proteins. This complex binds Cul2, which then is involved in regulation of vascular endothelial growth factor mRNA
7430	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
7430	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
7431	Intermediate filament proteins
7432	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
7433	7 transmembrane receptor (Secretin family)
7434	7 transmembrane receptor (Secretin family)
7436	Low-density lipoprotein receptor domain class A
7436	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
7439	Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown
7442	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
7442	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
7442	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
7442	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
7442	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
7442	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
7442	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
7442	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
7443	Protein kinase domain
7444	Protein kinase domain
7448	Somatomedin B domain
7448	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
7450	von Willebrand factor type A domain
7450	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
7450	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
7453	WHEP-TRS domain
7454	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
7454	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
7455	von Willebrand factor type D domain
7455	MAM domain. An extracellular domain found in many receptors
7455	TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr
7455	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
7455	von Willebrand factor type D domain
7455	MAM domain. An extracellular domain found in many receptors
7455	TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr
7455	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
7455	von Willebrand factor type D domain
7455	MAM domain. An extracellular domain found in many receptors
7455	TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr
7455	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
7455	von Willebrand factor type D domain
7455	MAM domain. An extracellular domain found in many receptors
7455	TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr
7455	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
7455	von Willebrand factor type D domain
7455	MAM domain. An extracellular domain found in many receptors
7455	TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr
7455	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
7455	von Willebrand factor type D domain
7455	MAM domain. An extracellular domain found in many receptors
7455	TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr
7455	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
7461	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
7461	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
7465	Protein kinase domain
7468	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
7468	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
7468	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
7468	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
7468	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
7468	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
7468	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
7468	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
7468	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
7471	wnt family
7472	wnt family
7473	wnt family
7474	wnt family
7475	wnt family
7477	wnt family
7480	wnt family
7482	wnt family
7482	wnt family
7483	wnt family
7484	wnt family
7486	HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease
7486	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi
7490	Wilm's tumour protein
7490	Wilm's tumour protein
7490	Wilm's tumour protein
7490	Wilm's tumour protein
7507	XPA protein
7508	DNA repair protein Rad4
7515	XRCC1 N terminal domain
7515	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
7515	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
7525	SH2 domain
7525	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
7529	14-3-3 protein
7529	14-3-3 protein
7531	14-3-3 protein
7532	14-3-3 protein
7533	14-3-3 protein
7534	14-3-3 protein
7534	14-3-3 protein
7535	SH2 domain
7535	SH2 domain
7535	Protein kinase domain
7541	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
7551	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7551	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7554	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
7554	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7554	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
7561	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7565	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7566	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
7566	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7566	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
7586	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
7586	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7586	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
7587	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7597	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
7626	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7638	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7643	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
7673	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7694	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7695	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7703	Zinc finger, C3HC4 type (RING finger)
7704	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
7707	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
7709	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
7726	B-box zinc finger
7727	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
7733	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7743	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7745	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7752	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
7760	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
7760	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7760	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
7761	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7762	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
7763	AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i
7781	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
7783	Zona pellucida-like domain
7784	Zona pellucida-like domain
7786	Protein kinase domain
7791	LIM domain. This family represents two copies of the LIM structural domain
7799	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
7799	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
7804	Low-density lipoprotein receptor domain class A
7804	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
7804	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
7804	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
7805	Golgi 4-transmembrane spanning transporter
7812	'Cold-shock' DNA-binding domain
7827	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
7832	BTG1 family. A novel family of anti-proliferative proteins
7837	Animal haem peroxidase
7837	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
7837	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
7837	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
7841	Mannosyl oligosaccharide glucosidase. This is a family of eukaryotic enzymes belonging to glycosyl hydrolase family 63. They catalyse the specific cleavage of the non-reducing terminal glucose residue from Glc(3)Man(9)GlcNAc(2). Mannosyl oligosaccharide
7849	'Paired box' domain
7849	'Paired box' domain
7849	'Paired box' domain
7849	'Paired box' domain
7849	'Paired box' domain
7852	7 transmembrane receptor (rhodopsin family)
7855	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
7857	Granin (chromogranin or secretogranin)
7862	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
7869	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
7879	Ras family. Includes sub-families Ras
7879	ADP-ribosylation factor family. Pfam combines a number of different Prosite families together
7903	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
7903	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
7904	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
7905	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
7917	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
7917	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
7917	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
7920	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
7923	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
7932	7 transmembrane receptor (rhodopsin family)
7941	Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl
7942	Helix-loop-helix DNA-binding domain
7957	Starch binding domain
7975	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
7976	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
7978	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
7982	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
7982	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
7988	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
7988	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
7993	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
7994	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
8000	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
8001	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
8001	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
8013	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
8013	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
8013	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
8013	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
8013	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
8013	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
8013	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
8022	Homeobox domain
8022	LIM domain. This family represents two copies of the LIM structural domain
8022	Homeobox domain
8022	LIM domain. This family represents two copies of the LIM structural domain
8027	VHS domain. Domain present in VPS-27, Hrs and STAM
8029	CUB domain
8029	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
8034	Mitochondrial carrier protein
8038	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
8038	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
8038	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
8038	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
8048	LIM domain. This family represents two copies of the LIM structural domain
8050	2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain
8065	Cullin family
8085	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
8085	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
8087	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
8089	YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity
8092	Homeobox domain
8111	7 transmembrane receptor (rhodopsin family)
8115	TCL1/MTCP1 family. Two related oncogenes, TCL-1 and MTCP-1, are overexpressed in T cell prolymphocytic leukemias as a result of chromosomal rearrangements that involve the translocation of one T cell receptor gene to either chromosome 14q32 or Xq28
8120	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
8128	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
8131	Uncharacterised protein family (UPF0171)
8139	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
8165	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
8170	Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue
8178	Occludin/ELL family
8193	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
8195	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
8195	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
8200	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
8216	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
8216	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
8218	Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain
8218	Clathrin propeller repeat. Clathrin is the scaffold protein of the basket-like coat that surrounds coated vesicles. The soluble assembly unit, a triskelion, contains three heavy chains and three light chains in an extended three-legged structure. Each le
8218	Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain
8218	Clathrin propeller repeat. Clathrin is the scaffold protein of the basket-like coat that surrounds coated vesicles. The soluble assembly unit, a triskelion, contains three heavy chains and three light chains in an extended three-legged structure. Each le
8224	N-terminal domain
8224	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
8224	N-terminal domain
8224	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
8224	N-terminal domain
8224	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
8228	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of
8233	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
8242	jmjC domain
8242	jmjN domain
8242	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
8242	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
8242	pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function
8260	Acetyltransferase (GNAT) family
8266	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
8273	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f
8284	jmjC domain
8284	jmjN domain
8284	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
8284	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
8284	pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function
8288	Animal haem peroxidase
8289	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
8289	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
8289	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
8291	C2 domain
8301	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
8309	Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA
8310	Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA
8312	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The
8312	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The
8314	Ubiquitin carboxyl-terminal hydrolase, family 1
8315	Zn-finger in ubiquitin-hydrolases and other protein
8321	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
8322	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
8323	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
8324	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
8325	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
8326	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
8327	Sterile alpha motif (SAM)/Pointed domain
8337	Core histone H2A/H2B/H3/H4
8372	Hyaluronidase
8383	7 transmembrane receptor (rhodopsin family)
8385	7 transmembrane receptor (rhodopsin family)
8386	7 transmembrane receptor (rhodopsin family)
8387	7 transmembrane receptor (rhodopsin family)
8388	7 transmembrane receptor (rhodopsin family)
8390	7 transmembrane receptor (rhodopsin family)
8392	7 transmembrane receptor (rhodopsin family)
8395	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K cata
8398	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o
8399	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
8402	Mitochondrial carrier protein
8402	Mitochondrial carrier protein
8403	HMG (high mobility group) box
8404	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand
8405	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
8405	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
8406	HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unc
8409	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
8409	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
8411	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
8416	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
8419	Intermediate filament protein
8424	Gamma-butyrobetaine hydroxylase. Members of this family are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1
8425	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
8427	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
8427	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
8431	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
8437	C2 domain
8439	Beige/BEACH domain
8440	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
8446	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
8447	C2 domain
8448	C2 domain
8450	Cullin family
8451	Cullin family
8452	Cullin family
8453	Cullin family
8454	Cullin family
8455	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
8455	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
8459	Sulfotransferase protein
8460	Sulfotransferase protein
8463	TEA/ATTS domain family
8464	Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histon
8467	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
8470	Sorbin homologous domain
8470	Sorbin homologous domain
8471	PTB domain (IRS-1 type)
8476	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
8476	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
8476	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
8476	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
8477	7 transmembrane receptor (rhodopsin family)
8484	7 transmembrane receptor (rhodopsin family)
8491	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
8495	pfam02920, integrase_DNA, DNA binding domain of tn916 integrase
8495	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
8497	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
8503	SH2 domain
8506	Fibrinogen beta and gamma chains, C-terminal globular domain
8506	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
8507	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
8507	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
8509	Sulfotransferase protein
8510	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
8511	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
8513	ab-hydrolase associated lipase region
8515	von Willebrand factor type A domain
8516	Integrin alpha cytoplasmic region. This family contains the short intracellular region of integrin alpha chains
8516	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
8521	GCM motif protein
8526	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
8526	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
8527	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
8527	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
8528	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
8528	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
8529	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
8534	Sulfotransferase protein
8535	'chromo' (CHRromatin Organization MOdifier) domain
8538	Homeobox domain
8541	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
8543	LIM domain. This family represents two copies of the LIM structural domain
8546	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
8549	7 transmembrane receptor (rhodopsin family)
8553	Helix-loop-helix DNA-binding domain
8554	pfam02891, zf-MIZ, MIZ zinc finger
8554	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
8559	Prp18 domain. The splicing factor Prp18 is required for the second step of pre-mRNA splicing. The structure of a large fragment of the Saccharomyces cerevisiae Prp18 is known. This fragment is fully active in yeast splicing in vitro and includes the sequ
8560	Fatty acid desaturase
8560	Fatty acid desaturase
8564	Monooxygenase. This family includes diverse enzymes that utilise FAD
8565	tRNA synthetases class I (W and Y)
8565	Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte act
8567	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
8567	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
8567	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
8567	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
8567	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
8567	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
8567	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
8567	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
8567	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
8567	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
8567	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
8567	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
8567	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
8567	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
8567	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
8567	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
8567	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
8567	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
8567	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
8567	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
8567	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
8567	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
8567	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
8567	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
8572	LIM domain. This family represents two copies of the LIM structural domain
8573	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
8573	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
8575	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
8576	Protein kinase domain
8581	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
8590	7 transmembrane receptor (rhodopsin family)
8601	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
8602	Nop14-like family. Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production
8603	Beta type Zein
8604	Mitochondrial carrier protein
8605	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lyso
8608	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
8611	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
8611	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
8612	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
8612	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
8612	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
8613	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
8613	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
8614	Stanniocalcin family
8622	3'5'-cyclic nucleotide phosphodiesterase
8623	O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine
8626	P53
8629	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d
8629	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
8630	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
8632	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
8635	Ribonuclease T2 family
8638	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
8639	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
8639	Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
8639	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog
8642	Cadherin domain
8643	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
8644	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
8645	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
8646	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
8646	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
8650	Phosphotyrosine interaction domain (PTB/PID)
8654	3'5'-cyclic nucleotide phosphodiesterase
8654	3'5'-cyclic nucleotide phosphodiesterase
8654	3'5'-cyclic nucleotide phosphodiesterase
8654	3'5'-cyclic nucleotide phosphodiesterase
8655	Dynein light chain type 1
8658	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
8659	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
8659	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
8660	PTB domain (IRS-1 type)
8660	PTB domain (IRS-1 type)
8660	PH domain. PH stands for pleckstrin homology
8662	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
8662	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
8665	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has
8671	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
8672	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA
8673	Synaptobrevin
8674	Synaptobrevin
8678	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either asso
8681	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lyso
8682	Death effector domain
8683	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
8687	Intermediate filament protein
8688	Intermediate filament protein
8689	Intermediate filament protein
8690	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
8692	Hyaluronidase
8692	Hyaluronidase
8693	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
8697	Anaphase promoting complex subunit 8 / cdc23. The anaphase-promoting complex is composed of eight protein subunits, including BimE (APC1), CDC27 (APC3), CDC16 (APC6), and CDC23 (APC8)
8698	7 transmembrane receptor (rhodopsin family)
8701	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
8702	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
8703	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
8704	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
8705	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
8705	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
8705	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
8706	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
8706	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
8706	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
8706	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
8707	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
8708	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
8710	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
8714	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic
8718	Death domain
8718	Death domain
8718	Death domain
8718	Death domain
8718	Death domain
8720	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
8722	Papain family cysteine protease
8722	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
8724	PX domain. PX domains bind to phosphoinositides
8725	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
8725	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
8727	Vinculin family
8728	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
8728	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
8728	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
8728	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
8729	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
8732	mRNA capping enzyme, catalytic domain. This family represents the ATP binding catalytic domain of the mRNA capping enzyme
8732	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
8733	Gaa1-like, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to newly
8735	Myosin head (motor domain)
8737	Death domain
8741	TNF(Tumor Necrosis Factor) family
8741	TNF(Tumor Necrosis Factor) family
8741	TNF(Tumor Necrosis Factor) family
8744	TNF(Tumor Necrosis Factor) family
8745	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
8745	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
8747	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
8747	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
8748	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
8748	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
8749	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
8749	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
8751	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
8751	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
8754	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
8754	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
8756	Disintegrin
8756	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
8756	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
8760	Phosphatidate cytidylyltransferase
8761	Poly-adenylate binding protein, unique domain
8767	Protein kinase domain
8767	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
8772	Death domain
8772	Death effector domain
8775	NSF attachment protein
8785	von Willebrand factor type A domain
8785	von Willebrand factor type A domain
8785	von Willebrand factor type A domain
8786	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
8788	Uncharacterized protein family (ORF7) DUF. Several members of this family are Borrelia burgdorferi plasmid proteins of uncharacterized function
8789	Fructose-1-6-bisphosphatase
8795	TNFR/NGFR cysteine-rich region
8795	TNFR/NGFR cysteine-rich region
8797	TNFR/NGFR cysteine-rich region
8805	B-box zinc finger
8805	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
8807	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
8808	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
8809	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
8811	7 transmembrane receptor (rhodopsin family)
8812	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
8813	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
8815	Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration
8816	WD domain, G-beta repeat
8817	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
8817	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
8820	Homeobox domain
8822	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
8823	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
8824	Carboxylesterase
8825	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
8828	CUB domain
8828	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
8828	CUB domain
8828	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
8829	CUB domain
8829	MAM domain. An extracellular domain found in many receptors
8829	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
8833	GMP synthase C terminal domain. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. This family is the C-terminal domain specific to the GMP synthases EC:6.3.5.2. In prokaryotes this domain mediates dimerisation.
8837	Death effector domain
8840	Thrombospondin type 1 domain
8840	Thrombospondin type 1 domain
8841	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
8843	7 transmembrane receptor (rhodopsin family)
8844	Protein kinase domain
8848	TSC-22/dip/bun family
8851	Cyclin-dependent kinase 5 activator protein
8853	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
8854	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
8854	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
8854	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
8856	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
8856	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
8856	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
8856	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
8856	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
8856	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
8857	von Willebrand factor type D domain
8857	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
8858	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
8858	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
8861	LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located i
8867	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
8869	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
8871	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
8876	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
8882	ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with othe
8883	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
8888	SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1
8890	Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac
8892	Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac
8893	Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars
8893	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
8894	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
8896	G10 protein
8904	C2 domain
8904	C2 domain
8904	C2 domain
8904	C2 domain
8904	C2 domain
8904	C2 domain
8904	C2 domain
8904	C2 domain
8905	Clathrin adaptor complex small chain
8906	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
8906	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
8906	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
8906	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
8907	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
8911	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
8912	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
8913	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
8913	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
8915	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
8924	Anaphase-promoting complex, subunit 10 (APC10)
8924	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
8926	Sm protein
8929	Homeobox domain
8930	HhH-GPD superfamily base excision DNA repair protein. This family contains a diverse range of structurally related DNA repair proteins. The superfamily is called the HhH-GPD family after its hallmark Helix-hairpin-helix and Gly/Pro rich loop followed by
8932	Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha
8936	WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin
8938	C2 domain
8939	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
8941	Cyclin-dependent kinase 5 activator protein
8943	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
8971	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
8972	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
8973	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
8973	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
8976	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
8976	WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin
8976	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
8987	Starch binding domain
8988	Hsp20/alpha crystallin family
8989	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
8997	PH domain. PH stands for pleckstrin homology
9002	7 transmembrane receptor (rhodopsin family)
9016	Mitochondrial carrier protein
9016	Mitochondrial carrier protein
9025	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
9026	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
9026	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
9026	I/LWEQ domain. I/LWEQ domains bind to actin. It has been shown that the I/LWEQ domains from mouse talin and yeast Sla2p interact with F-actin. I/LWEQ domains can be placed into four major groups based on sequence similarity: (1) Metazoan talin
9026	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
9027	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
9028	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
9031	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
9031	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
9033	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
9034	7 transmembrane receptor (rhodopsin family)
9037	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
9037	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
9038	7 transmembrane receptor (rhodopsin family)
9039	Repeat in ubiquitin-activating (UBA) protein
9039	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
9040	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
9043	Intermediate filament protein
9045	Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14
9048	Transforming growth factor beta like domain
9048	Transforming growth factor beta like domain
9048	Transforming growth factor beta like domain
9048	Transforming growth factor beta like domain
9050	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
9052	7 transmembrane receptor (metabotropic glutamate family)
9054	Aminotransferase class-V
9054	Aminotransferase class-V
9055	Microtubule associated protein (MAP65/ASE1 family)
9058	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
9060	ATP-sulfurylase
9060	Adenylylsulfate kinase
9061	ATP-sulfurylase
9061	Adenylylsulfate kinase
9063	pfam02891, zf-MIZ, MIZ zinc finger
9063	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
9063	pfam02891, zf-MIZ, MIZ zinc finger
9063	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
9071	PMP-22/EMP/MP20/Claudin family
9071	PMP-22/EMP/MP20/Claudin family
9073	PMP-22/EMP/MP20/Claudin family
9074	PMP-22/EMP/MP20/Claudin family
9075	PMP-22/EMP/MP20/Claudin family
9076	PMP-22/EMP/MP20/Claudin family
9079	LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located i
9080	PMP-22/EMP/MP20/Claudin family
9086	Eukaryotic initiation factor 1A
9092	SART-1 family. This family of proteins appear to contain a leucine zipper and may therefore be a family of transcription factors
9095	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
9096	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
9097	Ubiquitin carboxyl-terminal hydrolase family 2
9101	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
9104	Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis
9104	Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis
9112	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
9113	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t
9114	ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae
9118	Intermediate filament protein
9119	Intermediate filament protein
9122	Sugar (and other) transporter
9123	Monocarboxylate transporter
9124	LIM domain. This family represents two copies of the LIM structural domain
9124	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
9125	Cell differentiation family, Rcd1-like. Two of the members in this family have been characterised as being involved in regulation of Ste11 regulated sex genes
9127	ATP P2X receptor
9129	PWI domain
9131	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
9131	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
9131	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
9132	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
9132	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
9132	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
9132	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
9134	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
9134	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
9134	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
9135	Rabaptin
9139	MYND finger
9139	MYND finger
9140	Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5
9141	Protein of unknown function DUF122. This protein family has no known function
9143	Synaptogyrin. This family of proteins is distantly related to pfam01284
9144	Synaptogyrin. This family of proteins is distantly related to pfam01284
9145	Synaptogyrin. This family of proteins is distantly related to pfam01284
9145	Synaptogyrin. This family of proteins is distantly related to pfam01284
9145	Synaptogyrin. This family of proteins is distantly related to pfam01284
9146	VHS domain. Domain present in VPS-27, Hrs and STAM
9150	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
9150	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
9150	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
9150	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
9152	Sodium:neurotransmitter symporter family
9153	Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin
9154	Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin
9159	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
9167	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
9170	7 transmembrane receptor (rhodopsin family)
9172	Fibronectin type III domain
9173	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
9175	Protein kinase domain
9177	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
9177	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
9179	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
9180	Fibronectin type III domain
9187	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
9189	pfam02892, zf-BED, BED zinc finger
9211	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function,
9211	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
9213	EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be
9213	SPX domain. We have named this region the SPX domain after (SYG1, Pho81 and XPR1). This 180 residue length domain is found at the amino terminus of a variety of proteins. In the yeast protein SYG1, the N-terminus directly binds to the G- protein beta sub
9215	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl
9215	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl
9217	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
9218	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
9219	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
9223	Guanylate kinase
9223	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
9223	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
9228	Guanylate-kinase-associated protein (GKAP) protein
9229	Guanylate-kinase-associated protein (GKAP) protein
9236	Late embryogenesis abundant protein. Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress. The function of these proteins is unknown
9236	Late embryogenesis abundant protein. Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress. The function of these proteins is unknown
9245	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
9246	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
9247	GCM motif protein
9248	7 transmembrane receptor (rhodopsin family)
9248	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
9253	Phosphotyrosine interaction domain (PTB/PID)
9254	Cache domain
9255	Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte act
9260	LIM domain. This family represents two copies of the LIM structural domain
9265	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
9266	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
9266	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
9267	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
9267	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
9271	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
9271	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
9271	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
9271	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
9276	Coatomer WD associated domain
9278	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
9283	7 transmembrane receptor (rhodopsin family)
9284	Caldesmon
9287	7 transmembrane receptor (rhodopsin family)
9289	7 transmembrane receptor (Secretin family)
9289	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
9290	7 transmembrane receptor (rhodopsin family)
9293	7 transmembrane receptor (rhodopsin family)
9294	7 transmembrane receptor (rhodopsin family)
9296	ATP synthase (F/14-kDa) subunit. This family includes 14-kDa subunit from vATPases, which is in the peripheral catalytic part of the complex. The family also includes archaebacterial ATP synthase subunit F
9311	Amiloride-sensitive sodium channel
9311	Amiloride-sensitive sodium channel
9311	Amiloride-sensitive sodium channel
9312	Kv2 voltage-gated K+ channel
9312	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
9312	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
9313	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
9313	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
9317	Phosphotriesterase family
9320	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
9322	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
9331	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
9333	Transglutaminase family
9333	Transglutaminase family, C-terminal ig like domain
9333	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
9334	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
9340	7 transmembrane receptor (Secretin family)
9340	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
9341	Synaptobrevin
9348	Sulfotransferase protein
9350	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb
9352	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
9353	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
9355	Homeobox domain
9355	LIM domain. This family represents two copies of the LIM structural domain
9356	Sugar (and other) transporter
9356	Sugar (and other) transporter
9356	Sugar (and other) transporter
9356	Sugar (and other) transporter
9356	Sugar (and other) transporter
9360	Cyclophilin type peptidyl-prolyl cis-trans isomerase
9362	C2 domain
9365	Glycosyl hydrolase family 1
9365	Glycosyl hydrolase family 1
9367	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
9370	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
9371	Kinesin motor domain
9372	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
9372	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
9372	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
9374	Palmitoyl protein thioesterase
9374	Palmitoyl protein thioesterase
9374	Palmitoyl protein thioesterase
9375	Endomembrane protein 70
9376	Sugar (and other) transporter
9377	Cytochrome c oxidase subunit Va. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit Va
9378	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
9381	C2 domain
9388	Lipase
9388	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
9389	Sugar (and other) transporter
9390	Sugar (and other) transporter
9399	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
9399	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
9404	LIM domain. This family represents two copies of the LIM structural domain
9407	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
9414	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
9415	Fatty acid desaturase
9415	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
9420	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
9423	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
9424	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
9435	Sulfotransferase protein
9445	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
9447	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
9447	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
9448	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
9450	ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r
9452	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
9453	Polyprenyl synthetase
9454	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
9455	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
9456	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
9462	C2 domain
9464	Helix-loop-helix DNA-binding domain
9470	Eukaryotic initiation factor 4E
9472	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
9474	Autophagy protein Apg5. Apg5p is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway
9476	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
9480	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
9481	Mitochondrial carrier protein
9486	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
9488	Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability
9493	Kinesin motor domain
9493	Kinesin motor domain
9495	Protein kinase A anchor
9497	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
9498	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
9500	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
9501	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
9507	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
9508	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
9508	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
9509	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
9509	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
9509	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
9509	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
9510	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
9510	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
9520	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
9522	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
9522	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
9524	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
9524	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
9525	MIT domain
9526	PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How
9529	BAG domain. Domain present in Hsp70 regulators
9530	BAG domain. Domain present in Hsp70 regulators
9531	BAG domain. Domain present in Hsp70 regulators
9532	BAG domain. Domain present in Hsp70 regulators
9535	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
9536	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
9542	Neuregulin family
9542	Neuregulin family
9542	Neuregulin family
9542	Neuregulin family
9543	Fibronectin type III domain
9543	Fibronectin type III domain
9543	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
9546	Phosphotyrosine interaction domain (PTB/PID)
9546	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
9547	Small cytokines (intecrine/chemokine)
9550	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
9559	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp
9562	Histidine acid phosphatase
9568	7 transmembrane receptor (metabotropic glutamate family)
9568	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
9569	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
9569	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
9572	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
9572	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
9573	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
9578	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
9578	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
9581	Prolyl oligopeptidase family
9581	pfam02897, Peptidase_S9_N, Prolyl oligopeptidase, N-terminal beta-propeller domain. This unusual 7-stranded beta-propeller domain protects the catalytic triad of prolyl oligopeptidase (see pfam00326), excluding larger peptides and proteins from proteolysi
9582	Cytidine and deoxycytidylate deaminase zinc-binding region
9583	GDA1/CD39 (nucleoside phosphatase) family
9585	Kinesin motor domain
9600	Phosphatidylinositol transfer protein
9600	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester
9601	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
9604	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
9610	Vacuolar sorting protein 9 (VPS9) domain
9610	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
9611	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
9612	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
9618	TRAF-type zinc finger
9618	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
9618	TRAF-type zinc finger
9618	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
9620	Cadherin domain
9620	7 transmembrane receptor (Secretin family)
9620	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
9620	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
9625	Protein kinase domain
9628	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
9630	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
9631	Non-repetitive/WGA-negative nucleoporin. This is a family of nucleoporin proteins. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells, and mediate bidirectional nucleocytoplasmic transport, especially of mRNA and protein
9631	Non-repetitive/WGA-negative nucleoporin. This is a family of nucleoporin proteins. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells, and mediate bidirectional nucleocytoplasmic transport, especially of mRNA and protein
9633	Tesmin/TSO1-like CXC domain. This family includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family is called a CXC domain in
9636	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
9644	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
9648	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
9648	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
9649	PH domain. PH stands for pleckstrin homology
9654	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
9656	Lysosome-associated membrane glycoprotein (Lamp)
9656	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
9656	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
9658	Zinc finger, C2H2 type
9667	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
9671	Sulfotransferase protein
9671	WSC domain. This domain may be involved in carbohydrate binding
9672	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
9673	Mitochondrial carrier protein
9677	Histidine acid phosphatase
9678	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
9681	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
9682	jmjC domain
9682	jmjN domain
9685	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
9688	Nucleoporin interacting componentent
9695	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
9697	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
9700	Peptidase family C50
9702	Arsenical pump membrane protein
9705	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
9709	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
9711	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
9714	PH domain. PH stands for pleckstrin homology
9717	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
9717	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
9722	Phosphotyrosine interaction domain (PTB/PID)
9722	Phosphotyrosine interaction domain (PTB/PID)
9723	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
9725	Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length
9728	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
9739	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
9739	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
9741	Golgi 4-transmembrane spanning transporter
9744	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
9745	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
9749	RPEL repeat
9757	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
9757	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
9757	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
9758	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
9759	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
9765	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
9770	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
9770	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
9770	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
9777	Endomembrane protein 70
9781	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
9784	PX domain. PX domains bind to phosphoinositides
9786	Herpesvirus UL6 like
9787	Guanylate-kinase-associated protein (GKAP) protein
9791	Phosphatidyl serine synthase. Phosphatidyl serine synthase is also known as serine exchange enzyme. This family represents eukaryotic PSS I and II which are membrane bound proteins which catalyses the replacement of the head group of a phospholipid (phos
9795	PX domain. PX domains bind to phosphoinositides
9796	Fibronectin type III domain
9797	TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme
9798	Eukaryotic protein of unknown function, DUF292
9801	Ribosomal protein L19
9804	MAS20 protein import receptor
9806	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
9807	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
9817	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
9819	TSC-22/dip/bun family
9820	Cullin family
9820	Anaphase-promoting complex, subunit 10 (APC10)
9833	Kinase associated domain 1
9836	Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.- . This family may need divides a the full alignment contains a significantly shorter mouse sequence
9837	Uncharacterised protein family (UPF0080)
9839	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
9841	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
9842	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
9843	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
9843	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
9847	C2 domain
9853	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
9853	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
9855	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
9855	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
9857	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
9859	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
9863	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
9865	Fibronectin type III domain
9865	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
9866	B-box zinc finger
9869	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
9870	Filamin/ABP280 repeat
9878	HMG (high mobility group) box
9880	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
9885	Oxysterol-binding protein
9885	Oxysterol-binding protein
9886	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
9889	pfam02892, zf-BED, BED zinc finger
9890	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
9890	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
9892	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
9896	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
9899	Sugar (and other) transporter
9900	Sugar (and other) transporter
9901	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
9902	Lectin C-type domain. This family includes both long and short form C-type
9903	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
9903	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
9905	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
9905	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
9909	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
9909	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
9915	Helix-loop-helix DNA-binding domain
9920	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
9920	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
9924	Ubiquitin carboxyl-terminal hydrolase family 2
9925	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
9927	GTPase of unknown function
9928	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
9928	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
9929	Josephin
9931	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
9934	7 transmembrane receptor (rhodopsin family)
9935	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
9938	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
9947	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
9949	Protein of unknown function DUF51. This family contains members in prokaryotes and eukaryotes. None of the members has a known function
9951	Sulfotransferase protein
9953	Sulfotransferase protein
9955	Sulfotransferase protein
9956	Sulfotransferase protein
9957	Sulfotransferase protein
9961	Major Vault Protein repeat. The vault is a ubiquitous and highly conserved ribonucleoprotein particle of approximately 13 mDa of unknown function. This family corresponds to a repeat found in the amino terminal half of the major vault protein
9961	Major Vault Protein repeat. The vault is a ubiquitous and highly conserved ribonucleoprotein particle of approximately 13 mDa of unknown function. This family corresponds to a repeat found in the amino terminal half of the major vault protein
9962	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic
9963	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic
9963	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic
9970	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
9970	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
9971	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
9971	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
9972	Zn-finger in Ran binding protein and others
9973	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
9975	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
9975	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
9980	Dopey, N-terminal domain. DopA is the founding member of the Dopey family and is required for correct cell morphology and spatiotemporal organization of multicellular structures in the filamentous fungus Aspergillus nidulans. DopA homologues are found in
9986	CAAX amino terminal protease family. Members of this family are probably proteases. the family contains CAAX prenyl protease. The proteins contain a highly conserved Glu-Glu motif at the amino end of the alignment. The alignment also contains two histidi
9987	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
9988	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
9989	HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats
9990	K-Cl Co-transporter type 1 (KCC1)
9991	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
9992	Slow voltage-gated potassium channel
9996	Sec63 domain
9996	pfam02889, Sec63, Sec63 domain
9996	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
9996	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
10002	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
10002	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
10002	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
10002	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
10003	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
10004	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
10004	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
10007	Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase
10008	Slow voltage-gated potassium channel
10009	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
10010	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
10013	Zn-finger in ubiquitin-hydrolases and other protein
10015	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
10021	Cyclic nucleotide-binding domain
10021	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
10026	Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this
10038	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
10038	pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini
10039	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
10039	pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini
10040	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
10043	VHS domain. Domain present in VPS-27, Hrs and STAM
10043	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
10052	Connexin
10053	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
10054	Repeat in ubiquitin-activating (UBA) protein
10054	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
10055	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
10057	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
10059	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
10059	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
10059	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
10062	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
10062	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
10066	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
10067	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
10067	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
10071	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
10072	Peptidase family M49
10072	Peptidase family M49
10075	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
10075	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
10076	Fibronectin type III domain
10076	Fibronectin type III domain
10076	Fibronectin type III domain
10077	Tetraspanin family
10077	Tetraspanin family
10077	Tetraspanin family
10082	Glypican
10085	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
10085	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
10086	1
10094	P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function
10098	Tetraspanin family
10099	Tetraspanin family
10100	Tetraspanin family
10102	Elongation factor TS
10103	Tetraspanin family
10105	Cyclophilin type peptidyl-prolyl cis-trans isomerase
10106	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
10107	B-box zinc finger
10107	B-box zinc finger
10109	Arp2/3 complex, 34kD subunit p34-Arc. Arp2/3 protein complex has been implicated in the control of actin polymerization in cells. The human complex consists of seven subunits which include the actin related Arp2 and Arp3, and five others referred to as p
10109	Arp2/3 complex, 34kD subunit p34-Arc. Arp2/3 protein complex has been implicated in the control of actin polymerization in cells. The human complex consists of seven subunits which include the actin related Arp2 and Arp3, and five others referred to as p
10112	Kinesin motor domain
10123	ADP-ribosylation factor family
10124	ADP-ribosylation factor family
10125	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
10126	Dynein light chain type 1
10130	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
10139	ADP-ribosylation factor family
10140	BTG1 family. A novel family of anti-proliferative proteins
10143	Lectin C-type domain. This family includes both long and short form C-type
10147	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
10147	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
10149	7 transmembrane receptor (Secretin family)
10153	CBF/Mak21 family
10154	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
10154	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
10155	B-box zinc finger
10155	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
10157	Alanine dehydrogenase/pyridine nucleotide transhydrogenase. This family now also contains the lysine 2-oxoglutarate reductases
10157	Saccharopine dehydrogenase. This family comprised of three structural domains that can not be separated in the linear sequence. In some organisms this enzyme is found as a bifunctional polypeptide with lysine ketoglutarate reductase (PF). The saccharopin
10160	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
10161	7 transmembrane receptor (rhodopsin family)
10165	Mitochondrial carrier protein
10166	Mitochondrial carrier protein
10168	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
10170	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
10171	RNA 3'-terminal phosphate cyclase
10175	Cornichon protein
10190	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
10197	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b
10197	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin
10197	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b
10197	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin
10199	Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerev
10199	Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi
10201	Nucleoside diphosphate kinases
10202	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
10203	7 transmembrane receptor (Secretin family)
10206	B-box zinc finger
10206	B-box zinc finger
10209	Translation initiation factor SUI1
10211	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
10215	Helix-loop-helix DNA-binding domain
10216	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
10217	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
10220	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
10224	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
10226	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
10235	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
10235	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ
10235	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
10239	Clathrin adaptor complex small chain
10242	Calcium-activated potassium channel, beta subunit
10242	Calcium-activated potassium channel, beta subunit
10245	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim
10247	Endoribonuclease L-PSP. Endoribonuclease active on single-stranded mRNA. Inhibits protein synthesis by cleavage of mRNA. Previously thought to inhibit protein synthesis initiation. This protein may also be involved in the regulation of purine biosynthesi
10250	PWI domain
10254	VHS domain. Domain present in VPS-27, Hrs and STAM
10260	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
10260	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
10260	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
10269	Peptidase family M48
10274	Stromal antigen (SA/STAG) protein
10279	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
10283	Cyclophilin type peptidyl-prolyl cis-trans isomerase
10287	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
10289	Translation initiation factor SUI1
10307	Phosphotyrosine interaction domain (PTB/PID)
10307	Phosphotyrosine interaction domain (PTB/PID)
10307	Phosphotyrosine interaction domain (PTB/PID)
10307	Phosphotyrosine interaction domain (PTB/PID)
10307	Phosphotyrosine interaction domain (PTB/PID)
10307	Phosphotyrosine interaction domain (PTB/PID)
10309	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
10311	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp
10312	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
10312	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
10313	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
10316	7 transmembrane receptor (rhodopsin family)
10317	Galactosyltransferase
10317	Galactosyltransferase
10317	Galactosyltransferase
10317	Galactosyltransferase
10317	Galactosyltransferase
10319	Laminin B (Domain IV)
10319	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
10322	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
10324	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
10324	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
10325	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
10325	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
10327	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
10327	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
10328	Uncharacterised protein family (UPF0172)
10331	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
10332	Lectin C-type domain. This family includes both long and short form C-type
10333	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
10343	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
10343	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
10343	REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c
10346	B-box zinc finger
10346	Zinc finger, C3HC4 type (RING finger)
10351	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f
10360	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
10361	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
10361	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
10362	HMG (high mobility group) box
10363	HMG (high mobility group) box
10368	PMP-22/EMP/MP20/Claudin family
10369	PMP-22/EMP/MP20/Claudin family
10371	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
10376	Tubulin/FtsZ family. This family includes the tubulin alpha
10379	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp
10380	Inositol monophosphatase family
10380	Inositol monophosphatase family
10385	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
10385	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
10389	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
10392	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
10392	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
10393	Anaphase-promoting complex, subunit 10 (APC10)
10397	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
10400	Phospholipid methyltransferase. The S. cerevisiae phospholipid methyltransferase (EC:2.1.1.16) has a broad substrate specificity of unsaturated phospholipids
10400	Phospholipid methyltransferase. The S. cerevisiae phospholipid methyltransferase (EC:2.1.1.16) has a broad substrate specificity of unsaturated phospholipids
10400	Phospholipid methyltransferase. The S. cerevisiae phospholipid methyltransferase (EC:2.1.1.16) has a broad substrate specificity of unsaturated phospholipids
10401	pfam02891, zf-MIZ, MIZ zinc finger
10402	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
10403	HEC/Ndc80p family. Members of this family are components of the mitotic spindle. It has been shown that Ndc80/HEC from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle
10404	Peptidase family M28D
10406	WAP-type (Whey Acidic Protein) 'four-disulfide core'
10411	Cyclic nucleotide-binding domain
10411	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
10418	Reeler domain
10418	Thrombospondin type 1 domain
10420	Protein kinase domain
10423	CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond
10423	CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond
10424	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
10425	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
10426	Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin
10434	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
10436	Suppressor Mra1. The suppressor Mra1 is found in high-copy-number when Ras1 is mutated, that recovers the mating deficiency caused by the decrease of Ras1 activity. Mutational analysis in yeast suggests that the suppressor Mra1 is essential for cell grow
10437	Gamma interferon inducible lysosomal thiol reductase (GILT). This family includes the two characterized human gamma-interferon-inducible lysosomal thiol reductase (GILT) sequences. It also contains several other eukaryotic putative proteins with similari
10440	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim
10445	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
10449	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
10450	Cyclophilin type peptidyl-prolyl cis-trans isomerase
10451	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
10451	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
10454	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
10455	Acyl CoA binding protein
10459	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
10461	Fibronectin type III domain
10462	Hepatic lectin, N-terminal domain
10462	Lectin C-type domain. This family includes both long and short form C-type
10463	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
10465	Cyclophilin type peptidyl-prolyl cis-trans isomerase
10469	Mitochondrial import inner membrane, translocase subunit TIM44. Tim44 is an essential component of the machinery that mediates the translocation of nuclear-encoded proteins across the mitochondrial inner membrane. Tim44 is thought to bind phospholipids o
10471	KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif
10472	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
10474	Putative replicase 1 (ORF2)
10475	B-box zinc finger
10478	Mitochondrial carrier protein
10479	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
10481	Homeobox domain
10482	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
10482	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl
10483	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/2
10483	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/2
10483	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/2
10484	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/2
10486	CAP protein
10487	CAP protein
10497	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
10507	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
10509	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
10509	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
10512	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
10513	Cyclin
10522	MYND finger
10522	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
10523	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
10525	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
10529	Nebulin repeat
10533	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
10535	Ribonuclease HII
10537	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
10539	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
10544	Class I Histocompatibility antigen, domains alpha 1 and 2
10548	Endomembrane protein 70
10550	Prenylated rab acceptor (PRA1)
10554	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
10554	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
10555	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
10556	RNase P subunit p30. This protein is part of the RNase P complex that is involved in tRNA maturation
10557	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa
10559	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
10560	Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog
10567	Prenylated rab acceptor (PRA1)
10568	Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor
10570	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
10571	Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
10575	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
10577	ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r
10580	Sorbin homologous domain
10580	Sorbin homologous domain
10586	Mab-21 protein
10588	5-formyltetrahydrofolate cyclo-ligase family. 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyses the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF
10599	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
10599	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
10605	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA
10605	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA
10606	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
10606	AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this
10610	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
10611	LIM domain. This family represents two copies of the LIM structural domain
10613	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
10615	Myosin tail
10617	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has
10620	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
10625	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
10631	Fasciclin domain. This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria
10636	Raf-like Ras-binding domain
10637	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
10641	Uncharacterised protein family (UPF0171)
10654	Phosphomevalonate kinase. Phosphomevalonate kinase (EC:2.7.4.2) catalyzes the phosphorylation of 5-phosphomevalonate into 5-diphosphomevalonate, an essential step in isoprenoid biosynthesis via the mevalonate pathway. This family represents the animal ty
10660	Homeobox domain
10663	7 transmembrane receptor (rhodopsin family)
10667	tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain
10672	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
10678	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
10678	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
10686	PMP-22/EMP/MP20/Claudin family
10687	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
10690	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
10691	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
10691	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
10692	7 transmembrane receptor (rhodopsin family)
10693	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
10694	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
10699	Low-density lipoprotein receptor domain class A
10699	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
10715	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
10717	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
10720	UDP-glucoronosyl and UDP-glucosyl transferase
10726	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
10734	Stromal antigen (SA/STAG) protein
10735	Stromal antigen (SA/STAG) protein
10747	Trypsin
10747	CUB domain
10747	Sushi domain (SCR repeat)
10749	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
10750	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
10752	Fibronectin type III domain
10753	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
10761	Zona pellucida-like domain
10765	jmjC domain
10765	jmjN domain
10765	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
10765	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
10765	pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function
10766	BTG1 family. A novel family of anti-proliferative proteins
10768	S-adenosyl-L-homocysteine hydrolase
10769	POLO box duplicated region
10771	MYND finger
10771	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
10772	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
10773	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
10782	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
10782	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
10782	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
10791	Synaptobrevin
10794	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
10794	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
10797	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain
10798	7 transmembrane receptor (rhodopsin family)
10800	7 transmembrane receptor (rhodopsin family)
10801	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
10802	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24
10803	7 transmembrane receptor (rhodopsin family)
10803	7 transmembrane receptor (rhodopsin family)
10804	Connexin
10807	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
10808	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
10810	WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin
10817	PTB domain (IRS-1 type)
10818	PTB domain (IRS-1 type)
10826	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues.
10838	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
10840	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
10840	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
10844	Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin
10846	3'5'-cyclic nucleotide phosphodiesterase
10847	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
10855	Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular
10857	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
10858	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
10861	Sulfate transporter family. Mutations may lead to several human diseases
10863	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
10863	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
10863	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
10863	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
10863	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
10863	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
10864	Sugar (and other) transporter
10865	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
10867	Tetraspanin family
10869	MYND finger
10869	Ubiquitin carboxyl-terminal hydrolase family 2
10869	Ubiquitin carboxyl-terminal hydrolases family 2
10873	Malic enzyme, NAD binding domain
10874	Neuromedin U
10875	Fibrinogen beta and gamma chains, C-terminal globular domain
10877	Sushi domain (SCR repeat)
10878	Sushi domain (SCR repeat)
10880	Actin
10881	Actin
10885	WD domain, G-beta repeat
10885	Dip2/Utp12 Family. This domain is found at the C-terminus of proteins containing WD40 repeats. These proteins are part of the U3 ribonucleoprotein the yeast protein is called Utp12 or DIP2
10886	7 transmembrane receptor (rhodopsin family)
10886	7 transmembrane receptor (rhodopsin family)
10887	7 transmembrane receptor (rhodopsin family)
10888	7 transmembrane receptor (rhodopsin family)
10890	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
10892	Death domain
10892	Death domain
10893	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
10893	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
10893	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
10897	Hrf1 family. This family includes a number of eukaryotic proteins. It is an integral membrane protein, conserved in at least 1 copy in all sequenced eukaryotes. The gene name in S. pombe is hrf1+ for Heavy metal Resistance Factor 1
10900	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
10901	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
10905	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
10906	TRAF-type zinc finger
10906	7 transmembrane receptor (rhodopsin family)
10907	Mitosis protein DIM1
10908	Cyclic nucleotide-binding domain
10911	Urotensin II
10912	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
10913	Death domain
10915	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
10916	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
10916	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
10919	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
10919	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
10923	Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p1
10927	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members
10936	7 transmembrane receptor (rhodopsin family)
10941	UDP-glucoronosyl and UDP-glucosyl transferase
10945	ER lumen protein retaining receptor
10947	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
10950	BTG1 family. A novel family of anti-proliferative proteins
10951	'chromo' (CHRromatin Organization MOdifier) domain
10951	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
10952	Sec61beta family. This family consists of homologues of Sec61beta - a component of the Sec61/SecYEG protein secretory system. The domain is found in eukaryotes and archaea and is possibly homologous to the bacterial SecG
10954	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
10955	TMS membrane protein/tumour differentially expressed protein (TDE)
10959	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
10960	Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-
10961	Transposase
10963	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
10969	Cobalt transport protein
10971	14-3-3 protein
10972	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
10983	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
10988	metallopeptidase family M24
10991	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
10992	PSP. Proline rich domain found in numerous spliceosome associated proteins
10992	Domain of unknown function (DUF382). This domain is specific to the human splicing factor 3b subunit 2 and it's orthologs
10992	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
11004	Kinesin motor domain
11012	Trypsin
11014	ER lumen protein retaining receptor
11015	ER lumen protein retaining receptor
11015	ER lumen protein retaining receptor
11018	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
11020	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
11022	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
11022	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
11031	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
11033	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
11034	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
11035	Protein kinase domain
11036	Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep
11036	Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep
11037	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
11039	Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
11043	B-box zinc finger
11043	B-box zinc finger
11045	Tetraspanin family
11051	MutT-like domain
11059	C2 domain
11060	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
11061	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
11062	Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrou
11065	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
11069	Cyclic nucleotide-binding domain
11069	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
11073	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
11075	Stathmin family
11079	Rer1 family. RER1 family protein are involved in involved in the retrieval of some endoplasmic reticulum membrane proteins from the early golgi compartment. The C terminus of yeast Rer1p interacts with a coatomer complex
11083	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
11083	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
11083	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
11085	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11085	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11086	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11086	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11086	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11086	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11086	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11086	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11093	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11093	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11093	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11093	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11095	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11095	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11096	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11096	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11102	Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40
11113	PH domain. PH stands for pleckstrin homology
11113	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
11113	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
11113	Rotavirus non-structural protein NSP3. This family consist of rotaviral non-structural RNA binding protein 34 (NS34 or NSP3). The NSP3 protein has been shown to bind viral RNA. The NSP3 protein consists of 3 conserved functional domains
11122	Fibronectin type III domain
11122	Fibronectin type III domain
11127	Kinesin motor domain
11128	RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family
11128	RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family inclu
11138	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
11139	PH domain. PH stands for pleckstrin homology
11140	Cdc37 family. In the budding yeast Saccharomyces cerevisiae, Cdc37 is required for the productive formation of Cdc28-cyclin complexes. Cdc37 may be a kinase targeting subunit of Hsp90
11141	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
11142	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
11142	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
11142	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
11143	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
11145	NLP/P60 family. The function of this domain is unknown. It is found in several lipoproteins
11153	Fic protein family. This family consists of the Fic (filamentation induced by cAMP) protein and its relatives. The Fic protein is involved in cell division and is suggested to be involved in the synthesis of PAB or folate, indicating that the Fic protein
11155	LIM domain. This family represents two copies of the LIM structural domain
11155	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
11157	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
11160	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
11161	Uncharacterised protein family (UPF0143). This family of uncharacterised proteins are integral membrane proteins. They may contain 4 transmembrane helices. The family contains a conserved arginine and histidine that may be functionally important
11162	NUDIX domain
11163	NUDIX domain
11164	NUDIX domain
11165	NUDIX domain
11166	HMG (high mobility group) box
11168	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
11168	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
11173	Thrombospondin type 1 domain
11173	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11173	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11174	Thrombospondin type 1 domain
11174	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11174	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11176	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
11176	DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors)
11177	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
11177	DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors)
11177	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
11177	DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors)
11181	Trehalase
11182	Sugar (and other) transporter
11185	NNMT/PNMT/TEMT family
11186	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
11186	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
11186	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
11186	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
11186	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
11186	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
11186	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
11186	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
11186	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
11186	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
11188	PX domain. PX domains bind to phosphoinositides
11193	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
11196	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester
11199	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
11200	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
11201	impB/mucB/samB family. These proteins are involved in UV protection
11211	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
11213	Death domain
11213	Protein kinase domain
11221	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
11221	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
11221	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
11222	Ribosomal protein L3
11224	Sodium:alanine symporter family
11226	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
11227	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
11227	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
11238	Eukaryotic-type carbonic anhydrase
11240	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
11245	7 transmembrane receptor (rhodopsin family)
11250	7 transmembrane receptor (rhodopsin family)
11251	7 transmembrane receptor (rhodopsin family)
11252	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
11253	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
11253	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
11254	Sodium:neurotransmitter symporter family
11255	7 transmembrane receptor (rhodopsin family)
11260	Exportin-t. Exportin-t is a specific mediator of tRNA export. RanGTP-binding, importin beta-related factor with predominantly nuclear localization. It shuttles rapidly between nucleus and between nucleus and cytoplasm and interacts with nuclear pore comp
11262	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
11262	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
11262	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
11267	EAP30. EAP30 is a subunit of the ELL complex. The ELL is an 80-kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of t
11275	Kelch motif
11275	BTB/POZ domain
11280	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
11281	Homeobox domain
11283	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
11285	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
11287	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
11287	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
11309	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
11309	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
11311	Sec1 family
11313	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
11314	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
11315	DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117
11317	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
11318	7 transmembrane receptor (rhodopsin family)
11321	Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity
11328	FKBP-type peptidyl-prolyl cis-trans isomerase
11331	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
11332	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C
11332	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C
11332	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C
11332	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C
11332	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C
11332	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C
11335	'chromo' (CHRromatin Organization MOdifier) domain
11335	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
11335	'chromo' (CHRromatin Organization MOdifier) domain
11335	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
11337	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
11340	3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
11340	3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
11345	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
11350	Protein kinase domain
11352	SH2 domain
11352	Protein kinase domain
11352	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
11354	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
11418	Amiloride-sensitive sodium channel
11419	Amiloride-sensitive sodium channel
11421	Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid
11421	Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid
11423	Carboxylesterase
11423	Carboxylesterase
11426	Growth-Arrest-Specific Protein 2 Domain
11426	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
11426	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
11428	Aconitase family (aconitate hydratase)
11428	Aconitase C-terminal domain. Members of this family usually also match to pfam00330. This domain undergoes conformational change in the enzyme mechanism
11430	Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA
11432	Histidine acid phosphatase
11432	Histidine acid phosphatase
11433	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte
11435	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
11435	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
11438	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
11438	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
11440	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
11440	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
11441	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
11441	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
11443	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
11443	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
11444	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
11444	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
11447	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
11447	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
11448	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
11448	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
11449	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
11449	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
11450	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
11459	Actin
11459	Actin
11461	Actin
11464	Actin
11465	Actin
11465	Actin
11468	Actin
11469	Actin
11470	Actin
11471	Actin
11472	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
11474	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
11475	Actin
11477	Protein kinase domain
11477	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
11479	Protein kinase domain
11479	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
11479	Protein kinase domain
11479	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with t
11480	Protein kinase domain
11480	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
11481	Protein kinase domain
11481	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
11482	Protein kinase domain
11482	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
11486	Adenosine/AMP deaminase
11487	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11488	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11488	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11489	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11489	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11490	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11490	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11491	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11492	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11492	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11495	Disintegrin
11495	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
11495	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
11496	Disintegrin
11496	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
11496	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
11497	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11497	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11498	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11498	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11499	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11499	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11500	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11500	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11501	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11501	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11502	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11502	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11504	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
11504	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
11512	Adenylate and Guanylate cyclase catalytic domain
11513	Adenylate and Guanylate cyclase catalytic domain
11514	Adenylate and Guanylate cyclase catalytic domain
11515	Adenylate and Guanylate cyclase catalytic domain
11516	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
11517	7 transmembrane receptor (Secretin family)
11518	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
11519	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
11520	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
11534	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es
11535	Adrenomedullin
11536	7 transmembrane receptor (rhodopsin family)
11539	7 transmembrane receptor (rhodopsin family)
11540	7 transmembrane receptor (rhodopsin family)
11541	7 transmembrane receptor (rhodopsin family)
11542	7 transmembrane receptor (rhodopsin family)
11542	7 transmembrane receptor (rhodopsin family)
11544	ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisi
11545	Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc
11545	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
11545	pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini
11546	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
11546	pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini
11548	7 transmembrane receptor (rhodopsin family)
11548	7 transmembrane receptor (rhodopsin family)
11549	7 transmembrane receptor (rhodopsin family)
11551	7 transmembrane receptor (rhodopsin family)
11552	7 transmembrane receptor (rhodopsin family)
11553	7 transmembrane receptor (rhodopsin family)
11553	7 transmembrane receptor (rhodopsin family)
11554	7 transmembrane receptor (rhodopsin family)
11555	7 transmembrane receptor (rhodopsin family)
11555	7 transmembrane receptor (rhodopsin family)
11556	7 transmembrane receptor (rhodopsin family)
11565	Adenylosuccinate synthetase
11566	Adenylosuccinate synthetase
11567	Villin headpiece domain
11568	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
11593	Asparaginase
11595	Extracellular link domain
11595	Sushi domain (SCR repeat)
11596	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
11596	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
11601	Intermediate filament protein
11603	Laminin G domain
11603	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
11603	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
11603	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
11603	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
11603	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
11605	Melibiase
11606	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
11607	7 transmembrane receptor (rhodopsin family)
11608	7 transmembrane receptor (rhodopsin family)
11609	7 transmembrane receptor (rhodopsin family)
11622	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
11622	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
11625	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
11634	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
11634	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
11634	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
11636	Adenylate kinase
11637	Adenylate kinase
11639	Adenylate kinase
11640	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
11647	Alkaline phosphatase
11648	Alkaline phosphatase
11650	Alkaline phosphatase
11651	PH domain. PH stands for pleckstrin homology
11657	Serum albumin family
11668	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
11669	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
11670	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
11671	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
11674	Fructose-bisphosphate aldolase class-I
11676	Fructose-bisphosphate aldolase class-I
11677	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
11682	MAM domain. An extracellular domain found in many receptors
11684	Lipoxygenase
11685	Lipoxygenase
11686	Lipoxygenase
11686	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
11687	Lipoxygenase
11688	Lipoxygenase
11688	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
11690	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
11694	Homeobox domain
11695	OAR domain
11695	Homeobox domain
11699	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
11702	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermi
11702	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
11703	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermi
11703	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
11704	pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in biomineralization. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organization and m
11705	Transforming growth factor beta like domain
11717	Adenosine/AMP deaminase
11722	Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwe
11722	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta st
11723	Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwe
11723	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta st
11727	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
11730	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
11731	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
11733	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
11735	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
11735	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
11735	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
11735	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
11735	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
11735	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
11735	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
11735	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
11735	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
11736	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
11739	Mitochondrial carrier protein
11740	Mitochondrial carrier protein
11744	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
11745	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
11746	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
11747	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
11749	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
11749	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
11750	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
11752	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
11754	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
11754	Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
11754	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog
11764	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
11764	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
11764	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
11764	Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found
11764	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
11765	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
11765	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
11766	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
11766	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
11767	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
11768	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
11769	Clathrin adaptor complex small chain
11770	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
11771	Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site
11771	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
11771	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
11772	Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site
11772	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
11772	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
11773	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
11774	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
11775	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
11776	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
11778	Clathrin adaptor complex small chain
11779	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
11781	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
11783	NB-ARC domain
11783	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
11784	Phosphotyrosine interaction domain (PTB/PID)
11784	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
11785	Phosphotyrosine interaction domain (PTB/PID)
11785	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
11785	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
11787	Phosphotyrosine interaction domain (PTB/PID)
11787	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
11787	Phosphotyrosine interaction domain (PTB/PID)
11787	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
11789	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
11789	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
11793	Autophagy protein Apg5. Apg5p is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway
11796	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
11797	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
11803	Amyloid A4 extracellular domain
11806	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
11808	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
11808	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, Apolipoprotein E
11808	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
11810	Cytidine and deoxycytidylate deaminase zinc-binding region
11816	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
11818	Sushi domain (SCR repeat)
11819	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
11819	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
11820	Amyloid A4 extracellular domain
11820	Beta-amyloid peptide (beta-APP)
11821	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
11826	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
11827	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
11829	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
11829	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
11830	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
11831	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
11833	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
11835	Androgen receptor
11835	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
11835	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
11836	Protein kinase domain
11836	Raf-like Ras-binding domain
11836	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
11837	Ribosomal protein L10
11838	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
11846	Arginase family
11847	Arginase family
11855	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
11855	FF domain. This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions
11857	RHO protein GDP dissociation inhibitor
11859	Homeobox domain
11861	ADP-ribosylation factor family
11864	Helix-loop-helix DNA-binding domain
11865	Helix-loop-helix DNA-binding domain
11870	NAD:arginine ADP-ribosyltransferase
11871	NAD:arginine ADP-ribosyltransferase
11871	NAD:arginine ADP-ribosyltransferase
11872	NAD:arginine ADP-ribosyltransferase
11875	NAD:arginine ADP-ribosyltransferase
11876	Transforming growth factor beta like domain
11878	Homeobox domain
11881	Sulfatase
11883	Sulfatase
11883	Sulfatase
11886	Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in
11889	Hepatic lectin, N-terminal domain
11889	Lectin C-type domain. This family includes both long and short form C-type
11889	Hepatic lectin, N-terminal domain
11889	Lectin C-type domain. This family includes both long and short form C-type
11890	Hepatic lectin, N-terminal domain
11890	Lectin C-type domain. This family includes both long and short form C-type
11891	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
11898	Arginosuccinate synthase. This family contains a PP-loop motif
11905	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
11906	Homeobox domain
11908	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
11908	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
11909	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
11911	bZIP transcription factor
11920	FAT domain. The FAT domain is named after FRAP, ATM and TRRAP
11923	Helix-loop-helix DNA-binding domain
11924	Helix-loop-helix DNA-binding domain
11925	Helix-loop-helix DNA-binding domain
11927	Heavy-metal-associated domain
11931	Sodium / potassium ATPase beta chain
11932	Sodium / potassium ATPase beta chain
11933	Sodium / potassium ATPase beta chain
11936	ATP1G1/PLM/MAT8 family
11936	ATP1G1/PLM/MAT8 family
11936	ATP1G1/PLM/MAT8 family
11937	Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
11945	Sodium / potassium ATPase beta chain
11947	ATP synthase alpha/beta chain, C terminal domain
11947	ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
11947	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
11947	pfam02874, ATP-synt_ab_N, ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
11949	ATP synthase
11951	ATP synthase subunit C
11972	ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae
11973	ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit
11975	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
11977	Heavy-metal-associated domain
11979	Heavy-metal-associated domain
11984	ATP synthase subunit C
11991	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
11992	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
11993	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
11994	Cadherin domain
11997	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
11998	Neurohypophysial hormones, C-terminal Domain. N-terminal Domain is in hormone5
12000	7 transmembrane receptor (rhodopsin family)
12000	7 transmembrane receptor (rhodopsin family)
12005	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The D
12007	Class I Histocompatibility antigen, domains alpha 1 and 2
12013	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
12014	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
12018	Apoptosis regulator proteins, Bcl-2 family
12020	Homeobox domain
12022	Homeobox domain
12023	Homeobox domain
12028	Apoptosis regulator proteins, Bcl-2 family
12029	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
12032	Extracellular link domain
12032	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
12034	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
12034	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
12034	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
12035	Aminotransferase class IV. The D-amino acid transferases (D-AAT) are required by bacteria to catalyse the synthesis of D-glutamic acid and D-alanine, which are essential constituents of bacterial cell wall and are the building block for other D-amino aci
12036	Aminotransferase class IV. The D-amino acid transferases (D-AAT) are required by bacteria to catalyse the synthesis of D-glutamic acid and D-alanine, which are essential constituents of bacterial cell wall and are the building block for other D-amino acid
12038	Carboxylesterase
12039	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
12040	Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site
12040	Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases
12041	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
12042	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
12043	Apoptosis regulator proteins, Bcl-2 family
12043	Apoptosis regulator proteins, Bcl-2 family
12044	Apoptosis regulator proteins, Bcl-2 family
12045	Apoptosis regulator proteins, Bcl-2 family
12046	Apoptosis regulator proteins, Bcl-2 family
12047	Apoptosis regulator proteins, Bcl-2 family
12048	Apoptosis regulator proteins, Bcl-2 family
12050	Apoptosis regulator proteins, Bcl-2 family
12053	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
12057	7 transmembrane receptor (rhodopsin family)
12061	7 transmembrane receptor (rhodopsin family)
12062	7 transmembrane receptor (rhodopsin family)
12064	Nerve growth factor family
12075	Intermediate filament protein
12091	Glycosyl hydrolases family 35
12096	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
12097	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
12116	Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13,
12116	Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13,
12140	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
12143	SH2 domain
12143	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
12145	7 transmembrane receptor (rhodopsin family)
12151	Zinc finger, C3HC4 type (RING finger)
12153	CUB domain
12154	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
12156	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
12159	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
12160	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
12161	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
12162	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
12163	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
12164	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
12164	Transforming growth factor beta like domain
12164	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
12165	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
12165	Transforming growth factor beta like domain
12165	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
12166	Protein kinase domain
12166	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
12167	Protein kinase domain
12167	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
12168	Protein kinase domain
12168	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
12180	MYND finger
12182	ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion
12182	ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion
12189	Zinc finger, C3HC4 type (RING finger)
12189	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
12190	BRCA2 repeat. The alignment covers only the most conserved region of the repeat
12192	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
12209	7 transmembrane receptor (rhodopsin family)
12211	Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat
12211	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
12215	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
12224	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
12226	BTG1 family. A novel family of anti-proliferative proteins
12227	BTG1 family. A novel family of anti-proliferative proteins
12228	BTG1 family. A novel family of anti-proliferative proteins
12229	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
12235	Protein kinase domain
12235	Protein kinase domain
12236	Protein kinase domain
12236	Protein kinase domain
12257	TspO/MBR family. Tryptophan-rich sensory protein (TspO) is an integral membrane protein that acts as a negative regulator of the expression of specific photosynthesis genes in response to oxygen/light. It is involved in the efflux of porphyrin intermedia
12261	Mitochondrial glycoprotein. This mitochondrial matrix protein family contains members of the MAM33 family which bind to the globular 'heads' of C1Q. It is thought to be involved in mitochondrial oxidative phosphorylation and in nucleus-mitochondrion inte
12262	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
12263	Sushi domain (SCR repeat)
12263	von Willebrand factor type A domain
12263	Trypsin
12263	Sushi domain (SCR repeat)
12263	von Willebrand factor type A domain
12263	Trypsin
12263	Sushi domain (SCR repeat)
12263	von Willebrand factor type A domain
12266	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
12266	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
12266	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f
12266	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
12267	7 transmembrane receptor (rhodopsin family)
12268	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
12268	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
12268	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f
12268	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
12269	Sushi domain (SCR repeat)
12273	7 transmembrane receptor (rhodopsin family)
12274	Sushi domain (SCR repeat)
12274	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
12279	Low-density lipoprotein receptor domain class A
12279	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
12282	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
12283	Mo25 protein family
12286	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
12286	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
12287	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
12288	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
12289	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
12290	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
12290	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
12291	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
12292	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
12293	Cache domain
12294	Cache domain
12295	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
12295	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
12295	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
12296	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
12296	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
12296	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
12297	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
12298	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
12300	PMP-22/EMP/MP20/Claudin family
12304	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
12305	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
12305	Protein kinase domain
12305	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
12306	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
12306	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
12309	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
12309	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ
12310	Calcitonin / CGRP / IAPP family
12311	7 transmembrane receptor (Secretin family)
12311	7 transmembrane receptor (Secretin family)
12311	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
12317	Calreticulin family
12319	Eukaryotic-type carbonic anhydrase
12325	Protein kinase domain
12326	Protein kinase domain
12330	Calreticulin family
12331	CAP protein
12333	Calpain family cysteine protease
12333	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
12334	Calpain family cysteine protease
12334	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
12334	Calpain family cysteine protease
12334	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by
12335	Calpain family cysteine protease
12335	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
12337	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
12337	Calpain family cysteine protease
12337	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by
12338	C2 domain
12338	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
12339	MIT domain
12340	F-actin capping protein alpha subunit
12340	F-actin capping protein alpha subunit
12343	F-actin capping protein alpha subunit
12344	F-actin capping protein alpha subunit
12345	F-actin capping protein, beta subunit
12346	Eukaryotic-type carbonic anhydrase
12348	Eukaryotic-type carbonic anhydrase
12349	Eukaryotic-type carbonic anhydrase
12350	Eukaryotic-type carbonic anhydrase
12351	Eukaryotic-type carbonic anhydrase
12352	Eukaryotic-type carbonic anhydrase
12353	Eukaryotic-type carbonic anhydrase
12354	Eukaryotic-type carbonic anhydrase
12354	Eukaryotic-type carbonic anhydrase
12355	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
12355	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
12359	Catalase
12361	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
12361	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
12361	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
12361	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
12362	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
12363	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
12366	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
12370	Death effector domain
12370	ICE-like protease (caspase) p10 domain
12370	ICE-like protease (caspase) p20 domain
12371	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
12372	Calsequestrin
12373	Calsequestrin
12374	7 transmembrane receptor (metabotropic glutamate family)
12374	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
12378	7 transmembrane receptor (metabotropic glutamate family)
12378	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
12379	7 transmembrane receptor (metabotropic glutamate family)
12379	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
12380	Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins
12380	Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins
12385	Vinculin family
12386	Vinculin family
12386	Vinculin family
12387	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
12388	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
12389	Caveolin
12390	Caveolin
12391	Caveolin
12393	Runt domain
12394	Runt domain
12395	MYND finger
12398	MYND finger
12399	Runt domain
12400	Core binding factor beta subunit. Core binding factor (CBF) is a heterodimeric transcription factor essential for genetic regulation of hematopoiesis and osteogenesis. The beta subunit enhances DNA-binding ability of the alpha subunit in vitro, and has b
12401	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
12402	CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and
12402	CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily co
12402	CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conser
12406	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
12406	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
12408	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
12409	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
12412	'chromo' (CHRromatin Organization MOdifier) domain
12412	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
12416	'chromo' (CHRromatin Organization MOdifier) domain
12417	'chromo' (CHRromatin Organization MOdifier) domain
12417	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
12418	'chromo' (CHRromatin Organization MOdifier) domain
12419	'chromo' (CHRromatin Organization MOdifier) domain
12419	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
12424	Gastrin/cholecystokinin family
12425	7 transmembrane receptor (rhodopsin family)
12426	7 transmembrane receptor (rhodopsin family)
12428	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
12443	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
12444	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
12445	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
12445	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
12445	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
12447	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termi
12447	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
12448	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
12449	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
12450	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
12452	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
12453	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
12454	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
12457	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
12457	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
12458	7 transmembrane receptor (rhodopsin family)
12460	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
12462	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
12464	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
12465	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
12466	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
12466	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
12467	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
12469	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
12476	Tetraspanin family
12482	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
12483	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
12491	CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion
12492	CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion
12493	Tetraspanin family
12494	ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion
12495	GDA1/CD39 (nucleoside phosphatase) family
12496	GDA1/CD39 (nucleoside phosphatase) family
12497	GDA1/CD39 (nucleoside phosphatase) family
12499	GDA1/CD39 (nucleoside phosphatase) family
12505	Extracellular link domain
12505	Extracellular link domain
12507	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
12508	Tetraspanin family
12509	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
12511	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
12512	Tetraspanin family
12514	Lysosome-associated membrane glycoprotein (Lamp)
12515	Lectin C-type domain. This family includes both long and short form C-type
12520	Tetraspanin family
12521	Tetraspanin family
12527	Tetraspanin family
12539	Cdc37 family. In the budding yeast Saccharomyces cerevisiae, Cdc37 is required for the productive formation of Cdc28-cyclin complexes. Cdc37 may be a kinase targeting subunit of Hsp90
12545	Protein kinase domain
12550	Cadherin domain
12550	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
12551	Cadherin domain
12551	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
12552	Cadherin domain
12552	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
12554	Cadherin domain
12555	Cadherin domain
12555	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
12556	Cadherin domain
12557	Cadherin domain
12558	Cadherin domain
12558	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
12560	Cadherin domain
12560	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
12561	Cadherin domain
12561	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
12562	Cadherin domain
12562	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
12563	Cadherin domain
12563	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
12564	Cadherin domain
12564	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
12565	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
12569	Cyclin-dependent kinase 5 activator protein
12570	Cyclin-dependent kinase 5 activator protein
12572	Protein kinase domain
12575	Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase
12576	Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase
12577	Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase
12577	Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase
12577	Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase
12590	Homeobox domain
12591	Homeobox domain
12592	Homeobox domain
12607	CBF/Mak21 family
12613	Carboxylesterase
12614	Cadherin domain
12614	7 transmembrane receptor (Secretin family)
12614	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
12614	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
12616	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
12622	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb
12623	Carboxylesterase
12628	Sushi domain (SCR repeat)
12630	Low-density lipoprotein receptor domain class A
12631	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
12633	Death effector domain
12640	Glycoprotein hormone
12648	'chromo' (CHRromatin Organization MOdifier) domain
12648	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
12651	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
12652	Granin (chromogranin or secretogranin)
12653	Granin (chromogranin or secretogranin)
12654	Glycosyl hydrolases family 18
12660	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
12661	Fibronectin type III domain
12662	GDP dissociation inhibitor
12663	GDP dissociation inhibitor
12667	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
12669	7 transmembrane receptor (rhodopsin family)
12671	7 transmembrane receptor (rhodopsin family)
12672	7 transmembrane receptor (rhodopsin family)
12677	Homeobox domain
12683	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
12684	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
12686	GNS1/SUR4 family
12704	Tropomyosin
12704	Protein kinase domain
12704	Uncharacterized ACR, COG1579
12704	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
12704	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
12704	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
12704	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
12709	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
12709	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
12715	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
12715	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
12716	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
12716	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
12723	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
12723	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
12724	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
12725	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
12727	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
12728	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
12729	Nucleotide-sensitive chloride conductance regulator (ICln)
12733	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
12737	PMP-22/EMP/MP20/Claudin family
12738	PMP-22/EMP/MP20/Claudin family
12739	PMP-22/EMP/MP20/Claudin family
12740	PMP-22/EMP/MP20/Claudin family
12741	PMP-22/EMP/MP20/Claudin family
12741	PMP-22/EMP/MP20/Claudin family
12745	Calreticulin family
12745	Calreticulin family
12752	CLN3 protein. This is a family of proteins from the CLN3 gene. A missense mutation of glutamic acid (E) to lysine (K) at position 295 in the human protein has been implicated in Juvenile neuronal ceroid lipofuscinosis (Batten disease)
12753	Helix-loop-helix DNA-binding domain
12753	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
12753	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
12757	Clathrin light chain
12759	Clusterin
12764	Cytidylyltransferase. This family consists of two main Cytidylyltransferase activities: 1) 3-deoxy-manno-octulosonate cytidylyltransferase,, EC:2.7.7.38 catalysing the reaction:- CTP + 3-deoxy-D-manno-octulosonate <=> diphosphate + CMP-3-deoxy-D-manno-oc
12765	7 transmembrane receptor (rhodopsin family)
12766	7 transmembrane receptor (rhodopsin family)
12767	7 transmembrane receptor (rhodopsin family)
12768	7 transmembrane receptor (rhodopsin family)
12769	7 transmembrane receptor (rhodopsin family)
12770	7 transmembrane receptor (rhodopsin family)
12771	7 transmembrane receptor (rhodopsin family)
12772	7 transmembrane receptor (rhodopsin family)
12773	7 transmembrane receptor (rhodopsin family)
12774	7 transmembrane receptor (rhodopsin family)
12775	7 transmembrane receptor (rhodopsin family)
12776	7 transmembrane receptor (rhodopsin family)
12777	7 transmembrane receptor (rhodopsin family)
12778	7 transmembrane receptor (rhodopsin family)
12788	Cyclic nucleotide-binding domain
12788	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
12789	Cyclic nucleotide-binding domain
12789	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
12790	Cyclic nucleotide-binding domain
12790	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
12793	Cornichon protein
12794	Cornichon protein
12795	POLO box duplicated region
12796	Cathelicidin. A novel protein family, showing a conserved proregion and a variable carboxyl-terminal antimicrobial domain. This region shows similarity to cystatins
12797	Calponin family repeat
12798	Calponin family repeat
12801	7 transmembrane receptor (rhodopsin family)
12802	7 transmembrane receptor (rhodopsin family)
12803	Ciliary neurotrophic factor
12803	Ciliary neurotrophic factor
12805	Fibronectin type III domain
12810	LCCL domain
12810	von Willebrand factor type A domain
12813	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
12814	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
12814	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
12815	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
12816	Fibronectin type III domain
12816	von Willebrand factor type A domain
12816	Thrombospondin N-terminal -like domain
12817	Collagen triple helix repeat (20 copies)
12818	Fibronectin type III domain
12818	von Willebrand factor type A domain
12818	von Willebrand factor type A domain
12818	Thrombospondin N-terminal -like domain
12818	Thrombospondin N-terminal -like domain
12819	Thrombospondin N-terminal -like domain
12819	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
12821	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
12822	Thrombospondin N-terminal -like domain
12822	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
12823	Thrombospondin N-terminal -like domain
12823	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
12824	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
12825	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
12826	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
12827	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
12827	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
12828	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
12829	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
12830	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
12830	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
12832	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
12832	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
12832	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
12832	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
12833	von Willebrand factor type A domain
12833	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
12834	von Willebrand factor type A domain
12834	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
12834	von Willebrand factor type A domain
12835	von Willebrand factor type A domain
12835	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
12835	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
12836	Fibronectin type III domain
12836	von Willebrand factor type A domain
12836	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
12837	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
12839	Thrombospondin N-terminal -like domain
12839	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
12847	Coatomer WD associated domain
12847	WD domain, G-beta repeat
12847	Coatomer WD associated domain
12854	Somatostatin/Cortistatin family. Members of this family are hormones. Somatostatin inhibits the release of somatotropin. Cortistatin is a peptide that is related to the Somatostatins that is found to depresses neuronal electrical activity but, unlike som
12857	pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium
12858	Cytochrome c oxidase subunit Va. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit Va
12859	Cytochrome c oxidase subunit Vb
12861	Cytochrome c oxidase subunit VIa
12862	Cytochrome c oxidase subunit VIa
12864	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
12865	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
12866	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
12867	pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast me
12867	Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast member of this family
12867	pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast mem
12867	Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast member of this family
12867	pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast mem
12868	Cytochrome oxidase c subunit VIII. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIII
12869	Cytochrome oxidase c subunit VIII. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIII
12870	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
12873	Zinc carboxypeptidase
12873	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
12876	Zinc carboxypeptidase
12891	C2 domain
12892	Coproporphyrinogen III oxidase
12894	Choline/Carnitine o-acyltransferase
12895	Choline/Carnitine o-acyltransferase
12896	Choline/Carnitine o-acyltransferase
12902	Sushi domain (SCR repeat)
12903	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
12904	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
12908	Choline/Carnitine o-acyltransferase
12912	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
12912	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
12912	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
12912	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
12916	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
12916	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
12921	7 transmembrane receptor (Secretin family)
12922	7 transmembrane receptor (Secretin family)
12925	LIM domain. This family represents two copies of the LIM structural domain
12928	SH2 domain
12928	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
12929	SH2 domain
12929	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
12933	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
12933	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
12934	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
12944	Pentaxin family. Pentaxins are also known as pentraxins
12945	CUB domain
12945	Zona pellucida-like domain
12945	CUB domain
12945	Zona pellucida-like domain
12945	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
12946	Sushi domain (SCR repeat)
12950	Extracellular link domain
12951	Homeobox domain
12952	FAD binding domain of DNA photolyase
12953	FAD binding domain of DNA photolyase
12954	Hsp20/alpha crystallin family
12954	Alpha crystallin A chain, N terminal
12954	Hsp20/alpha crystallin family
12954	Alpha crystallin A chain, N terminal
12955	Hsp20/alpha crystallin family
12955	Alpha crystallin A chain, N terminal
12957	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
12958	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
12959	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
12960	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
12961	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
12962	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
12964	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
12966	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
12967	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
12968	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
12969	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
12970	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
12971	Ornithine cyclodeaminase/mu-crystallin family. This family contains the bacterial Ornithine cyclodeaminase enzyme EC:4.3.1.12, which catalyses the deamination of ornithine to proline. This family also contains mu-Crystallin the major component of the eye
12972	Zinc-binding dehydrogenase
12974	Citrate synthase
12981	Granulocyte-macrophage colony-stimulating factor
12985	Interleukin-6/G-CSF/MGF family
12986	Fibronectin type III domain
12990	Casein
12991	Casein
12992	Casein
12994	Kappa casein. Kappa-casein is a mammalian milk protein involved in a number of important physiological processes. In the gut, the ingested protein is split into an insoluble peptide (para kappa-casein) and a soluble hydrophilic glycopeptide (caseinomacro
12995	Protein kinase domain
12995	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
12999	Protein kinase domain
13000	Protein kinase domain
13001	Casein kinase II regulatory subunit
13003	Extracellular link domain
13003	Sushi domain (SCR repeat)
13003	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
13004	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
13007	LIM domain. This family represents two copies of the LIM structural domain
13008	LIM domain. This family represents two copies of the LIM structural domain
13009	LIM domain. This family represents two copies of the LIM structural domain
13012	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
13012	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family
13014	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
13030	Papain family cysteine protease
13032	Papain family cysteine protease
13033	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
13034	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
13035	Trypsin
13036	Papain family cysteine protease
13041	Papain family cysteine protease
13043	Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018
13047	Homeobox domain
13047	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
13048	Homeobox domain
13048	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
13051	7 transmembrane receptor (rhodopsin family)
13056	Cytochrome b561
13058	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
13070	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13072	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13074	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13075	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13076	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13077	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13078	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13079	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13081	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13082	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13085	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13086	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13087	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13088	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13089	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13090	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13094	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13095	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13096	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13096	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13096	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
13097	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13098	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13099	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13099	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13101	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13105	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13106	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13107	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13108	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13109	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13110	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13112	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13113	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13114	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13115	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13116	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13117	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13118	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13119	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13120	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13121	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13122	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13123	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13124	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
13132	Phosphotyrosine interaction domain (PTB/PID)
13134	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
13135	DAD family. Members of this family are thought to be integral membrane proteins. Some members of this family have been shown to cause apoptosis if mutated, these proteins are known as DAD for defender against death. The family also includes the epsilon s
13136	Sushi domain (SCR repeat)
13137	Sushi domain (SCR repeat)
13139	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
13139	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
13142	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
13143	Protein kinase domain
13162	Sodium:neurotransmitter symporter family
13163	Daxx Family. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis, but precise role played by Daxx remians to be determined
13166	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
13166	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
13167	Acyl CoA binding protein
13168	Acyl CoA binding protein
13172	Homeobox domain
13176	Fibronectin type III domain
13177	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
13178	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
13180	Pterin 4 alpha carbinolamine dehydratase. Pterin 4 alpha carbinolamine dehydratase is also known as DCoH (dimerisation cofactor of hepatocyte nuclear factor 1-alpha)
13184	Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein
13185	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp
13190	Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372
13191	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
13194	CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p
13195	Pyridoxal-dependent decarboxylase conserved domain
13196	PH domain. PH stands for pleckstrin homology
13196	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
13196	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
13196	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
13196	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
13197	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
13200	Dolichyl-diphosphooligosaccharide-protein glycosyltransferase 48kD subunit. Members of this family are involved in asparagine-linked protein glycosylation. In particular, dolichyl-diphosphooligosaccharide-protein glycosyltransferase (DDOST), also known a
13202	Macrophage migration inhibitory factor (MIF)
13206	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
13206	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
13209	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
13209	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
13215	Beta defensin
13216	Mammalian defensin
13216	Defensin propeptide
13218	Defensin propeptide
13221	Defensin propeptide
13222	Defensin propeptide
13237	Mammalian defensin
13237	Defensin propeptide
13238	Defensin propeptide
13238	Defensin propeptide
13239	Mammalian defensin
13239	Defensin propeptide
13240	Mammalian defensin
13240	Defensin propeptide
13244	Fatty acid desaturase
13340	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their
13345	Helix-loop-helix DNA-binding domain
13346	Intermediate filament protein
13346	Intermediate filament protein
13347	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
13358	Mitochondrial carrier protein
13358	Mitochondrial carrier protein
13358	Mitochondrial carrier protein
13360	Ergosterol biosynthesis ERG4/ERG24 family
13361	Dihydrofolate reductase
13363	Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module
13363	Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation
13368	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
13370	Iodothyronine deiodinase
13371	Iodothyronine deiodinase
13384	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
13388	Delta serrate ligand
13390	Homeobox domain
13392	Homeobox domain
13393	Homeobox domain
13394	Homeobox domain
13401	WD domain, G-beta repeat
13405	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
13405	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
13411	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
13417	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
13421	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
13423	Deoxyribonuclease II
13424	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
13424	Actin interacting protein 3
13424	Uncharacterized ACR, COG1579
13424	Type IV secretion system CagX conjugation protein
13424	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
13424	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
13424	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
13429	Dynamin family
13429	Dynamin GTPase effector domain
13429	PH domain. PH stands for pleckstrin homology
13429	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
13430	Dynamin GTPase effector domain
13430	PH domain. PH stands for pleckstrin homology
13430	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
13433	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
13433	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
13434	C-5 cytosine-specific DNA methylase
13435	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
13435	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
13436	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
13437	Aminopeptidase I zinc metalloprotease (M18)
13446	C2 domain
13447	C2 domain
13449	PTB domain (IRS-1 type)
13476	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
13476	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
13478	Glycosyl transferase
13479	Renal dipeptidase
13480	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
13482	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
13483	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
13487	Sulfate transporter family. Mutations may lead to several human diseases
13487	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
13488	7 transmembrane receptor (rhodopsin family)
13489	7 transmembrane receptor (rhodopsin family)
13490	7 transmembrane receptor (rhodopsin family)
13491	7 transmembrane receptor (rhodopsin family)
13492	7 transmembrane receptor (rhodopsin family)
13492	7 transmembrane receptor (rhodopsin family)
13496	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
13497	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
13497	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
13497	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
13498	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
13505	Cadherin domain
13506	Cadherin domain
13507	Cadherin domain
13507	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
13508	Fibronectin type III domain
13511	Cadherin domain
13512	Cadherin domain
13512	Cadherin domain
13512	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
13516	Leucine rich repeat N-terminal domain
13518	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
13518	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
13521	Sulfate transporter family. Mutations may lead to several human diseases
13522	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
13522	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
13524	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
13524	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
13525	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
13525	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
13526	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
13526	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
13529	Somatotropin hormone family
13537	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
13542	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
13542	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
13542	Dishevelled specific domain. This domain is specific to the signaling protein dishevelled. The domain is found adjacent to the PDZ domain pfam00595, often in conjunction with DEP (pfam00610) and DIX (pfam00778)
13542	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The
13543	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
13543	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
13543	Dishevelled specific domain. This domain is specific to the signaling protein dishevelled. The domain is found adjacent to the PDZ domain pfam00595, often in conjunction with DEP (pfam00610) and DIX (pfam00778)
13543	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The
13544	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
13544	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
13544	Dishevelled specific domain. This domain is specific to the signaling protein dishevelled. The domain is found adjacent to the PDZ domain pfam00595, often in conjunction with DEP (pfam00610) and DIX (pfam00778)
13544	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The
13548	Protein kinase domain
13548	Protein kinase domain
13555	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
13557	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
13559	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
13586	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
13590	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
13591	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
13592	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
13593	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
13602	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand
13603	7 transmembrane receptor (rhodopsin family)
13605	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
13608	Death domain
13609	7 transmembrane receptor (rhodopsin family)
13610	7 transmembrane receptor (rhodopsin family)
13611	7 transmembrane receptor (rhodopsin family)
13612	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
13612	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
13612	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
13614	Endothelin family
13615	Endothelin family
13616	Endothelin family
13617	7 transmembrane receptor (rhodopsin family)
13618	7 transmembrane receptor (rhodopsin family)
13619	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
13627	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
13629	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
13631	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a
13636	Ephrin
13637	Ephrin
13638	Ephrin
13639	Ephrin
13640	Ephrin
13641	Ephrin
13642	Ephrin
13643	Ephrin
13649	Furin-like cysteine rich region
13649	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
13650	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
13650	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. These proteins contain three strongly conserved histidines in the putative trans
13650	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa
13650	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. These proteins contain three strongly conserved histidines in the putative trans
13650	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa
13664	Eukaryotic initiation factor 1A
13667	Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac
13669	V-type ATPase 116kDa subunit family
13684	Eukaryotic initiation factor 4E
13690	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA
13706	Trypsin
13712	Ets-domain
13712	Ets-domain
13713	Ets-domain
13714	Ets-domain
13716	Occludin/ELL family
13722	Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte act
13726	LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1
13726	Translation initiation factor SUI1
13726	LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1
13728	Kinase associated domain 1
13730	PMP-22/EMP/MP20/Claudin family
13731	PMP-22/EMP/MP20/Claudin family
13732	PMP-22/EMP/MP20/Claudin family
13733	7 transmembrane receptor (Secretin family)
13733	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
13796	Homeobox domain
13797	Homeobox domain
13798	Homeobox domain
13799	Homeobox domain
13800	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
13803	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
13809	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
13813	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
13821	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
13822	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
13822	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
13823	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
13824	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
13828	Transglutaminase family
13828	Transglutaminase family, C-terminal ig like domain
13828	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
13830	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
13835	Fibronectin type III domain
13835	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13836	Fibronectin type III domain
13836	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13837	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13837	Protein kinase domain
13837	Fibronectin type III domain
13837	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13837	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
13838	Fibronectin type III domain
13838	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13839	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13839	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
13839	Protein kinase domain
13839	Fibronectin type III domain
13839	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13839	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
13840	Fibronectin type III domain
13840	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13841	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13841	Protein kinase domain
13841	Fibronectin type III domain
13841	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13841	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
13842	Fibronectin type III domain
13842	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13845	Protein kinase domain
13845	Fibronectin type III domain
13845	Giardia variant-specific surface protein
13845	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13845	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
13846	Fibronectin type III domain
13846	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13846	Protein kinase domain
13846	Fibronectin type III domain
13846	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13846	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
13848	Fibronectin type III domain
13848	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
13849	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
13850	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
13852	Syntaxin
13853	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
13855	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
13856	Erythropoietin/thrombopoietin
13856	Erythropoietin/thrombopoietin
13856	Erythropoietin/thrombopoietin
13861	Animal haem peroxidase
13863	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
13864	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
13864	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
13865	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
13865	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
13866	Furin-like cysteine rich region
13866	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
13866	Protein kinase domain
13866	Furin-like cysteine rich region
13866	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
13867	Protein kinase domain
13867	Furin-like cysteine rich region
13869	Furin-like cysteine rich region
13869	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
13875	Ets-domain
13876	Ets-domain
13876	Sterile alpha motif (SAM)/Pointed domain
13876	Ets-domain
13876	Sterile alpha motif (SAM)/Pointed domain
13877	Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene
13884	Carboxylesterase
13897	Carboxylesterase
13909	Carboxylesterase
13982	Oestrogen receptor
13982	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
13982	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
13983	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
13983	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
13984	Homeobox domain
13990	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
13990	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
13990	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u
13998	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
14008	Ets-domain
14009	Ets-domain
14011	Ets-domain
14011	Sterile alpha motif (SAM)/Pointed domain
14013	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
14025	Zinc finger, C2H2 type
14026	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
14027	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
14028	Homeobox domain
14029	Homeobox domain
14042	Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo
14043	Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo
14048	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
14049	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
14050	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
14051	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of t
14055	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
14056	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
14057	Tricarboxylate carrier
14060	Sushi domain (SCR repeat)
14061	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
14062	7 transmembrane receptor (rhodopsin family)
14063	7 transmembrane receptor (rhodopsin family)
14064	7 transmembrane receptor (rhodopsin family)
14065	7 transmembrane receptor (rhodopsin family)
14066	Tissue factor
14067	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
14067	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
14069	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
14069	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
14071	Trypsin
14071	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
14071	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
14071	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy
14073	Amidase
14077	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
14079	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
14080	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
14082	Death effector domain
14083	Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta
14085	Fumarylacetoacetate (FAA) hydrolase family. This family consists of fumarylacetoacetate (FAA) hydrolase, or fumarylacetoacetate hydrolase (FAH) and it also includes HHDD isomerase/OPET decarboxylase from E. coli strain W. FAA is the last enzyme in the ty
14087	Fanconi anaemia group A protein
14088	Fanconi anaemia group C protein
14089	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
14102	TNFR/NGFR cysteine-rich region
14103	TNF(Tumor Necrosis Factor) family
14104	Zinc-binding dehydrogenase
14104	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
14104	Thioesterase domain. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid th
14104	Acyl transferase domain
14104	Zinc-binding dehydrogenase
14104	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
14104	Beta-ketoacyl synthase, C-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains
14104	Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains.
14104	Phosphopantetheine attachment site. A 4'-phosphopantetheine prosthetic group is attached through a serine. This prosthetic group acts as a a 'swinging arm' for the attachment of activated fatty acid and amino-acid groups. This domain forms a four helix bu
14104	Thioesterase domain. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid thi
14107	Cadherin domain
14107	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
14113	Fibrillarin
14118	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
14118	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
14119	Cadherin domain
14119	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
14120	Fructose-1-6-bisphosphatase
14121	Fructose-1-6-bisphosphatase
14126	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
14132	Class I Histocompatibility antigen, domains alpha 1 and 2
14133	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
14134	Fibrinogen beta and gamma chains, C-terminal globular domain
14134	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
14137	Squalene/phytoene synthase
14137	Squalene/phytoene synthase
14151	Ferrochelatase
14154	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
14154	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
14156	XPG I-region
14156	XPG N-terminal domain
14156	5'-3' exonuclease, C-terminal SAM fold
14156	XPG I-region
14156	XPG N-terminal domain
14156	5'-3' exonuclease, C-terminal SAM fold
14158	SH2 domain
14159	SH2 domain
14159	Protein kinase domain
14160	7 transmembrane receptor (rhodopsin family)
14163	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
14163	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
14164	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
14167	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling.
14171	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
14172	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
14173	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
14174	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
14175	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
14176	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
14177	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling.
14178	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
14179	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
14180	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
14186	Protein kinase domain
14186	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
14187	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
14190	Fibrinogen beta and gamma chains, C-terminal globular domain
14190	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
14191	SH2 domain
14191	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
14194	Lyase
14198	HIT family
14200	LIM domain. This family represents two copies of the LIM structural domain
14201	LIM domain. This family represents two copies of the LIM structural domain
14204	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
14208	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
14219	Thrombospondin type 1 domain
14227	FKBP-type peptidyl-prolyl cis-trans isomerase
14228	FKBP-type peptidyl-prolyl cis-trans isomerase
14229	FKBP-type peptidyl-prolyl cis-trans isomerase
14230	FKBP-type peptidyl-prolyl cis-trans isomerase
14231	FKBP-type peptidyl-prolyl cis-trans isomerase
14232	FKBP-type peptidyl-prolyl cis-trans isomerase
14234	Fork head domain
14236	Fork head domain
14236	Fork head domain
14237	Fork head domain
14239	Fork head domain
14240	Fork head domain
14247	Ets-domain
14251	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
14252	Flotillin family. Flotillins are integral membrane proteins that have been shown to be present in several subcellular components, including caveolae (invaginated plasma membrane microdomains), lipid rafts (sphingolipid and cholesterol-rich, detergent-res
14255	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
14255	Protein kinase domain
14255	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
14256	pfam02947, flt3_lig, flt3 ligand. The flt3 ligand is a short chain cytokine with a 4 helical bundle fold
14261	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
14262	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
14263	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
14265	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
14265	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
14265	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
14268	Fibronectin type I domain
14268	Fibronectin type II domain
14268	Fibronectin type III domain
14269	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
14270	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
14270	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
14270	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
14275	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
14276	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
14287	Mur ligase family, catalytic domain. This family contains a number of related ligase enzymes which have EC numbers 6.3.2.*. This family includes: MurC, MurD, MurE, MurF, Mpl and FolC. MurC, MurD, Mure and MurF catalyse consecutive steps in the synthesis o
14288	7 transmembrane receptor (rhodopsin family)
14289	7 transmembrane receptor (rhodopsin family)
14290	7 transmembrane receptor (rhodopsin family)
14291	7 transmembrane receptor (rhodopsin family)
14293	7 transmembrane receptor (rhodopsin family)
14294	7 transmembrane receptor (rhodopsin family)
14297	Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain
14309	7 transmembrane receptor (rhodopsin family)
14311	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
14311	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
14319	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
14325	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
14337	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
14343	Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes
14344	Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes
14345	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
14347	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
14348	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
14352	ENV polyprotein (coat polyprotein)
14356	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
14357	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
14359	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
14360	SH2 domain
14360	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
14362	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
14365	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
14366	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
14367	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
14367	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
14367	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are
14368	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
14369	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
14370	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
14371	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
14371	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are
14376	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
14387	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
14390	Ets-domain
14394	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14394	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14395	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14395	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14396	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14396	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14397	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14397	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14397	Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14397	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14397	Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14397	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14399	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14399	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14400	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14400	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14401	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14401	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14402	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14402	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14403	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14403	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14404	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14404	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14405	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14405	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14406	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14406	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14406	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14406	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14407	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14407	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14408	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14408	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14409	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14409	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14411	Sodium:neurotransmitter symporter family
14412	Sodium:neurotransmitter symporter family
14415	Pyridoxal-dependent decarboxylase conserved domain
14417	Pyridoxal-dependent decarboxylase conserved domain
14420	Glycosyl hydrolase family 59
14421	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
14422	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
14423	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
14425	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
14426	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
14427	7 transmembrane receptor (rhodopsin family)
14428	7 transmembrane receptor (rhodopsin family)
14429	7 transmembrane receptor (rhodopsin family)
14432	IQ calmodulin-binding motif. Calmodulin-binding motif
14433	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
14433	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
14457	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
14457	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
14457	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
14459	Gastrin/cholecystokinin family
14461	GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the d
14465	GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do
14466	O-Glycosyl hydrolase family 30
14467	ssDNA binding protein
14468	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
14468	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
14469	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
14469	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
14470	Prenylated rab acceptor (PRA1)
14470	Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length
14472	Homeobox domain
14473	Serum albumin family
14526	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
14526	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
14527	7 transmembrane receptor (Secretin family)
14531	GCM motif protein
14536	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
14536	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
14536	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
14536	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
14537	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
14537	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
14538	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
14538	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
14539	7 transmembrane receptor (rhodopsin family)
14555	NAD-dependent glycerol-3-phosphate dehydrogenase
14562	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
14563	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
14567	GDP dissociation inhibitor
14568	GDP dissociation inhibitor
14569	GDP dissociation inhibitor
14570	RHO protein GDP dissociation inhibitor
14573	Transforming growth factor beta like domain
14580	Intermediate filament protein
14583	Glutamine amidotransferases class-II
14583	SIS domain. SIS (Sugar ISomerase) domains are found in many phosphosugar isomerases and phosphosugar binding proteins. SIS domains are also found in proteins that regulate the expression of genes involved in synthesis of phosphosugars. Presumably the SIS
14585	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
14585	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuron
14586	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
14587	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
14588	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
14593	Polyprenyl synthetase
14594	Glycosyltransferase family 6
14595	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
14598	Gamma-glutamyltranspeptidase
14599	Somatotropin hormone family
14600	Fibronectin type III domain
14601	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
14602	7 transmembrane receptor (Secretin family)
14602	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
14603	Eukaryotic cobalamin-binding protein
14605	TSC-22/dip/bun family
14607	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
14608	7 transmembrane receptor (rhodopsin family)
14609	Connexin
14609	Gap junction alpha-1 protein (Cx43)
14610	Connexin
14611	Connexin
14612	Connexin
14613	Connexin
14615	Connexin
14616	Connexin
14616	Gap junction alpha-8 protein (Cx50)
14617	Connexin
14618	Connexin
14619	Connexin
14620	Connexin
14621	Connexin
14622	Connexin
14623	Connexin
14625	FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain
14625	FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain
14629	Glutamate-cysteine ligase. This family represents the catalytic subunit of glutamate-cysteine ligase (E.C. 6.3.2.2), also known as gamma-glutamylcysteine synthetase (GCS). This enzyme catalyses the rate limiting step in the biosynthesis of glutathione. T
14629	Glutamate-cysteine ligase. This family represents the catalytic subunit of glutamate-cysteine ligase (E.C. 6.3.2.2), also known as gamma-glutamylcysteine synthetase (GCS). This enzyme catalyses the rate limiting step in the biosynthesis of glutathione. Th
14645	Glutamine synthetase, catalytic domain
14651	Metallo-beta-lactamase superfamily
14652	7 transmembrane receptor (Secretin family)
14654	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14654	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14657	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14657	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14658	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
14658	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
14664	Sodium:neurotransmitter symporter family
14670	GTPase of unknown function
14672	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14673	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14674	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14675	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14676	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14678	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14679	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14679	ADP-ribosylation factor family
14679	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14680	ADP-ribosylation factor family
14680	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14681	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14682	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14683	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14683	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14685	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14686	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14687	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
14693	WD domain, G-beta repeat
14693	WD domain, G-beta repeat
14699	GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins
14715	7 transmembrane receptor (rhodopsin family)
14724	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
14725	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
14729	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
14731	Gaa1-like, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to newly
14733	Glypican
14734	Glypican
14735	Glypican
14738	7 transmembrane receptor (rhodopsin family)
14739	7 transmembrane receptor (rhodopsin family)
14744	7 transmembrane receptor (rhodopsin family)
14745	7 transmembrane receptor (rhodopsin family)
14745	7 transmembrane receptor (rhodopsin family)
14747	7 transmembrane receptor (rhodopsin family)
14747	7 transmembrane receptor (rhodopsin family)
14748	7 transmembrane receptor (rhodopsin family)
14751	Phosphoglucose isomerase. Phosphoglucose isomerase catalyses the interconversion of glucose-6-phosphate and fructose-6-phosphate
14758	Myelin proteolipid protein (PLP or lipophilin)
14760	7 transmembrane receptor (rhodopsin family)
14761	7 transmembrane receptor (rhodopsin family)
14762	7 transmembrane receptor (rhodopsin family)
14762	7 transmembrane receptor (rhodopsin family)
14763	7 transmembrane receptor (rhodopsin family)
14763	7 transmembrane receptor (rhodopsin family)
14763	7TM chemoreceptor. This large family of proteins are related to pfam00001. They are 7 transmembrane receptors. This family does not include all known members, as there are problems with overlapping specificity with pfam00001. This family is greatly expand
14763	7 transmembrane receptor (rhodopsin family)
14763	7TM chemoreceptor. This large family of proteins are related to pfam00001. They are 7 transmembrane receptors. This family does not include all known members, as there are problems with overlapping specificity with pfam00001. This family is greatly expand
14764	7 transmembrane receptor (rhodopsin family)
14765	7 transmembrane receptor (rhodopsin family)
14766	7 transmembrane receptor (Secretin family)
14767	7 transmembrane receptor (rhodopsin family)
14772	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
14773	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
14775	Glutathione peroxidase
14776	Glutathione peroxidase
14778	Glutathione peroxidase
14780	Glutathione peroxidase
14784	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
14784	SH2 domain
14784	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
14786	SH2 domain
14786	PH domain. PH stands for pleckstrin homology
14787	Hr1 repeat
14787	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
14788	7 transmembrane receptor (rhodopsin family)
14791	Suppressor Mra1. The suppressor Mra1 is found in high-copy-number when Ras1 is mutated, that recovers the mating deficiency caused by the decrease of Ras1 activity. Mutational analysis in yeast suggests that the suppressor Mra1 is essential for cell grow
14792	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
14797	Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation
14799	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14800	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14803	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14804	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14805	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14806	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14807	Bacterial extracellular solute-binding proteins, family 3
14807	Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors
14809	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14809	Bacterial extracellular solute-binding proteins, family 3
14809	Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors
14809	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14810	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14811	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14812	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14813	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14814	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14815	Glucocorticoid receptor
14815	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
14815	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
14816	7 transmembrane receptor (metabotropic glutamate family)
14816	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14823	7 transmembrane receptor (metabotropic glutamate family)
14823	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14825	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
14827	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
14828	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
14829	7 transmembrane receptor (rhodopsin family)
14836	Homeobox domain
14842	Homeobox domain
14843	Homeobox domain
14854	Eukaryotic glutathione synthase
14854	Eukaryotic glutathione synthase, ATP binding domain
14857	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
14858	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
14859	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
14860	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
14862	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
14863	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
14864	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
14865	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
14866	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
14866	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
14866	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
14866	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
14866	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
14867	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
14869	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
14870	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
14872	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
14872	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
14872	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
14872	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
14884	BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function
14885	Transcription factor Tfb2
14886	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
14911	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
14912	Homeobox domain
14915	Guanylin precursor
14916	Guanylin precursor
14917	Protein kinase domain
14917	Adenylate and Guanylate cyclase catalytic domain
14919	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
14924	Guanylate kinase
14924	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
14924	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
14933	FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain
14933	FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain
14934	Glycophorin A
14936	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari
14937	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari
14938	Trypsin
14940	Trypsin
14942	Trypsin
14950	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
14957	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
14958	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
14958	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
14960	Class II histocompatibility antigen, alpha domain
14961	Class II histocompatibility antigen, beta domain
14962	Sushi domain (SCR repeat)
14962	von Willebrand factor type A domain
14962	Trypsin
14962	Sushi domain (SCR repeat)
14962	von Willebrand factor type A domain
14962	Trypsin
14962	Sushi domain (SCR repeat)
14962	von Willebrand factor type A domain
14963	Class I Histocompatibility antigen, domains alpha 1 and 2
14964	Class I Histocompatibility antigen, domains alpha 1 and 2
14964	Class I Histocompatibility antigen, domains alpha 1 and 2
14964	Class I Histocompatibility antigen, domains alpha 1 and 2
14967	Class I Histocompatibility antigen, domains alpha 1 and 2
14968	Class II histocompatibility antigen, alpha domain
14968	Class II histocompatibility antigen, alpha domain
14968	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
14968	Class II histocompatibility antigen, alpha domain
14968	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
14969	Class II histocompatibility antigen, beta domain
14969	Class II histocompatibility antigen, beta domain
14969	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
14969	Class II histocompatibility antigen, beta domain
14969	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
14972	Class I Histocompatibility antigen, domains alpha 1 and 2
14972	Class I Histocompatibility antigen, domains alpha 1 and 2
14976	KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif
14977	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
14979	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
14985	Class I Histocompatibility antigen, domains alpha 1 and 2
14985	Class I Histocompatibility antigen, domains alpha 1 and 2
14985	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
14991	Class I Histocompatibility antigen, domains alpha 1 and 2
14997	Class I Histocompatibility antigen, domains alpha 1 and 2
14997	Class I Histocompatibility antigen, domains alpha 1 and 2
14997	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
14997	Class I Histocompatibility antigen, domains alpha 1 and 2
14997	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
14998	Class II histocompatibility antigen, alpha domain
14999	Class II histocompatibility antigen, beta domain
14999	Class II histocompatibility antigen, beta domain
15000	Class II histocompatibility antigen, beta domain
15001	Class II histocompatibility antigen, alpha domain
15001	Class II histocompatibility antigen, alpha domain
15001	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
15001	Class II histocompatibility antigen, alpha domain
15001	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
15002	Class II histocompatibility antigen, beta domain
15006	Class I Histocompatibility antigen, domains alpha 1 and 2
15006	Class I Histocompatibility antigen, domains alpha 1 and 2
15006	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
15007	Class I Histocompatibility antigen, domains alpha 1 and 2
15007	Class I Histocompatibility antigen, domains alpha 1 and 2
15013	Class I Histocompatibility antigen, domains alpha 1 and 2
15016	Class I Histocompatibility antigen, domains alpha 1 and 2
15018	Class I Histocompatibility antigen, domains alpha 1 and 2
15019	Class I Histocompatibility antigen, domains alpha 1 and 2
15024	Class I Histocompatibility antigen, domains alpha 1 and 2
15039	Class I Histocompatibility antigen, domains alpha 1 and 2
15040	Class I Histocompatibility antigen, domains alpha 1 and 2
15042	Class I Histocompatibility antigen, domains alpha 1 and 2
15043	Class I Histocompatibility antigen, domains alpha 1 and 2
15051	Class I Histocompatibility antigen, domains alpha 1 and 2
15064	Class I Histocompatibility antigen, domains alpha 1 and 2
15108	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
15109	Phenylalanine and histidine ammonia-lyase
15111	Helix-loop-helix DNA-binding domain
15112	FMN-dependent dehydrogenase
15117	Chitin synthase. Members of this family are fungal chitin synthase EC:2.4.1.16 enzymes. They catalyse chitin synthesis as follows: UDP-N-acetyl-D-glucosamine + {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N) <=> UDP + {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N+1
15118	Chitin synthase. Members of this family are fungal chitin synthase EC:2.4.1.16 enzymes. They catalyse chitin synthesis as follows: UDP-N-acetyl-D-glucosamine + {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N) <=> UDP + {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N+1
15122	Globin
15126	Globin
15129	Globin
15130	Globin
15132	Globin
15135	Globin
15139	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
15139	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
15139	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f
15139	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
15159	Cytochrome c/c1 heme lyase
15160	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
15162	SH2 domain
15162	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
15163	Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018
15165	Cyclic nucleotide-binding domain
15165	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
15166	Cyclic nucleotide-binding domain
15166	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
15168	Cyclic nucleotide-binding domain
15168	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
15170	SH2 domain
15170	SH2 domain
15170	Protein-tyrosine phosphatase
15170	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
15170	SH2 domain
15170	Protein-tyrosine phosphatase
15170	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
15171	Prepro-orexin
15181	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
15182	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
15185	Zn-finger in ubiquitin-hydrolases and other protein
15186	Pyridoxal-dependent decarboxylase conserved domain
15191	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
15192	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
15193	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
15194	Huntingtin
15203	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
15203	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
15204	Anaphase-promoting complex, subunit 10 (APC10)
15204	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
15205	Helix-loop-helix DNA-binding domain
15206	Helix-loop-helix DNA-binding domain
15208	Helix-loop-helix DNA-binding domain
15209	Homeobox domain
15211	Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold
15212	Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold
15213	Helix-loop-helix DNA-binding domain
15214	Helix-loop-helix DNA-binding domain
15216	Class I Histocompatibility antigen, domains alpha 1 and 2
15221	Fork head domain
15223	Fork head domain
15223	Fork head domain
15228	Fork head domain
15229	Fork head domain
15233	homogentisate 1,2-dioxygenase. Homogentisate dioxygenase cleaves the aromatic ring during the metabolic degradation of Phe and Tyr. Homogentisate dioxygenase deficiency causes alkaptonuria. The structure of homogentisate dioxygenase shows that the enzyme
15234	Trypsin
15234	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
15234	PAN domain. The PAN domain contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some t
15235	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
15235	PAN domain. The PAN domain contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some
15239	VHS domain. Domain present in VPS-27, Hrs and STAM
15242	Homeobox domain
15248	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
15254	HIT family
15254	HIT family
15257	Protein kinase domain
15260	WD domain, G-beta repeat
15273	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
15275	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
15277	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
15284	Homeobox domain
15285	Homeobox domain
15288	Porphobilinogen deaminase, C-terminal domain
15288	Porphobilinogen deaminase, dipyromethane cofactor binding domain
15289	HMG (high mobility group) box
15353	HMG (high mobility group) box
15354	HMG (high mobility group) box
15356	HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase
15356	HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase
15357	Hydroxymethylglutaryl-coenzyme A reductase
15357	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
15360	Hydroxymethylglutaryl-coenzyme A synthase
15368	Heme oxygenase
15369	Heme oxygenase
15370	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
15370	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
15371	Homeobox domain
15372	Homeobox domain
15373	Homeobox domain
15373	Homeobox domain
15375	Fork head domain
15376	Fork head domain
15377	Fork head domain
15378	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
15378	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
15379	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
15388	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
15394	Homeobox domain
15395	Homeobox domain
15396	Homeobox domain
15398	Homeobox domain
15399	Homeobox domain
15402	Homeobox domain
15403	Homeobox domain
15404	Homeobox domain
15405	Homeobox domain
15407	Homeobox domain
15408	Homeobox domain
15410	Homeobox domain
15412	Homeobox domain
15413	Homeobox domain
15414	Homeobox domain
15415	Homeobox domain
15416	Homeobox domain
15416	Homeobox domain
15421	Homeobox domain
15422	Homeobox domain
15423	Homeobox domain
15424	Homeobox domain
15425	Homeobox domain
15425	Homeobox domain
15426	Homeobox domain
15427	Homeobox domain
15429	Homeobox domain
15429	Homeobox domain
15430	Homeobox domain
15431	Homeobox domain
15432	Homeobox domain
15433	Homeobox domain
15434	Homeobox domain
15436	Homeobox domain
15437	Homeobox domain
15438	Homeobox domain
15439	Sushi domain (SCR repeat)
15441	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
15442	Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular
15446	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
15450	Lipase
15450	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
15458	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
15460	jmjC domain
15463	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
15465	7 transmembrane receptor (rhodopsin family)
15465	7 transmembrane receptor (rhodopsin family)
15466	7 transmembrane receptor (rhodopsin family)
15467	Protein kinase domain
15473	Endoribonuclease L-PSP. Endoribonuclease active on single-stranded mRNA. Inhibits protein synthesis by cleavage of mRNA. Previously thought to inhibit protein synthesis initiation. This protein may also be involved in the regulation of purine biosynthesi
15476	Sulfotransferase protein
15478	Sulfotransferase protein
15481	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
15482	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
15482	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
15482	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
15482	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
15482	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
15483	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
15484	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
15485	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
15486	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
15486	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
15487	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
15488	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
15488	MaoC like domain. The MaoC protein is found to share similarity with a wide variety of enzymes
15488	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
15490	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
15492	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
15493	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
15493	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
15494	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
15494	Polysaccharide biosynthesis protein. This is a family of diverse bacterial polysaccharide biosynthesis proteins including the CapD protein, WalL protein mannosyl-transferase and several putative epimerases (e.g. WbiI)
15494	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
15495	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
15495	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
15495	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
15496	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
15497	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
15499	HSF-type DNA-binding
15499	HSF-type DNA-binding domain
15500	HSF-type DNA-binding
15505	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
15507	Hsp20/alpha crystallin family
15512	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
15516	Hsp90 protein
15519	Hsp90 protein
15519	Hsp90 protein
15519	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
15525	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
15526	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
15528	Chaperonin 10 Kd subunit
15529	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
15530	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
15530	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
15531	Sulfotransferase protein
15547	Conserved hypothetical protein 95
15547	ubiE/COQ5 methyltransferase family
15547	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
15547	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
15547	Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat
15547	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
15547	Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat
15547	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
15550	7 transmembrane receptor (rhodopsin family)
15551	7 transmembrane receptor (rhodopsin family)
15552	7 transmembrane receptor (rhodopsin family)
15557	7 transmembrane receptor (rhodopsin family)
15558	7 transmembrane receptor (rhodopsin family)
15559	7 transmembrane receptor (rhodopsin family)
15560	7 transmembrane receptor (rhodopsin family)
15561	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
15561	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
15562	7 transmembrane receptor (rhodopsin family)
15563	7 transmembrane receptor (rhodopsin family)
15564	7 transmembrane receptor (rhodopsin family)
15565	7 transmembrane receptor (rhodopsin family)
15566	7 transmembrane receptor (rhodopsin family)
15567	Sodium:neurotransmitter symporter family
15567	Serotonin (5-HT) neurotransmitter transporter, N-terminus
15571	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
15574	Hus1-like protein. Hus1, Rad1, and Rad9 are three evolutionarily conserved proteins required for checkpoint control in fission yeast. These proteins are known to form a stable complex in vivo. Hus1-Rad1-Rad9 complex may form a PCNA-like ring structure, a
15586	Hyaluronidase
15587	Hyaluronidase
15598	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic d
15894	Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes,
15896	Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes,
15898	Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes,
15900	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp
15901	Helix-loop-helix DNA-binding domain
15901	Helix-loop-helix DNA-binding domain
15902	Helix-loop-helix DNA-binding domain
15930	Indoleamine 2,3-dioxygenase
15931	Sulfatase
15932	Glycosyl hydrolases family 39
15932	Glycosyl hydrolases family 39
15945	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
15950	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
15950	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
15951	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
15951	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
15962	Interferon alpha/beta domain
15964	Interferon alpha/beta domain
15965	Interferon alpha/beta domain
15967	Interferon alpha/beta domain
15968	Interferon alpha/beta domain
15969	Interferon alpha/beta domain
15970	Interferon alpha/beta domain
15972	Interferon alpha/beta domain
15974	Interferon alpha/beta domain
15975	Tissue factor
15976	Tissue factor
15977	Interferon alpha/beta domain
15978	Interferon gamma
15978	Interferon gamma
15979	Tissue factor
15980	Tissue factor
15980	Fibronectin type III domain
16001	Protein kinase domain
16001	Fibronectin type III domain
16001	Furin-like cysteine rich region
16001	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
16004	Cation-independent mannose-6-phosphate receptor repeat. The cation-independent mannose-6-phosphate receptor contains 15 copies of a repeat
16005	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
16006	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
16007	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
16008	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
16009	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
16010	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
16011	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
16012	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
16061	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
16065	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
16098	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
16114	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
16117	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
16147	Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module
16147	Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation
16149	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an
16150	Protein kinase domain
16153	Interleukin 10
16155	Tissue factor
16159	Interleukin-12 alpha subunit. Interleukin 12 (IL-12) is a disulphide-bonded heterodimer consisting of a 35kDa alpha subunit and a 40kDa beta subunit. It is involved in the stimulation and maintenance of Th1 cellular immune responses, including the normal
16161	Fibronectin type III domain
16162	Fibronectin type III domain
16168	Interleukin 15. Interleukin-15 (IL-15) is a cytokine that possesses a variety of biological functions, including stimulation and maintenance of cellular immune responses
16173	Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18
16174	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
16177	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
16179	Protein kinase domain
16180	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
16182	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
16183	Interleukin 2
16183	Interleukin 2
16183	Interleukin 2
16184	Sushi domain (SCR repeat)
16187	Interleukin-3
16189	Interleukin 4
16191	Interleukin 5
16197	Fibronectin type III domain
16198	Interleukin 7/9 family. IL-7 is a cytokine that acts as a growth factor for early lymphoid cells of both B- and T-cell lineages. IL-9 is a multi-functional cytokine that, although originally described as a T-cell growth factor, its function in T-cell res
16202	Protein kinase domain
16204	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
16210	Uncharacterized protein family UPF0029
16319	Inner centromere protein, ARK binding region. This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregati
16319	Inner centromere protein, ARK binding region. This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregatio
16322	Transforming growth factor beta like domain
16323	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
16329	Inositol monophosphatase family
16330	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
16330	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
16332	SH2 domain
16332	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
16333	Insulin/IGF/Relaxin family. Superfamily includes insulins
16334	Insulin/IGF/Relaxin family. Superfamily includes insulins
16336	Insulin/IGF/Relaxin family. Superfamily includes insulins
16337	Fibronectin type III domain
16337	Furin-like cysteine rich region
16337	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
16341	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
16348	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
16348	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
16362	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp
16363	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp
16364	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp
16365	MmgE/PrpD family. This family includes 2-methylcitrate dehydratase EC:4.2.1.79 (PrpD) that is required for propionate catabolism. It catalyses the third step of the 2-methylcitric acid cycle
16367	PTB domain (IRS-1 type)
16369	PTB domain (IRS-1 type)
16370	PTB domain (IRS-1 type)
16371	Homeobox domain
16372	Homeobox domain
16391	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp
16392	LIM domain. This family represents two copies of the LIM structural domain
16396	C2 domain
16396	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
16396	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
16398	von Willebrand factor type A domain
16402	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
16403	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
16407	von Willebrand factor type A domain
16407	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
16407	von Willebrand factor type A domain
16408	von Willebrand factor type A domain
16408	von Willebrand factor type A domain
16408	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
16409	von Willebrand factor type A domain
16411	von Willebrand factor type A domain
16412	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
16412	Integrins, beta chain. Sequences cut off at repeats due to overlap with EGF
16414	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
16415	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
16416	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
16418	eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues
16419	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
16420	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
16421	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
16428	SH2 domain
16428	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
16430	Oligosaccharyl transferase STT3 subunit. This family consists of the oligsacharyl transferase STT3 subunit and related proteins. The STT3 subunit is part of the oligosccharyl transferase (OTase) complex of proteins and is required for its activity. OTase
16431	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
16432	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
16438	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
16438	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16439	MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function
16449	Delta serrate ligand
16450	Delta serrate ligand
16468	jmjC domain
16468	jmjN domain
16468	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
16468	pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function
16469	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d
16469	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
16476	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
16476	Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases
16477	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
16477	Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases
16478	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
16478	Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases
16480	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
16480	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
16485	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16485	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16490	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16490	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16491	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16491	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16492	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16492	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16493	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16493	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16494	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16494	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16495	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16495	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16499	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
16500	Kv2 voltage-gated K+ channel
16500	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16500	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16502	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16502	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16504	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16504	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16506	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16506	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16508	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16508	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16509	Slow voltage-gated potassium channel
16510	Cyclic nucleotide-binding domain
16510	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
16510	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16511	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
16511	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16512	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
16512	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16513	Inward rectifier potassium channel
16514	Inward rectifier potassium channel
16515	Inward rectifier potassium channel
16516	Inward rectifier potassium channel
16516	Inward rectifier potassium channel
16517	Inward rectifier potassium channel
16518	Inward rectifier potassium channel
16519	Inward rectifier potassium channel
16520	Inward rectifier potassium channel
16521	Inward rectifier potassium channel
16522	Inward rectifier potassium channel
16522	Inward rectifier potassium channel
16522	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16523	Inward rectifier potassium channel
16524	Inward rectifier potassium channel
16525	TASK K+ channel
16526	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16528	TASK K+ channel
16529	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16529	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16530	TASK K+ channel
16531	Calcium-activated BK potassium channel alpha subunit
16531	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16532	Calcium-activated BK potassium channel alpha subunit
16532	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16533	Calcium-activated potassium channel, beta subunit
16534	Calcium-activated SK potassium channel
16534	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
16535	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
16535	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16536	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16538	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16538	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16539	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16539	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16539	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
16539	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
16541	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
16548	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es
16549	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
16549	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
16552	Kinesin motor domain
16552	Kinesin motor domain
16559	Kinesin motor domain
16560	Kinesin motor domain
16560	PH domain. PH stands for pleckstrin homology
16560	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
16561	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
16561	Kinesin motor domain
16561	PH domain. PH stands for pleckstrin homology
16561	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
16562	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
16564	Kinesin motor domain
16568	Kinesin motor domain
16568	Kinesin motor domain
16569	Kinesin motor domain
16570	Kinesin motor domain
16572	Kinesin motor domain
16573	Kinesin motor domain
16574	Kinesin motor domain
16578	Kinesin motor domain
16578	Kinesin motor domain
16589	Protein kinase domain
16590	Protein kinase domain
16590	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
16591	Glycosyl hydrolase family 1
16592	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
16618	Trypsin
16625	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
16644	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
16656	Zinc finger, C2H2 type
16658	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
16660	Intermediate filament protein
16661	Intermediate filament protein
16663	Intermediate filament protein
16664	Intermediate filament protein
16665	Intermediate filament protein
16666	Intermediate filament protein
16667	Intermediate filament protein
16668	Intermediate filament protein
16669	Intermediate filament protein
16670	Intermediate filament protein
16671	Intermediate filament protein
16671	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
16672	Intermediate filament protein
16673	Intermediate filament protein
16675	Intermediate filament protein
16678	Intermediate filament protein
16679	Intermediate filament protein
16679	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
16680	Intermediate filament protein
16680	Intermediate filament protein
16681	Intermediate filament protein
16682	Intermediate filament protein
16687	Intermediate filament protein
16688	Intermediate filament protein
16691	Intermediate filament protein
16706	Protein kinase domain
16728	Fibronectin type III domain
16765	Stathmin family
16769	Cadherin domain
16769	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
16770	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzyme
16772	Laminin G domain
16772	Laminin B (Domain IV)
16772	Laminin N-terminal (Domain VI)
16772	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
16773	Laminin G domain
16773	Laminin N-terminal (Domain VI)
16773	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
16773	Tropomyosin
16773	Laminin G domain
16773	Laminin B (Domain IV)
16773	Uncharacterized ACR, COG1579
16773	Intermediate filament protein
16773	Laminin N-terminal (Domain VI)
16773	Giardia variant-specific surface protein
16773	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
16773	Nop14-like family. Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production
16773	Methyl-accepting chemotaxis protein (MCP) signaling domain. This domain is thought to transduce the signal to CheA since it is highly conserved in very diverse MCPs
16773	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
16773	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
16773	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
16773	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
16773	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
16773	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
16774	Laminin B (Domain IV)
16774	Ribosomal protein L15
16774	Laminin N-terminal (Domain VI)
16774	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
16775	Intermediate filament protein
16775	Thrombospondin N-terminal -like domain
16775	Giardia variant-specific surface protein
16775	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
16775	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
16776	Laminin B (Domain IV)
16776	HlyD family secretion protein
16776	Intermediate filament protein
16776	Laminin N-terminal (Domain VI)
16776	Borrelia outer surface protein D (OspD)
16776	Giardia variant-specific surface protein
16776	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
16776	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
16776	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, Apolipoprotein E
16776	ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits a
16776	Rotavirus major capsid protein VP6. Rotaviruses consist of three concentric protein shells. The intermediate (middle) protein layer consists 260 trimers of VP6. VP6 in the most abundant protein in the virion. VP6 is also involved in virion assembly, and p
16776	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
16779	Laminin N-terminal (Domain VI)
16779	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
16780	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
16780	Laminin N-terminal (Domain VI)
16780	Giardia variant-specific surface protein
16780	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
16782	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
16783	Lysosome-associated membrane glycoprotein (Lamp)
16784	Lysosome-associated membrane glycoprotein (Lamp)
16785	Ribosomal protein S2
16790	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
16792	Golgi 4-transmembrane spanning transporter
16795	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl
16796	LIM domain. This family represents two copies of the LIM structural domain
16798	Protein kinase domain
16798	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
16803	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
16814	Homeobox domain
16815	Homeobox domain
16816	Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol
16818	SH2 domain
16818	Protein kinase domain
16818	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
16819	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
16820	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
16821	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
16825	LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located i
16826	LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located i
16828	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
16828	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
16835	Low-density lipoprotein receptor domain class A
16835	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
16840	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
16842	HMG (high mobility group) box
16846	Leptin
16848	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
16852	Galactoside-binding lectin
16854	Galactoside-binding lectin
16855	Galactoside-binding lectin
16867	7 transmembrane receptor (rhodopsin family)
16869	Homeobox domain
16869	LIM domain. This family represents two copies of the LIM structural domain
16870	Homeobox domain
16870	LIM domain. This family represents two copies of the LIM structural domain
16871	Homeobox domain
16871	LIM domain. This family represents two copies of the LIM structural domain
16872	Homeobox domain
16872	LIM domain. This family represents two copies of the LIM structural domain
16873	Homeobox domain
16873	LIM domain. This family represents two copies of the LIM structural domain
16874	Homeobox domain
16874	LIM domain. This family represents two copies of the LIM structural domain
16875	Homeobox domain
16875	LIM domain. This family represents two copies of the LIM structural domain
16876	Homeobox domain
16876	LIM domain. This family represents two copies of the LIM structural domain
16878	LIF / OSM family
16880	Fibronectin type III domain
16882	ATP dependent DNA ligase domain. This domain belongs to a more diverse superfamily, including pfam01331 and pfam01653
16882	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
16882	Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc
16885	Protein kinase domain
16885	LIM domain. This family represents two copies of the LIM structural domain
16885	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
16886	LIM domain. This family represents two copies of the LIM structural domain
16886	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
16886	Protein kinase domain
16886	LIM domain. This family represents two copies of the LIM structural domain
16886	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
16889	ab-hydrolase associated lipase region
16891	Lipase
16891	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
16905	Intermediate filament protein
16905	Intermediate filament tail domain
16906	Intermediate filament protein
16906	Intermediate filament tail domain
16907	Intermediate filament protein
16907	Intermediate filament tail domain
16909	LIM domain. This family represents two copies of the LIM structural domain
16911	LIM domain. This family represents two copies of the LIM structural domain
16912	Proteasome A-type and B-type
16913	Proteasome A-type and B-type
16913	Proteasome A-type and B-type
16917	Homeobox domain
16917	LIM domain. This family represents two copies of the LIM structural domain
16918	Helix-loop-helix DNA-binding domain
16918	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo
16948	Lysyl oxidase
16950	Lysyl oxidase
16952	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
16956	Lipase
16956	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
16956	Lipase
16956	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
16969	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
16971	Low-density lipoprotein receptor domain class A
16971	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
16973	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
16974	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
16975	Low-density lipoprotein receptor domain class A
16975	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
16980	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
16980	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
16981	Fibronectin type III domain
16981	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
16985	Caldesmon
16992	TNF(Tumor Necrosis Factor) family
16993	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
16994	TNF(Tumor Necrosis Factor) family
16995	7 transmembrane receptor (rhodopsin family)
16997	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
16998	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
17000	TNFR/NGFR cysteine-rich region
17001	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
17002	Transferrin
17025	Delta-aminolevulinic acid dehydratase
17057	Lectin C-type domain. This family includes both long and short form C-type
17058	Lectin C-type domain. This family includes both long and short form C-type
17063	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
17068	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
17069	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
17075	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
17076	Lectin C-type domain. This family includes both long and short form C-type
17076	Lectin C-type domain. This family includes both long and short form C-type
17079	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
17079	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
17083	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
17083	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
17084	ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r
17087	ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r
17101	Beige/BEACH domain
17113	Cation-dependent mannose-6-phosphate receptor
17116	Mab-21 protein
17119	Helix-loop-helix DNA-binding domain
17125	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
17125	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
17126	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
17126	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
17127	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
17127	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
17128	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
17128	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
17129	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
17129	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
17131	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
17131	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
17133	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
17134	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
17135	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
17137	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
17138	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
17139	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
17139	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
17140	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
17141	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
17142	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
17144	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
17145	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
17146	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
17147	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
17149	Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the
17155	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
17156	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
17156	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
17158	Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism
17159	Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism
17160	Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism
17161	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
17164	Protein kinase domain
17167	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
17168	Methylpurine-DNA glycosylase (MPG). Methylpurine-DNA glycosylase is a base excision-repair protein. It is responsible for the hydrolysis of the deoxyribose N-glycosidic bond, excising 3-methyladenine and 3-methylguanine from damaged DNA
17169	Kinase associated domain 1
17169	Kinase associated domain 1
17171	7 transmembrane receptor (rhodopsin family)
17171	7 transmembrane receptor (rhodopsin family)
17174	Trypsin
17174	CUB domain
17174	Trypsin
17174	CUB domain
17174	Sushi domain (SCR repeat)
17178	ATP1G1/PLM/MAT8 family
17180	von Willebrand factor type A domain
17181	von Willebrand factor type A domain
17181	von Willebrand factor type A domain
17182	von Willebrand factor type A domain
17183	von Willebrand factor type A domain
17187	Helix-loop-helix DNA-binding domain
17189	Globin
17190	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
17191	Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has
17194	Lectin C-type domain. This family includes both long and short form C-type
17195	Lectin C-type domain. This family includes both long and short form C-type
17196	Myelin basic protein
17199	7 transmembrane receptor (rhodopsin family)
17200	7 transmembrane receptor (rhodopsin family)
17201	7 transmembrane receptor (rhodopsin family)
17202	7 transmembrane receptor (rhodopsin family)
17203	7 transmembrane receptor (rhodopsin family)
17207	PH domain. PH stands for pleckstrin homology
17207	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
17207	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
17207	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
17210	Apoptosis regulator proteins, Bcl-2 family
17215	MCM2/3/5 family
17219	MCM2/3/5 family
17220	MCM2/3/5 family
17228	Trypsin
17231	Trypsin
17232	Trypsin
17242	PTN/MK heparin-binding protein family
17246	p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding do
17248	p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding do
17252	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
17258	SRF-type transcription factor (DNA-binding and dimerisation domain)
17259	SRF-type transcription factor (DNA-binding and dimerisation domain)
17260	SRF-type transcription factor (DNA-binding and dimerisation domain)
17261	SRF-type transcription factor (DNA-binding and dimerisation domain)
17274	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
17276	ENV polyprotein (coat polyprotein)
17279	Kinase associated domain 1
17281	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
17281	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
17281	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
17281	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
17285	Homeobox domain
17285	Homeobox domain
17286	Homeobox domain
17286	Homeobox domain
17287	MAM domain. An extracellular domain found in many receptors
17287	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
17287	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
17287	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
17288	MAM domain. An extracellular domain found in many receptors
17288	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
17288	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
17288	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
17289	Fibronectin type III domain
17292	Helix-loop-helix DNA-binding domain
17292	Helix-loop-helix DNA-binding domain
17293	Helix-loop-helix DNA-binding domain
17295	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
17295	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
17295	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
17299	Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity
17300	Fork head domain
17301	Fork head domain
17304	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
17304	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
17305	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
17306	Synaptophysin / synaptoporin
17308	GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis o
17311	Stem cell factor. Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. The crystal structure of human SCF has been resolved and a potential receptor-binding site
17312	Hepatic lectin, N-terminal domain
17319	Macrophage migration inhibitory factor (MIF)
17329	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
17329	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
17329	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
17330	Histidine acid phosphatase
17339	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
17341	Helix-loop-helix DNA-binding domain
17342	Helix-loop-helix DNA-binding domain
17344	pfam02891, zf-MIZ, MIZ zinc finger
17344	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
17381	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
17381	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17381	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
17384	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
17384	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17384	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
17385	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17385	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
17386	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
17386	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17386	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
17387	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
17387	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17387	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
17388	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
17388	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17388	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
17389	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
17389	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17389	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
17390	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
17390	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17390	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
17391	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
17391	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17391	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
17392	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
17392	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17392	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
17393	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
17393	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17394	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
17394	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17394	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
17395	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
17395	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
17423	Sulfotransferase protein
17425	Fork head domain
17425	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
17427	V-type ATPase 116kDa subunit family
17428	Helix-loop-helix DNA-binding domain
17434	MoaE protein. This family contains the MoaE protein that is involved in biosynthesis of molybdopterin. Molybdopterin, the universal component of the pterin molybdenum cofactors, contains a dithiolene group serving to bind Mo. Addition of the dithiolene s
17436	Malic enzyme, N-terminal domain
17441	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
17444	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
17444	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
17451	Protein kinase domain
17463	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has
17472	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
17472	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
17472	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
17472	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
17523	Animal haem peroxidase
17524	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
17527	Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is
17533	Lectin C-type domain. This family includes both long and short form C-type
17534	Lectin C-type domain. This family includes both long and short form C-type
17535	Mer11 DNA-binding domain. The Mre11 complex is a multisubunit nuclease that is composed of Mre11, Rad50 and Nbs1/Xrs2, and is involved in checkpoint signalling and DNA replication. Mre11 has an intrinsic DNA-binding activity that is stimulated by Rad50 o
17686	MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of
17687	DNA mismatch repair proteins, mutS family
17687	MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p
17687	DNA mismatch repair proteins, mutS family
17687	MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p
17688	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
17698	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
17698	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
17700	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
17701	Homeobox domain
17702	Homeobox domain
17703	Homeobox domain
17713	GrpE
17714	GrpE
17714	GrpE
17749	DNA directed RNA polymerase, 7 kDa subunit
17755	Presenilin
17756	Tau and MAP protein, tubulin-binding repeat
17758	Tau and MAP protein, tubulin-binding repeat
17762	Tau and MAP protein, tubulin-binding repeat
17765	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
17766	NUDIX domain
17768	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain
17769	Methylenetetrahydrofolate reductase. This family includes the 5,10-methylenetetrahydrofolate reductase EC:1.7.99.5 from bacteria and methylenetetrahydrofolate reductase EC: 1.5.1.20 from eukaryotes. The structure for this domain is known to be a TIM barr
17771	Tesmin/TSO1-like CXC domain. This family includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family is called a CXC domain in
17772	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
17773	7 transmembrane receptor (rhodopsin family)
17773	7 transmembrane receptor (rhodopsin family)
17775	Golgi 4-transmembrane spanning transporter
17775	Golgi 4-transmembrane spanning transporter
17777	Lipoprotein amino terminal region. This family contains regions from: Vitellogenin, Microsomal triglyceride transfer protein and apolipoprotein B-100. These proteins are all involved in lipid transport. This family contains the LV1n chain from lipovitell
17829	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
17831	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
17831	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerbe
17832	B-box zinc finger
17832	Zinc finger, C3HC4 type (RING finger)
17832	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
17833	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
17836	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
17836	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
17837	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
17837	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
17840	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
17842	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
17843	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
17844	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
17850	Methylmalonyl-CoA mutase. The enzyme methylmalonyl-CoA mutase is a member of a class of enzymes that uses coenzyme B12 (adenosylcobalamin) as a cofactor. The enzyme induces the formation of an adenosyl radical from the cofactor. This radical then initiat
17857	Dynamin GTPase effector domain
17857	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
17858	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
17859	Helix-loop-helix DNA-binding domain
17864	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
17869	Helix-loop-helix DNA-binding domain
17869	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo
17870	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo
17873	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
17874	Death domain
17874	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
17877	Helix-loop-helix DNA-binding domain
17877	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates
17877	Helix-loop-helix DNA-binding domain
17877	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates s
17878	Helix-loop-helix DNA-binding domain
17878	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates
17880	Myosin head (motor domain)
17880	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
17880	Intermediate filament protein
17880	Myosin head (motor domain)
17880	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
17880	Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown
17880	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p
17880	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
17880	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
17882	Myosin head (motor domain)
17884	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
17886	Myosin head (motor domain)
17886	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
17886	Myosin head (motor domain)
17888	Myosin head (motor domain)
17888	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
17909	PH domain. PH stands for pleckstrin homology
17909	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
17909	Myosin head (motor domain)
17909	PH domain. PH stands for pleckstrin homology
17909	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
17909	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
17918	Myosin head (motor domain)
17918	DIL domain. The DIL domain has no known function
17919	DIL domain. The DIL domain has no known function
17922	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
17922	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
17925	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
17927	Helix-loop-helix DNA-binding domain
17927	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates
17928	Helix-loop-helix DNA-binding domain
17928	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates
17930	Fibronectin type III domain
17931	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
17931	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
17932	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
17933	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
17939	Melibiase
17952	Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat
17954	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
17955	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
17960	N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation
17961	N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation
17962	N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation
17965	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb
17967	Fibronectin type III domain
17968	Fibronectin type III domain
17969	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
17970	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
17973	SH2 domain
17973	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
17974	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
17978	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
17984	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
17988	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
17990	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
17996	Nebulin repeat
17999	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
18000	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
18001	DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh
18001	DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds
18001	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
18002	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
18007	Fibronectin type III domain
18008	Intermediate filament protein
18012	Helix-loop-helix DNA-binding domain
18012	Helix-loop-helix DNA-binding domain
18013	Helix-loop-helix DNA-binding domain
18014	Helix-loop-helix DNA-binding domain
18016	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
18016	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
18021	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
18027	CTF/NF-I family
18027	CTF/NF-I family
18028	CTF/NF-I family
18029	CTF/NF-I family
18032	CTF/NF-I family
18033	Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha
18034	Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha
18039	Intermediate filament protein
18039	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
18040	Intermediate filament protein
18041	Aminotransferase class-V
18041	Pyridoxal-dependent decarboxylase conserved domain
18041	DegT/DnrJ/EryC1/StrS aminotransferase family. The members of this family are probably all pyridoxal-phosphate-dependent aminotransferase enzymes with a variety of molecular functions. The family includes StsA, StsC and StsS. The aminotransferase activity
18045	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
18048	Trypsin
18049	Nerve growth factor family
18050	Trypsin
18053	TNFR/NGFR cysteine-rich region
18054	Cathelicidin. A novel protein family, showing a conserved proregion and a variable carboxyl-terminal antimicrobial domain. This region shows similarity to cystatins
18073	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
18073	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
18073	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
18073	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an
18074	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
18087	Cyclophilin type peptidyl-prolyl cis-trans isomerase
18088	Homeobox domain
18089	Homeobox domain
18091	Homeobox domain
18092	Homeobox domain
18094	Homeobox domain
18095	Homeobox domain
18096	Homeobox domain
18101	7 transmembrane receptor (rhodopsin family)
18102	Nucleoside diphosphate kinase
18103	Nucleoside diphosphate kinase
18104	Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones
18105	Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones
18109	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo
18113	NNMT/PNMT/TEMT family
18115	NAD(P) transhydrogenase beta subunit. This family corresponds to the beta subunit of NADP transhydrogenase in prokaryotes, and either the protein N- or C terminal in eukaryotes. The domain is often found in conjunction with pfam01262. Pyridine nucleotide
18117	Uncharacterised protein family (UPF0172)
18124	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
18124	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
18125	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain
18125	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
18126	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain
18127	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain
18128	Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved
18130	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
18131	Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved
18140	YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named
18140	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
18140	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
18141	RanBP1 domain
18145	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
18148	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
18150	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
18155	Vertebrate endogenous opioids neuropeptide
18155	Vertebrate endogenous opioids neuropeptide
18158	Atrial natriuretic peptide
18159	Atrial natriuretic peptide
18160	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
18162	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
18166	7 transmembrane receptor (rhodopsin family)
18167	7 transmembrane receptor (rhodopsin family)
18168	7 transmembrane receptor (rhodopsin family)
18169	7 transmembrane receptor (rhodopsin family)
18171	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
18171	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
18171	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
18171	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
18173	Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined
18174	Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined
18175	Nebulin repeat
18183	Neuregulin family
18185	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
18186	CUB domain
18186	MAM domain. An extracellular domain found in many receptors
18186	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
18187	CUB domain
18187	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
18187	CUB domain
18187	MAM domain. An extracellular domain found in many receptors
18187	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
18188	Transforming growth factor beta like domain
18189	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
18189	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
18191	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
18193	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
18193	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
18194	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
18195	Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist o
18198	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
18201	Beige/BEACH domain
18205	Nerve growth factor family
18208	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
18209	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
18211	Protein kinase domain
18214	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
18216	7 transmembrane receptor (rhodopsin family)
18217	7 transmembrane receptor (rhodopsin family)
18217	7 transmembrane receptor (rhodopsin family)
18218	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
18221	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
18222	Phosphotyrosine interaction domain (PTB/PID)
18223	Phosphotyrosine interaction domain (PTB/PID)
18227	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
18227	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
18241	Ocular albinism type 1 protein
18241	Ocular albinism type 1 protein
18241	7 transmembrane receptor (Secretin family)
18241	Ocular albinism type 1 protein
18241	7 transmembrane receptor (Secretin family)
18245	Ornithine decarboxylase antizyme
18247	Ornithine decarboxylase antizyme
18249	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
18256	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t
18260	Occludin/ELL family
18263	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
18291	Homeobox domain
18292	Homeobox domain
18293	Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases
18293	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
18301	ATP1G1/PLM/MAT8 family
18302	Zona pellucida-like domain
18310	7 transmembrane receptor (rhodopsin family)
18312	7 transmembrane receptor (rhodopsin family)
18313	7 transmembrane receptor (rhodopsin family)
18314	7 transmembrane receptor (rhodopsin family)
18315	7 transmembrane receptor (rhodopsin family)
18315	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
18316	7 transmembrane receptor (rhodopsin family)
18318	7 transmembrane receptor (rhodopsin family)
18320	7 transmembrane receptor (rhodopsin family)
18321	7 transmembrane receptor (rhodopsin family)
18322	7 transmembrane receptor (rhodopsin family)
18322	7 transmembrane receptor (rhodopsin family)
18323	7 transmembrane receptor (rhodopsin family)
18324	7 transmembrane receptor (rhodopsin family)
18325	7 transmembrane receptor (rhodopsin family)
18329	7 transmembrane receptor (rhodopsin family)
18330	7 transmembrane receptor (rhodopsin family)
18331	7 transmembrane receptor (rhodopsin family)
18332	7 transmembrane receptor (rhodopsin family)
18340	7 transmembrane receptor (rhodopsin family)
18341	7 transmembrane receptor (rhodopsin family)
18342	7 transmembrane receptor (rhodopsin family)
18344	7 transmembrane receptor (rhodopsin family)
18345	7 transmembrane receptor (rhodopsin family)
18346	7 transmembrane receptor (rhodopsin family)
18347	7 transmembrane receptor (rhodopsin family)
18348	7 transmembrane receptor (rhodopsin family)
18349	7 transmembrane receptor (rhodopsin family)
18350	7 transmembrane receptor (rhodopsin family)
18351	7 transmembrane receptor (rhodopsin family)
18352	7 transmembrane receptor (rhodopsin family)
18354	7 transmembrane receptor (rhodopsin family)
18355	7 transmembrane receptor (rhodopsin family)
18356	7 transmembrane receptor (rhodopsin family)
18357	7 transmembrane receptor (rhodopsin family)
18357	7 transmembrane receptor (rhodopsin family)
18358	7 transmembrane receptor (rhodopsin family)
18359	7 transmembrane receptor (rhodopsin family)
18361	7 transmembrane receptor (rhodopsin family)
18362	7 transmembrane receptor (rhodopsin family)
18363	7 transmembrane receptor (rhodopsin family)
18365	7 transmembrane receptor (rhodopsin family)
18366	7 transmembrane receptor (rhodopsin family)
18367	7 transmembrane receptor (rhodopsin family)
18368	7 transmembrane receptor (rhodopsin family)
18369	7 transmembrane receptor (rhodopsin family)
18370	7 transmembrane receptor (rhodopsin family)
18373	7 transmembrane receptor (rhodopsin family)
18386	7 transmembrane receptor (rhodopsin family)
18387	7 transmembrane receptor (rhodopsin family)
18389	7 transmembrane receptor (rhodopsin family)
18390	7 transmembrane receptor (rhodopsin family)
18393	Origin recognition complex subunit 2. All DNA replication initiation is driven by a single conserved eukaryotic initiator complex termed he origin recognition complex (ORC). The ORC is a six protein complex. The function of ORC is reviewed in
18399	Sugar (and other) transporter
18405	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
18406	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
18408	Mitochondrial carrier protein
18408	Mitochondrial carrier protein
18413	LIF / OSM family
18414	Fibronectin type III domain
18415	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
18417	PMP-22/EMP/MP20/Claudin family
18419	von Willebrand factor type D domain
18420	Homeobox domain
18423	Homeobox domain
18423	Otx1 transcription factor
18424	Homeobox domain
18424	Otx1 transcription factor
18429	Neurohypophysial hormones, C-terminal Domain. N-terminal Domain is in hormone5
18430	7 transmembrane receptor (rhodopsin family)
18436	ATP P2X receptor
18438	ATP P2X receptor
18439	ATP P2X receptor
18440	ATP P2X receptor
18441	7 transmembrane receptor (rhodopsin family)
18442	7 transmembrane receptor (rhodopsin family)
18451	2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2
18453	Calsequestrin
18453	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
18458	Poly-adenylate binding protein, unique domain
18475	Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellu
18476	Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellu
18478	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
18479	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
18481	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
18483	Paralemmin
18484	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
18484	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
18488	Fibronectin type III domain
18489	Lectin C-type domain. This family includes both long and short form C-type
18491	Sushi domain (SCR repeat)
18503	'Paired box' domain
18505	Homeobox domain
18505	'Paired box' domain
18506	Homeobox domain
18506	'Paired box' domain
18507	'Paired box' domain
18508	Homeobox domain
18508	'Paired box' domain
18509	Homeobox domain
18509	'Paired box' domain
18510	'Paired box' domain
18511	'Paired box' domain
18514	Homeobox domain
18514	PBX domain. The PBX domain is a bipartite acidic domain
18515	Homeobox domain
18515	PBX domain. The PBX domain is a bipartite acidic domain
18516	Homeobox domain
18516	PBX domain. The PBX domain is a bipartite acidic domain
18518	TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway
18521	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
18530	Cadherin domain
18534	Phosphoenolpyruvate carboxykinase. Catalyses the formation of phosphoenolpyruvate by decarboxylation of oxaloacetate
18537	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
18538	Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA
18542	CUB domain
18548	Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic
18548	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
18549	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
18550	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
18551	Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic
18551	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
18551	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold cont
18552	Furin-like cysteine rich region
18552	Giardia variant-specific surface protein
18552	Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic d
18552	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold cont
18553	Furin-like cysteine rich region
18553	Giardia variant-specific surface protein
18553	Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic d
18553	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold cont
18554	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
18567	MYND finger
18567	Programmed cell death protein 2, C-terminal domain
18571	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
18573	3'5'-cyclic nucleotide phosphodiesterase
18574	3'5'-cyclic nucleotide phosphodiesterase
18575	3'5'-cyclic nucleotide phosphodiesterase
18576	3'5'-cyclic nucleotide phosphodiesterase
18577	3'5'-cyclic nucleotide phosphodiesterase
18578	3'5'-cyclic nucleotide phosphodiesterase
18583	3'5'-cyclic nucleotide phosphodiesterase
18584	3'5'-cyclic nucleotide phosphodiesterase
18585	3'5'-cyclic nucleotide phosphodiesterase
18587	3'5'-cyclic nucleotide phosphodiesterase
18590	Platelet-derived growth factor (PDGF)
18591	Platelet-derived growth factor (PDGF)
18595	Protein kinase domain
18595	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
18595	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
18597	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
18598	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
18599	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
18600	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
18601	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
18602	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
18604	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
18605	Somatomedin B domain
18605	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
18606	Somatomedin B domain
18606	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
18609	Homeobox domain
18610	Vertebrate endogenous opioids neuropeptide
18611	Death effector domain
18612	Ets-domain
18616	Actinobacillus constitutively-expressed outer membrane lipoprotein A
18616	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
18618	Phospholipid methyltransferase. The S. cerevisiae phospholipid methyltransferase (EC:2.1.1.16) has a broad substrate specificity of unsaturated phospholipids
18624	metallopeptidase family M24
18630	WD domain, G-beta repeat
18636	Thrombospondin type 1 domain
18637	KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif
18639	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
18640	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
18640	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta
18640	Phosphoglycerate mutase family
18640	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
18640	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme contai
18641	Phosphofructokinase
18642	Phosphofructokinase
18646	C2 domain
18646	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
18655	Phosphoglycerate kinase
18663	Phosphoglycerate kinase
18667	Progesterone receptor
18667	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
18673	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
18674	Mitochondrial carrier protein
18676	jmjC domain
18676	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
18700	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
18704	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
18704	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
18706	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
18707	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
18707	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
18708	SH2 domain
18708	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
18708	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
18710	SH2 domain
18710	SH2 domain
18711	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
18712	Protein kinase domain
18718	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K cata
18719	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K cata
18738	Phosphatidylinositol transfer protein
18738	Phosphatidylinositol transfer protein
18739	Phosphatidylinositol transfer protein
18739	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester
18740	Homeobox domain
18741	Homeobox domain
18742	OAR domain
18742	Homeobox domain
18750	C2 domain
18750	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
18751	C2 domain
18751	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
18752	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
18753	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
18754	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
18755	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
18759	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
18760	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
18761	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
18761	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
18762	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
18763	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
18763	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
18763	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
18763	REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c
18764	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
18766	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
18766	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
18766	REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c
18767	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
18768	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
18768	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
18769	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
18770	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
18775	Somatotropin hormone family
18776	Somatotropin hormone family
18777	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
18778	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
18780	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
18782	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
18787	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
18791	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
18792	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
18795	C2 domain
18795	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
18795	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
18797	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
18797	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
18798	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
18798	C2 domain
18798	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
18798	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
18798	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
18799	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
18802	PH domain. PH stands for pleckstrin homology
18803	C2 domain
18803	SH2 domain
18803	PH domain. PH stands for pleckstrin homology
18803	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
18803	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
18803	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
18803	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be
18805	PH domain. PH stands for pleckstrin homology
18805	PX domain. PX domains bind to phosphoinositides
18805	Phospholipase D. Active site motif. Phosphatidylcholine-hydrolyzing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certa
18806	PX domain. PX domains bind to phosphoinositides
18810	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
18810	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
18811	Somatotropin hormone family
18811	Somatotropin hormone family
18812	Somatotropin hormone family
18813	metallopeptidase family M24
18813	metallopeptidase family M24
18814	Somatotropin hormone family
18815	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
18817	POLO box duplicated region
18821	Phospholamban. The regulation of calcium levels across the membrane of the sarcoplasmic reticulum involves the interplay of many membrane proteins. Phospholamban is a 52 residue integral membrane protein that is involved in reversibly inhibiting the Ca(2
18823	Myelin proteolipid protein (PLP or lipophilin)
18828	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve
18844	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
18844	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
18845	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
18845	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
18845	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans prot
18845	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
18846	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
18846	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
18858	PMP-22/EMP/MP20/Claudin family
18933	Homeobox domain
18935	Homeobox domain
18946	Lipase
18946	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
18947	Lipase
18948	NNMT/PNMT/TEMT family
18950	Phosphorylase family 2
18952	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
18968	DNA polymerase family B. This region of DNA polymerase B appears to consist of more than one structural domain, possibly including elongation, DNA-binding and dNTP binding activities
18969	DNA polymerase alpha subunit B. The B subunit of the DNA polymerase alpha plays an essential role at the initial stage of DNA replication in S. cerevisiae and is phosphorylated in a cell cycle-dependent manner
18974	DNA polymerase epsilon subunit B. DNA polymerase epsilon is essential for cell viability and chromosomal DNA replication in budding yeast. In addition, DNA polymerase epsilon may be involved in DNA repair and cell-cycle checkpoint control. The enzyme con
18974	DNA polymerase alpha subunit B. The B subunit of the DNA polymerase alpha plays an essential role at the initial stage of DNA replication in S. cerevisiae and is phosphorylated in a cell cycle-dependent manner
18974	DNA polymerase epsilon subunit B. DNA polymerase epsilon is essential for cell viability and chromosomal DNA replication in budding yeast. In addition, DNA polymerase epsilon may be involved in DNA repair and cell-cycle checkpoint control. The enzyme cons
18975	DNA polymerase family A
18976	Corticotropin ACTH domain
18979	Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity.
18980	Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity.
18986	Homeobox domain
18987	Homeobox domain
18988	Homeobox domain
18996	Homeobox domain
18996	Pou domain - N-terminal to homeobox domain
18997	Pou domain - N-terminal to homeobox domain
18998	Pou domain - N-terminal to homeobox domain
18999	Pou domain - N-terminal to homeobox domain
19011	Somatomedin B domain
19012	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
19013	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
19013	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
19015	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
19015	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
19016	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
19016	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
19024	pfam02920, integrase_DNA, DNA binding domain of tn916 integrase
19024	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
19025	Serine carboxypeptidase
19027	Synaptophysin / synaptoporin
19035	Cyclophilin type peptidyl-prolyl cis-trans isomerase
19035	Cyclophilin type peptidyl-prolyl cis-trans isomerase
19038	Cyclophilin type peptidyl-prolyl cis-trans isomerase
19043	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
19047	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
19055	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
19056	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
19060	TPR Domain
19060	Uncharacterised protein family (UPF0169). Members of this family are predicted to be lipoproteins. The function of these proteins is unknown
19060	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
19062	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
19063	Palmitoyl protein thioesterase
19064	Pancreatic hormone peptide
19065	7 transmembrane receptor (rhodopsin family)
19070	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
19075	DNA primase small subunit. DNA primase synthesizes the RNA primers for the Okazaki fragments in lagging strand DNA synthesis. DNA primase is a heterodimer of large and small subunits
19076	Eukaryotic-type DNA primase, large subunit. DNA primase is the polymerase that synthesises small RNA primers for the Okazaki fragments made during discontinuous DNA replication. DNA primase is a heterodimer of two subunits, the small subunit Pri1 (48 kDa
19079	pfam02922, isoamylase_N, Isoamylase N-terminal domain. This domain is found in a range of enzymes that act on branched substrates - isoamylase, pullulanase and branching enzyme. This family also contains the beta subunit of 5' AMP activated kinase
19084	Cyclic nucleotide-binding domain
19084	Cyclic nucleotide-binding domain
19084	Regulatory subunit of type II PKA R-subunit
19085	Cyclic nucleotide-binding domain
19087	Cyclic nucleotide-binding domain
19088	Cyclic nucleotide-binding domain
19088	Regulatory subunit of type II PKA R-subunit
19090	Phosphatidylinositol 3- and 4-kinase
19090	FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal
19109	Somatotropin hormone family
19110	Somatotropin hormone family
19111	Somatotropin hormone family
19112	Somatotropin hormone family
19113	Somatotropin hormone family
19114	Somatotropin hormone family
19123	Trypsin
19123	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
19125	Proline dehydrogenase
19127	Homeobox domain
19128	Laminin G domain
19128	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy
19132	Intermediate filament protein
19133	Tetraspanin family
19141	Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this
19146	CUB domain
19146	MAM domain. An extracellular domain found in many receptors
19146	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
19146	CUB domain
19146	MAM domain. An extracellular domain found in many receptors
19146	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
19155	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
19156	Saposin A-type domain
19156	Saposin A-type domain
19157	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
19158	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
19159	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
19164	Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans
19165	Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans
19170	Proteasome A-type and B-type
19172	Proteasome A-type and B-type
19181	Sigma-54 interaction domain
19181	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
19181	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
19186	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b
19186	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin
19188	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b
19188	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin
19191	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b
19191	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin
19192	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b
19192	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin
19197	Transforming growth factor beta like domain
19200	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
19200	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
19201	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
19202	Homeobox domain
19205	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
19206	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
19207	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
19207	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
19207	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
19210	Phosphatidyl serine synthase. Phosphatidyl serine synthase is also known as serine exchange enzyme. This family represents eukaryotic PSS I and II which are membrane bound proteins which catalyses the replacement of the head group of a phospholipid (phos
19212	Phosphotriesterase family
19213	Helix-loop-helix DNA-binding domain
19214	7 transmembrane receptor (rhodopsin family)
19215	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
19216	7 transmembrane receptor (rhodopsin family)
19217	7 transmembrane receptor (rhodopsin family)
19218	7 transmembrane receptor (rhodopsin family)
19219	7 transmembrane receptor (rhodopsin family)
19220	7 transmembrane receptor (rhodopsin family)
19222	7 transmembrane receptor (rhodopsin family)
19223	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
19224	Animal haem peroxidase
19224	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
19225	Animal haem peroxidase
19225	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
19225	Animal haem peroxidase
19225	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
19225	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
19226	Parathyroid hormone family
19227	Parathyroid hormone family
19228	7 transmembrane receptor (Secretin family)
19229	Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta
19229	Protein kinase domain
19229	Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion tar
19242	PTN/MK heparin-binding protein family
19243	Protein-tyrosine phosphatase
19243	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
19247	SH2 domain
19252	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
19258	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
19258	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
19260	Paramyxovirus P phosphoprotein. This family consists of paramyxovirus P phosphoprotein from sendai virus and human and bovine parainfluenza viruses. The P protein is an essential part of the viral RNA polymerase complex formed form the P and L proteins.
19261	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
19263	Fibronectin type III domain
19266	Fibronectin type III domain
19266	Protein-tyrosine phosphatase
19266	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
19268	Fibronectin type III domain
19270	Eukaryotic-type carbonic anhydrase
19271	Fibronectin type III domain
19272	Fibronectin type III domain
19273	Fibronectin type III domain
19274	Fibronectin type III domain
19280	Fibronectin type III domain
19280	Fibronectin type III domain
19281	Fibronectin type III domain
19283	Fibronectin type III domain
19283	Eukaryotic-type carbonic anhydrase
19283	Fibronectin type III domain
19283	Eukaryotic-type carbonic anhydrase
19283	Protein-tyrosine phosphatase
19283	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
19286	6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP.
19289	Fibronectin type III domain
19293	EF hand
19301	Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is
19302	Zinc finger, C3HC4 type (RING finger)
19303	Paxillin family
19303	LIM domain. This family represents two copies of the LIM structural domain
19303	Paxillin family
19303	LIM domain. This family represents two copies of the LIM structural domain
19309	Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses
19309	Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses
19324	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19325	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19328	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19329	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19331	ADP-ribosylation factor family
19331	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19332	ADP-ribosylation factor family
19332	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19334	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19335	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19336	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19344	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19344	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19345	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19345	ADP-ribosylation factor family
19345	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19347	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
19347	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
19348	Kinesin motor domain
19353	ADP-ribosylation factor family
19353	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
19355	Repair protein Rad1/Rec1/Rad17
19356	Rad17 cell cycle checkpoint protein
19358	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
19359	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
19359	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t
19365	Rad52/22 family double-strand break repair protein
19367	Rad9. Rad9 is required for transient cell-cycle arrests and transcriptional induction of DNA repair in response to DNA damage
19368	Class I Histocompatibility antigen, domains alpha 1 and 2
19369	Class I Histocompatibility antigen, domains alpha 1 and 2
19370	Class I Histocompatibility antigen, domains alpha 1 and 2
19374	Recombination activating protein 2. V-D-J recombination is the combinatorial process by which the huge range of immunoglobulin and T cell binding specificity is generated from a limited amount of genetic material. This process is synergistically activate
19378	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
19385	RanBP1 domain
19385	RanBP1 domain
19385	RanBP1 domain
19386	RanBP1 domain
19386	Zn-finger in Ran binding protein and others
19386	Cyclophilin type peptidyl-prolyl cis-trans isomerase
19395	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
19395	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ
19401	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
19401	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
19411	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
19411	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
19414	C2 domain
19414	PH domain. PH stands for pleckstrin homology
19414	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
19414	BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region
19415	C2 domain
19417	Guanine nucleotide exchange factor for Ras-like GTPases
19417	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
19418	PH domain. PH stands for pleckstrin homology
19418	Guanine nucleotide exchange factor for Ras-like GTPases
19418	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
19419	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
19434	OAR domain
19434	Homeobox domain
19645	Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted
19645	Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserve
19649	Fibronectin type III domain
19650	Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted
19650	Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserved
19651	Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted
19651	Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserve
19653	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members invol
19659	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
19660	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
19661	Interphotoreceptor retinoid-binding protein. Interphotoreceptor retinoid-binding protein (IRBP) mediates retinoid trafficking between the photoreceptors and pigmented epithelium. This Pfam family represents a repeat domain found four times in the mammali
19662	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
19663	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
19664	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
19668	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
19671	CAAX amino terminal protease family. Members of this family are probably proteases. the family contains CAAX prenyl protease. The proteins contain a highly conserved Glu-Glu motif at the amino end of the alignment. The alignment also contains two histidi
19679	Phosphatidylinositol transfer protein
19679	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester
19682	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
19683	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
19684	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
19684	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
19687	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
19694	Lectin C-type domain. This family includes both long and short form C-type
19696	Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha
19697	Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha
19698	Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain ha
19699	Reeler domain
19701	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
19702	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
19708	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
19712	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
19713	Cadherin domain
19714	DNA polymerase family B, exonuclease domain. This domain has 3' to 5' exonuclease activity and adopts a ribonuclease H type fold
19714	DNA polymerase family B. This region of DNA polymerase B appears to consist of more than one structural domain, possibly including elongation, DNA-binding and dNTP binding activities
19718	Rad17 cell cycle checkpoint protein
19718	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
19719	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
19720	B-box zinc finger
19720	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
19724	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
19725	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
19726	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
19726	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
19729	MtN3/saliva family. This family includes proteins such as drosophila saliva, MtN3 involved in root nodule development and a protein involved in activation and expression of recombination activation genes (RAGs). Although the molecular function of these p
19730	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
19731	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
19732	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
19733	Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis
19734	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
19735	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
19736	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
19743	Ammonium Transporter Family
19746	Ammonium Transporter Family
19763	Zinc finger, C3HC4 type (RING finger)
19763	Zinc finger, C3HC4 type (RING finger)
19765	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
19766	Death domain
19771	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
19771	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
19773	Insulin/IGF/Relaxin family. Superfamily includes insulins
19775	EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be
19775	SPX domain. We have named this region the SPX domain after (SYG1, Pho81 and XPR1). This 180 residue length domain is found at the amino terminus of a variety of proteins. In the yeast protein SYG1, the N-terminus directly binds to the G- protein beta sub
19777	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
19876	Fibronectin type III domain
19876	Fibronectin type III domain
19876	Alphaherpesvirus glycoprotein E. Glycoprotein E (gE) of Alphaherpesvirus forms a complex with glycoprotein I (gI) (pfam01688), functioning as an immunoglobulin G (IgG) Fc binding protein. gE is involved in virus spread but is not essential for propagation
19877	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
19879	Sugar (and other) transporter
19881	Tetraspanin family
19882	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
19882	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
19882	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
19883	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
19883	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
19885	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
19885	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
19886	Protein kinase domain
19886	Fibronectin type III domain
19888	Doublecortin
19892	Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc
19894	C2 domain
19894	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
19896	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
19899	Eukaryotic ribosomal protein L18
19899	Eukaryotic ribosomal protein L18
19921	Ribosomal protein L19e
19933	Ribosomal protein L21e
19934	Ribosomal L22e protein family
19935	Ribosomal protein L23
19942	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
19942	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
19943	Ribosomal L28e protein family
19944	Ribosomal L29e protein family
19946	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
19951	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
19981	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
19982	Ribosomal protein L44
19983	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
19988	Ribosomal protein L6e
19988	Ribosomal protein L6, N-terminal domain
19989	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
20014	ENV polyprotein (coat polyprotein)
20017	RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase com
20018	RNA polymerases L / 13 to 16 kDa subunit
20019	RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family inclu
20019	RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family
20020	RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family
20020	RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family inclu
20022	RNA polymerases L / 13 to 16 kDa subunit
20024	Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p1
20028	Phosducin
20042	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
20055	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
20055	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
20055	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
20068	Ribosomal S17
20084	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
20085	Ribosomal protein S19e
20090	Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes
20091	Ribosomal S3Ae family
20103	Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes
20104	Ribosomal protein S6e
20115	Ribosomal protein S7e
20116	Ribosomal protein S8e
20128	Zinc finger, C3HC4 type (RING finger)
20129	High molecular weight glutenin subunit. Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, gluteni
20132	7 transmembrane receptor (rhodopsin family)
20133	Ribonucleotide reductase, barrel domain
20135	Ribonucleotide reductase, small chain
20147	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
20167	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
20168	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
20181	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
20181	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
20182	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
20182	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
20182	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
20183	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
20183	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
20185	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
20186	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
20186	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
20187	Protein kinase domain
20187	WIF domain. The WIF domain is found in the RYK tyrosine kinase receptors and WIF the Wnt-inhibitory- factor. The domain is extracellular and and contains two conserved cysteines that may form a disulphide bridge. This domain is Wnt binding in WIF, and it
20191	MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function
20191	RyR domain. This domain is called RyR for Ryanodine receptor. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown
20191	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
20191	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
20193	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20194	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20195	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20196	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20197	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20198	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20199	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20200	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20200	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20201	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20202	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20203	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20203	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20203	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
20204	Homeobox domain
20208	Serum amyloid A protein
20210	Serum amyloid A protein
20211	Serum amyloid A protein
20211	Serum amyloid A protein
20215	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
20215	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
20219	Pentaxin family. Pentaxins are also known as pentraxins
20226	Seryl-tRNA synthetase N-terminal domain. This domain is found associated with the Pfam tRNA synthetase class II domain (pfam00587) and represents the N-terminal domain of seryl-tRNA synthetase
20226	tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt
20227	SART-1 family. This family of proteins appear to contain a leucine zipper and may therefore be a family of transcription factors
20229	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
20230	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
20231	Homeobox domain
20234	Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein
20239	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
20248	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
20254	Granin (chromogranin or secretogranin)
20257	Stathmin family
20262	Stathmin family
20264	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
20264	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
20265	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
20269	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
20271	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
20272	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
20272	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
20273	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
20276	Amiloride-sensitive sodium channel
20277	Amiloride-sensitive sodium channel
20278	Amiloride-sensitive sodium channel
20280	Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
20280	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
20280	Beta-ketoacyl synthase, C-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains
20280	Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains.
20286	TPR Domain
20287	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
20288	Macrophage scavenger receptor
20288	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
20290	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20296	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20302	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20303	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20304	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20304	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20305	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20306	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20309	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20310	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20311	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20315	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20315	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
20321	Reeler domain
20334	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/2
20334	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24
20336	Sec8 exocyst complex component specific domain
20338	Fibronectin type II domain
20339	Lectin C-type domain. This family includes both long and short form C-type
20340	Cysteine rich repeat. This cysteine rich repeat contains four cysteines. It is found in multiple copies in a protein that binds to fibroblast growth factors. The repeat is also found in MG160 and E-selectin ligand (ESL-1)
20343	Lectin C-type domain. This family includes both long and short form C-type
20344	Sushi domain (SCR repeat)
20344	Lectin C-type domain. This family includes both long and short form C-type
20346	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20346	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20347	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20347	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20348	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20349	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20350	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20351	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20352	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20352	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans prot
20352	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
20353	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20353	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
20354	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20355	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20355	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20356	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20356	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
20356	Thrombospondin type 1 domain
20356	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20356	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans prot
20356	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
20357	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
20357	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
20362	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
20362	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
20370	Sushi domain (SCR repeat)
20375	Ets-domain
20378	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
20379	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
20387	Lectin C-type domain. This family includes both long and short form C-type
20388	Surfactant protein B
20388	Saposin A-type domain
20388	Saposin A-type domain
20388	Surfactant protein B
20390	Lectin C-type domain. This family includes both long and short form C-type
20401	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
20411	Sorbin homologous domain
20415	Laminin G domain
20416	SH2 domain
20416	Phosphotyrosine interaction domain (PTB/PID)
20418	Phosphotyrosine interaction domain (PTB/PID)
20423	Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module
20423	Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation
20425	Serine hydroxymethyltransferase
20429	Homeobox domain
20437	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result
20438	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result
20439	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result
20439	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results
20440	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20441	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20442	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20443	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20444	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20444	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20445	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20446	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20447	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20448	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20449	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20450	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20451	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20452	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20454	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
20463	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
20465	Helix-loop-helix DNA-binding domain
20465	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
20465	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
20469	Rap/ran-GAP
20469	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
20471	Homeobox domain
20472	Homeobox domain
20473	Homeobox domain
20474	Homeobox domain
20479	MIT domain
20481	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
20482	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
20482	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
20482	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
20493	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f
20494	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f
20498	K-Cl Co-transporter type 1 (KCC1)
20500	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
20505	Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor
20509	Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog
20510	Sodium:dicarboxylate symporter family
20511	Sodium:dicarboxylate symporter family
20511	Sodium:dicarboxylate symporter family
20512	Sodium:dicarboxylate symporter family
20513	Sodium:dicarboxylate symporter family
20514	Sodium:dicarboxylate symporter family
20515	Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor
20516	Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor
20516	Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor
20517	Sugar (and other) transporter
20518	Sugar (and other) transporter
20519	Sugar (and other) transporter
20519	Sugar (and other) transporter
20520	Sugar (and other) transporter
20521	Sugar (and other) transporter
20522	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic
20523	Mitochondrial carrier protein
20524	Mitochondrial carrier protein
20524	Mitochondrial carrier protein
20525	Sugar (and other) transporter
20526	Sugar (and other) transporter
20527	Sugar (and other) transporter
20528	Sugar (and other) transporter
20529	Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hi
20530	Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hi
20531	Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor
20532	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str
20533	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
20535	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
20536	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
20537	Sodium:solute symporter family
20538	Sodium:neurotransmitter symporter family
20541	Calx-beta domain
20541	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
20544	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
20563	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
20563	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
20563	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
20563	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
20563	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
20563	Leucine Rich Repeat. CAUTION: This Pfam may not find all Leucine Rich Repeats in a protein. Leucine Rich Repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in
20563	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
20564	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
20564	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
20564	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
20567	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
20567	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
20567	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f
20567	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
20568	WAP-type (Whey Acidic Protein) 'four-disulfide core'
20585	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
20586	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
20586	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
20586	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
20586	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u
20591	jmjC domain
20591	jmjN domain
20591	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
20591	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
20591	pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function
20592	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
20596	Synaptophysin / synaptoporin
20596	7 transmembrane receptor (Secretin family)
20596	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
20596	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are
20597	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte
20598	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
20602	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
20603	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
20604	Somatostatin/Cortistatin family. Members of this family are hormones. Somatostatin inhibits the release of somatotropin. Cortistatin is a peptide that is related to the Somatostatins that is found to depresses neuronal electrical activity but, unlike som
20605	7 transmembrane receptor (rhodopsin family)
20606	7 transmembrane receptor (rhodopsin family)
20607	7 transmembrane receptor (rhodopsin family)
20608	7 transmembrane receptor (rhodopsin family)
20611	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members
20611	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members
20614	SNAP-25 family. SNAP-25 (synaptosome-associated protein 25 kDa) proteins are components of SNARE complexes. Members of this family contain a cluster of cysteine residues that can be palmitoylated for membrane attachment
20616	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
20617	Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid
20618	Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid
20620	POLO box duplicated region
20623	Protein kinase domain
20638	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
20639	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
20641	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
20643	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
20648	PH domain. PH stands for pleckstrin homology
20649	PH domain. PH stands for pleckstrin homology
20649	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
20650	PH domain. PH stands for pleckstrin homology
20655	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
20657	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
20660	Fibronectin type III domain
20663	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
20664	HMG (high mobility group) box
20665	HMG (high mobility group) box
20666	HMG (high mobility group) box
20669	HMG (high mobility group) box
20670	HMG (high mobility group) box
20671	HMG (high mobility group) box
20672	HMG (high mobility group) box
20672	HMG (high mobility group) box
20674	HMG (high mobility group) box
20675	HMG (high mobility group) box
20677	HMG (high mobility group) box
20677	HMG (high mobility group) box
20678	HMG (high mobility group) box
20679	HMG (high mobility group) box
20680	HMG (high mobility group) box
20681	HMG (high mobility group) box
20681	HMG (high mobility group) box
20682	HMG (high mobility group) box
20684	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
20684	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
20687	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
20687	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
20688	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
20689	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
20689	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
20690	Hyaluronidase
20692	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand
20700	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
20701	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
20702	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
20703	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
20704	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
20715	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
20716	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
20717	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
20720	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
20724	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
20729	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members
20729	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members
20730	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand
20733	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
20739	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
20739	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
20740	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
20740	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
20740	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
20740	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
20741	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
20742	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
20742	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
20743	PH domain. PH stands for pleckstrin homology
20743	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
20744	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
20745	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
20747	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
20747	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
20750	Osteopontin
20751	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
20766	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
20778	CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion
20779	SH2 domain
20779	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
20787	Helix-loop-helix DNA-binding domain
20788	Helix-loop-helix DNA-binding domain
20807	SRF-type transcription factor (DNA-binding and dimerisation domain)
20810	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
20811	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
20813	Signal recognition particle 14kD protein. The signal recognition particle (SRP) is a multimeric protein involved in targeting secretory proteins to the rough endoplasmic reticulum membrane. SRP14 and SRP9 form a complex essential for SRP RNA binding
20818	ADP-ribosylation factor family
20821	B-box zinc finger
20821	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
20826	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family
20826	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
20833	HMG (high mobility group) box
20833	Structure-specific recognition protein
20840	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
20842	Stromal antigen (SA/STAG) protein
20843	Stromal antigen (SA/STAG) protein
20844	VHS domain. Domain present in VPS-27, Hrs and STAM
20846	SH2 domain
20846	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
20846	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
20846	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
20847	SH2 domain
20847	Minor capsid protein VI. This minor capsid protein may act as a link between the external capsid and the internal DNA-protein core. The C-terminal 11 residues may function as a protease cofactor leading to enzyme activation
20847	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
20847	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
20847	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
20848	SH2 domain
20848	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
20848	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
20848	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
20849	SH2 domain
20849	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
20849	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
20849	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
20849	SH2 domain
20849	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fac
20849	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. Th
20849	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STAT
20850	SH2 domain
20850	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
20850	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
20850	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
20851	SH2 domain
20851	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
20851	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
20852	SH2 domain
20852	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
20852	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
20852	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
20853	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
20855	Stanniocalcin family
20856	Stanniocalcin family
20860	Sulfotransferase protein
20862	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
20862	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family
20863	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
20865	Sulfotransferase protein
20865	Sulfotransferase protein
20867	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
20872	Protein kinase domain
20877	Protein kinase domain
20878	Protein kinase domain
20887	Sulfotransferase protein
20888	Sulfotransferase protein
20893	Helix-loop-helix DNA-binding domain
20905	Sulfatase
20907	Syntaxin
20910	Sec1 family
20911	Sec1 family
20912	Sec1 family
20913	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
20916	ATP-grasp domain. This family does not contain all known ATP-grasp domain members. This family includes a diverse set of enzymes that possess ATP-dependent carboxylate-amine ligase activity
20916	CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP
20918	Translation initiation factor SUI1
20924	Supt5 repeat. This short region of similarity is found in two tandem copies in Supt5 proteins that are involved in chromatin regulation. The function of this region is unknown
20927	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
20927	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
20930	SURF1 family
20931	Sodium transport protein
20932	SURF4 family
20937	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
20947	PH domain. PH stands for pleckstrin homology
20955	Synaptobrevin
20955	Synaptobrevin
20957	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
20957	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
20963	SH2 domain
20964	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
20965	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
20969	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
20970	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
20972	Synaptogyrin. This family of proteins is distantly related to pfam01284
20973	Synaptogyrin. This family of proteins is distantly related to pfam01284
20974	Synaptogyrin. This family of proteins is distantly related to pfam01284
20975	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
20977	Synaptophysin / synaptoporin
20979	C2 domain
20981	C2 domain
20997	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
21334	Neurokinin B
21336	7 transmembrane receptor (rhodopsin family)
21337	7 transmembrane receptor (rhodopsin family)
21338	7 transmembrane receptor (rhodopsin family)
21346	Calponin family repeat
21346	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
21349	Helix-loop-helix DNA-binding domain
21351	Transaldolase
21357	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
21366	Sodium:neurotransmitter symporter family
21367	Fibronectin type III domain
21371	Tubulin binding cofactor A
21372	WD domain, G-beta repeat
21386	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
21388	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
21390	7 transmembrane receptor (rhodopsin family)
21391	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
21401	Transcription factor S-II (TFIIS)
21406	Helix-loop-helix DNA-binding domain
21414	HMG (high mobility group) box
21415	HMG (high mobility group) box
21416	HMG (high mobility group) box
21417	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
21418	Transcription factor AP-2
21419	Transcription factor AP-2
21420	Transcription factor AP-2
21423	Helix-loop-helix DNA-binding domain
21425	Helix-loop-helix DNA-binding domain
21426	Helix-loop-helix DNA-binding domain
21428	Helix-loop-helix DNA-binding domain
21429	HMG (high mobility group) box
21429	HMG (high mobility group) box
21432	TCL1/MTCP1 family. Two related oncogenes, TCL-1 and MTCP-1, are overexpressed in T cell prolymphocytic leukemias as a result of chromosomal rearrangements that involve the translocation of one T cell receptor gene to either chromosome 14q32 or Xq28
21452	Eukaryotic cobalamin-binding protein
21453	Treacher Collins syndrome protein Treacle
21454	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
21461	Myosin tail
21462	Myosin tail
21645	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
21647	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
21648	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
21652	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
21665	Uracil DNA glycosylase superfamily
21665	Uracil DNA glycosylase superfamily
21673	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
21674	HMG (high mobility group) box
21676	TEA/ATTS domain family
21677	TEA/ATTS domain family
21678	TEA/ATTS domain family
21679	TEA/ATTS domain family
21682	BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region
21683	Zona pellucida-like domain
21683	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
21687	Fibronectin type III domain
21689	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
21743	NNMT/PNMT/TEMT family
21744	Adenosine-deaminase (editase) domain
21744	Adenosine-deaminase (editase) domain
21744	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea
21753	LIM domain. This family represents two copies of the LIM structural domain
21766	Protein of unknown function (DUF396). A family of conserved eukaryotic transmembrane proteins
21769	AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i
21770	Protein phosphatase 2A regulatory B subunit (B56 family)
21780	HMG (high mobility group) box
21781	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
21784	Trefoil (P-type) domain
21785	Trefoil (P-type) domain
21785	Trefoil (P-type) domain
21786	Trefoil (P-type) domain
21788	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
21803	Transforming growth factor beta like domain
21803	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
21804	LIM domain. This family represents two copies of the LIM structural domain
21807	TSC-22/dip/bun family
21808	Transforming growth factor beta like domain
21808	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
21809	Transforming growth factor beta like domain
21809	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
21810	Fasciclin domain. This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria
21812	Protein kinase domain
21812	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
21813	Protein kinase domain
21813	Protein kinase domain
21814	Zona pellucida-like domain
21816	Transglutaminase family
21816	Transglutaminase family, C-terminal ig like domain
21816	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
21817	Transglutaminase family
21817	Transglutaminase family, C-terminal ig like domain
21817	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
21818	Transglutaminase family
21818	Transglutaminase family, C-terminal ig like domain
21818	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
21819	Carboxylesterase
21819	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
21823	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
21825	Thrombospondin N-terminal -like domain
21825	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
21826	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
21828	Thrombospondin N-terminal -like domain
21832	Erythropoietin/thrombopoietin
21833	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
21833	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
21833	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
21833	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
21834	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
21834	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
21844	Raf-like Ras-binding domain
21846	Fibronectin type III domain
21847	Zinc finger, C2H2 type
21848	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
21848	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
21849	B-box zinc finger
21849	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
21854	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim
21854	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1
21855	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim
21855	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1
21856	Mitochondrial import inner membrane, translocase subunit TIM44. Tim44 is an essential component of the machinery that mediates the translocation of nuclear-encoded proteins across the mitochondrial inner membrane. Tim44 is thought to bind phospholipids o
21856	Mitochondrial import inner membrane, translocase subunit TIM44. Tim44 is an essential component of the machinery that mediates the translocation of nuclear-encoded proteins across the mitochondrial inner membrane. Tim44 is thought to bind phospholipids of
21857	Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species
21858	Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species
21869	Homeobox domain
21871	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
21872	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
21873	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
21877	Thymidine kinase
21884	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
21884	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
21885	Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation
21886	Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation
21887	Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation
21892	CUB domain
21892	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
21894	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
21897	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
21898	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
21899	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
21906	Protein of unknown function, DUF270
21906	Protein of unknown function, DUF270
21907	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
21907	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
21908	Homeobox domain
21909	Homeobox domain
21912	Tetraspanin family
21915	Thymidylate kinase
21916	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
21917	LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1
21923	Fibronectin type III domain
21927	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
21929	A20-like zinc finger. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation
21930	CUB domain
21930	Extracellular link domain
21933	Death domain
21933	TNFR/NGFR cysteine-rich region
21935	Sugar (and other) transporter
21937	TNFR/NGFR cysteine-rich region
21937	TNFR/NGFR cysteine-rich region
21938	TNFR/NGFR cysteine-rich region
21942	TNFR/NGFR cysteine-rich region
21947	TNF(Tumor Necrosis Factor) family
21948	TNF(Tumor Necrosis Factor) family
21949	TNF(Tumor Necrosis Factor) family
21950	TNF(Tumor Necrosis Factor) family
21951	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
21952	Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom
21953	Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom
21954	Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom
21955	Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropomy
21959	TPR Domain
21959	Nuclear transition protein 2
21959	FKBP-type peptidyl-prolyl cis-trans isomerases
21960	Fibronectin type III domain
21960	Fibronectin type III domain
21960	Fibrinogen beta and gamma chains, C-terminal globular domain
21968	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
21975	Topoisomerase DNA binding C4 zinc finger
21981	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
21981	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
21981	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
21981	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
21985	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction
21987	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction
21990	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
21991	Triosephosphate isomerase
22003	Tropomyosin
22004	Tropomyosin
22018	Animal haem peroxidase
22018	Sushi domain (SCR repeat)
22019	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
22021	Sulfotransferase protein
22022	Sulfotransferase protein
22025	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
22025	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
22026	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
22029	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
22030	TRAF-type zinc finger
22030	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
22030	TRAF-type zinc finger
22030	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
22031	TRAF-type zinc finger
22031	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
22032	TRAF-type zinc finger
22032	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
22033	TRAF-type zinc finger
22033	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
22034	TRAF-type zinc finger
22034	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
22038	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve
22041	Transferrin
22042	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
22042	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
22045	7 transmembrane receptor (rhodopsin family)
22051	LIM domain. This family represents two copies of the LIM structural domain
22057	BTG1 family. A novel family of anti-proliferative proteins
22059	P53
22061	P53
22062	P53
22062	P53
22063	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
22064	XRCC1 N terminal domain
22064	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
22065	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
22066	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
22067	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
22068	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
22070	Translationally controlled tumor protein
22072	Trypsin
22072	Trypsin
22073	Trypsin
22074	Trypsin
22074	Trypsin
22084	Tuberin
22084	Rap/ran-GAP
22095	7 transmembrane receptor (rhodopsin family)
22099	Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen
22110	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
22114	Protein kinase domain
22114	Protein kinase domain
22121	Ribosomal protein L13
22130	Myb-like DNA-binding domain
22138	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
22139	Transthyretin precursor (formerly prealbumin). Transthyretin is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. Mutations in the human transthyretin are associated with several genetic disorders
22141	Tub family
22151	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
22151	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
22157	Tub family
22157	Tub family
22158	Tub family
22158	Tub family
22160	Helix-loop-helix DNA-binding domain
22163	TNFR/NGFR cysteine-rich region
22164	TNF(Tumor Necrosis Factor) family
22165	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
22166	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
22171	Thymidylate synthase
22173	Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372
22174	Fibronectin type III domain
22178	Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372
22183	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
22184	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
22186	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
22186	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
22186	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
22186	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
22186	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
22186	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
22187	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
22190	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
22192	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
22195	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
22200	Repeat in ubiquitin-activating (UBA) protein
22200	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
22201	Repeat in ubiquitin-activating (UBA) protein
22201	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
22202	Repeat in ubiquitin-activating (UBA) protein
22202	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
22217	Ubiquitin carboxyl-terminal hydrolase family 2
22222	Putative zinc finger in N-recognin
22223	Ubiquitin carboxyl-terminal hydrolase, family 1
22227	Mitochondrial carrier protein
22228	Mitochondrial carrier protein
22229	Mitochondrial carrier protein
22230	Ubiquitin fusion degradation protein UFD1. Post-translational ubiquitin-protein conjugates are recognized for degradation by the ubiquitin fusion degradation (UFD) pathway. Several proteins involved in this pathway have been identified. This family inclu
22232	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
22236	UDP-glucoronosyl and UDP-glucosyl transferase
22238	UDP-glucoronosyl and UDP-glucosyl transferase
22239	UDP-glucoronosyl and UDP-glucosyl transferase
22240	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
22242	Zona pellucida-like domain
22245	Phosphoribulokinase / Uridine kinase family
22247	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
22247	Orotidine 5'-phosphate decarboxylase / HUMPS family. This family includes Orotidine 5'-phosphate decarboxylase enzymes EC:4.1.1.23 that are involved in the final step of pyrimidine biosynthesis. The family also includes enzymes such as hexulose-6-phosphat
22249	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
22255	Homeobox domain
22259	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
22259	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
22260	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
22260	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
22268	Tetraspanin family
22268	Tetraspanin family
22272	pfam02939, UcrQ, UcrQ family. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This family represents the 9.5 kDa subunit of the complex
22275	Uroporphyrinogen decarboxylase (URO-D)
22278	Helix-loop-helix DNA-binding domain
22282	Helix-loop-helix DNA-binding domain
22283	Fibronectin type III domain
22283	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
22287	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
22288	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
22288	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
22288	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
22288	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
22289	TPR Domain
22289	jmjC domain
22290	jmjC domain
22294	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
22295	Cadherin domain
22296	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
22296	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
22296	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
22297	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
22297	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
22301	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
22302	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
22303	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
22304	7 transmembrane receptor (metabotropic glutamate family)
22304	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
22305	7 transmembrane receptor (metabotropic glutamate family)
22305	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
22308	7 transmembrane receptor (metabotropic glutamate family)
22308	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
22309	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
22310	7 transmembrane receptor (metabotropic glutamate family)
22310	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
22311	7 transmembrane receptor (metabotropic glutamate family)
22312	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
22315	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
22317	Synaptobrevin
22318	Synaptobrevin
22319	Synaptobrevin
22320	Synaptobrevin
22321	tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included
22321	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
22321	tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included
22321	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
22323	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
22324	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
22324	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
22325	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
22325	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
22326	Homeobox domain
22327	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
22327	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids, some interacts with c-Myc and repress its transcriptional activity. Suggesting that other members of this family may also regulate transcription by intera
22327	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription b
22330	Vinculin family
22333	Eukaryotic porin
22334	Eukaryotic porin
22335	Eukaryotic porin
22337	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
22337	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
22339	Platelet-derived growth factor (PDGF)
22341	Platelet-derived growth factor (PDGF)
22342	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
22343	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
22346	von Hippel-Lindau disease tumor suppressor protein. VHL forms a ternary complex with the elonginB and elonginC proteins. This complex binds Cul2, which then is involved in regulation of vascular endothelial growth factor mRNA
22348	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
22350	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
22350	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
22351	Villin headpiece domain
22352	Intermediate filament protein
22353	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
22353	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
22354	7 transmembrane receptor (Secretin family)
22355	7 transmembrane receptor (Secretin family)
22359	Low-density lipoprotein receptor domain class A
22359	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
22361	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
22364	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
22364	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
22365	Sec1 family
22367	Protein kinase domain
22370	Somatomedin B domain
22370	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
22371	von Willebrand factor type A domain
22371	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
22371	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
22371	von Willebrand factor type D domain
22371	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
22371	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
22371	von Willebrand factor type D domain
22371	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
22375	WHEP-TRS domain
22376	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
22376	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
22378	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
22385	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
22390	Protein kinase domain
22402	Thrombospondin type 1 domain
22408	wnt family
22408	wnt family
22409	wnt family
22410	wnt family
22412	wnt family
22413	wnt family
22414	wnt family
22415	wnt family
22416	wnt family
22417	wnt family
22418	wnt family
22419	wnt family
22420	wnt family
22422	wnt family
22427	HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease
22427	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi
22427	HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease
22427	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
22427	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
22427	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termin
22431	Wilm's tumour protein
22431	Wilm's tumour protein
22436	[2Fe-2S] binding domain
22436	CO dehydrogenase flavoprotein C-terminal domain
22436	FAD binding domain in molybdopterin dehydrogenase
22436	Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain
22436	Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain
22437	7 transmembrane receptor (rhodopsin family)
22441	Occludin/ELL family
22589	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
22589	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
22590	XPA protein
22591	DNA repair protein Rad4
22592	XPG I-region
22594	XRCC1 N terminal domain
22594	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
22596	Ku70/Ku80 C-terminal arm. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the C terminal arm. Thi
22596	Ku70/Ku80 N-terminal alpha/beta domain. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the amino
22596	Ku70/Ku80 beta-barrel domain. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the central DNA-bin
22598	Sodium:neurotransmitter symporter family
22599	Sodium:neurotransmitter symporter family
22612	SH2 domain
22612	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
22612	SH2 domain
22612	Protein kinase domain
22612	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
22626	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predict
22627	14-3-3 protein
22628	14-3-3 protein
22629	14-3-3 protein
22630	14-3-3 protein
22631	14-3-3 protein
22634	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
22635	von Willebrand factor type D domain
22635	MAM domain. An extracellular domain found in many receptors
22635	TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr
22635	TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pr
22635	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
22637	SH2 domain
22640	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22640	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
22642	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
22643	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22654	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22658	Zinc finger, C3HC4 type (RING finger)
22661	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
22666	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
22670	B-box zinc finger
22673	LIM domain. This family represents two copies of the LIM structural domain
22682	AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i
22687	ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with othe
22688	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22689	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22691	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
22695	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
22698	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22701	Transposase. Transposase proteins are necessary for efficient DNA transposition. Contains transposases for IS204, IS1001, IS1096 and IS1165
22709	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22712	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22715	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22715	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22755	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22759	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22775	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
22784	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
22786	Zona pellucida-like domain
22787	Zona pellucida-like domain
22788	Zona pellucida-like domain
22789	Sushi domain (SCR repeat)
22793	LIM domain. This family represents two copies of the LIM structural domain
22795	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
22797	Helix-loop-helix DNA-binding domain
22798	Uncharacterized ACR, COG1579
22798	Intermediate filament protein
22798	Laminin N-terminal (Domain VI)
22798	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
22798	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
22798	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
22798	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
22801	von Willebrand factor type A domain
22818	Clathrin adaptor complex small chain
22821	BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region
22823	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
22824	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
22829	Carboxylesterase
22836	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
22836	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
22839	Guanylate-kinase-associated protein (GKAP) protein
22841	C2 domain
22846	5'-nucleotidase
22856	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
22856	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
22859	Galactose binding lectin domain
22859	7 transmembrane receptor (Secretin family)
22859	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
22859	Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein cou
22861	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
22861	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
22861	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
22861	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
22861	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
22862	Fibronectin type III domain
22864	R3H domain
22864	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
22865	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
22870	FHIPEP family
22871	Carboxylesterase
22875	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
22879	SAND family protein. Members of this family are uncharacterised proteins that have been called SAND proteins. These protein do not contain a SAND domain. The members of this family are 500-600 amino acids long and contain several conserved motifs
22885	Villin headpiece domain
22885	LIM domain. This family represents two copies of the LIM structural domain
22890	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
22893	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
22895	C2 domain
22895	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
22898	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
22898	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
22900	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
22901	Sulfatase
22902	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
22903	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
22903	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
22904	Helicase conserved C-terminal domain
22908	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
22914	Lectin C-type domain. This family includes both long and short form C-type
22915	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
22925	Lectin C-type domain. This family includes both long and short form C-type
22932	Zona pellucida-like domain
22932	Zona pellucida-like domain
22933	Sir2 family
22933	Sir2 family
22936	Occludin/ELL family
22937	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
22947	Homeobox domain
22948	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
22950	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
22953	ATP P2X receptor
22953	ATP P2X receptor
22953	ATP P2X receptor
22953	ATP P2X receptor
22953	ATP P2X receptor
22953	ATP P2X receptor
22954	NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in some members, proteolysis has shown that the Peptidyl-alp
22955	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
22976	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
22976	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
22982	AMP-binding enzyme
22983	Protein kinase domain
22983	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
22985	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
22986	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
22987	Sugar (and other) transporter
22989	Myosin head (motor domain)
22989	Intermediate filament protein
22989	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
22989	Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown
22989	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
22989	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
22992	jmjC domain
22992	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
22993	HMG (high mobility group) box
22994	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
22994	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
22997	Fibronectin type III domain
22997	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
22998	LIM domain. This family represents two copies of the LIM structural domain
22999	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
23001	Beige/BEACH domain
23005	WD domain, G-beta repeat
23007	C2 domain
23007	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
23007	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
23008	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
23011	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
23017	Uncharacterized protein family UPF0005
23019	CCR4-Not complex component, Not1. The Ccr4-Not complex is a global regulator of transcription that affects genes positively and negatively and is thought to regulate transcription factor TFIID
23024	TRAF-type zinc finger
23024	Zinc finger, C3HC4 type (RING finger)
23024	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
23025	C2 domain
23028	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
23028	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
23029	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
23030	jmjN domain
23030	jmjC domain
23031	Protein kinase domain
23031	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
23035	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
23040	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
23043	Protein kinase domain
23043	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
23048	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
23049	Phosphatidylinositol 3- and 4-kinase
23049	FAT domain. The FAT domain is named after FRAP, ATM and TRRAP
23049	FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal
23049	Phosphatidylinositol 3- and 4-kinase
23049	FAT domain. The FAT domain is named after FRAP, ATM and TRRAP
23049	FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal
23058	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal
23062	VHS domain. Domain present in VPS-27, Hrs and STAM
23062	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
23062	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
23062	VHS domain. Domain present in VPS-27, Hrs and STAM
23063	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p
23067	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
23070	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
23070	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl
23071	Calsequestrin
23071	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
23075	PH domain. PH stands for pleckstrin homology
23081	jmjC domain
23081	jmjN domain
23087	B-box zinc finger
23087	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
23087	B-box zinc finger
23089	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
23091	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
23094	Rap/ran-GAP
23094	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
23095	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
23096	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
23097	Protein kinase domain
23099	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
23105	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
23105	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
23108	Rap/ran-GAP
23111	MIT domain
23113	Cullin family
23113	Anaphase-promoting complex, subunit 10 (APC10)
23113	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
23114	Fibronectin type III domain
23114	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
23118	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
23118	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
23119	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
23119	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
23119	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
23119	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
23120	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
23125	CG-1 domain. CG-1 domains are highly conserved domains of about 130 amino-acid residues containing a predicted bipartite NLS and named after a partial cDNA clone isolated from parsley encoding a sequence-specific DNA-binding protein. CG-1 domains are ass
23125	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
23126	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
23127	Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of va
23129	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
23133	jmjC domain
23133	jmjC domain
23133	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
23135	jmjC domain
23136	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
23142	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
23142	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
23143	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
23145	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
23148	NAC domain
23150	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
23150	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
23151	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
23157	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
23157	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
23157	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
23157	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
23158	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
23158	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
23160	WD domain, G-beta repeat
23162	Myosin tail
23162	Uncharacterized ACR
23162	Ezrin/radixin/moesin family
23162	Intermediate filament protein
23163	VHS domain. Domain present in VPS-27, Hrs and STAM
23163	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
23163	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
23163	VHS domain. Domain present in VPS-27, Hrs and STAM
23163	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
23163	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
23168	Plus-3 domain. This domain is about 90 residues in length and is often found associated with the pfam02213 domain. The function of this domain is uncertain. It is possible that this domain is involved in DNA binding as it has three conserved positively c
23170	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
23171	NAD-dependent glycerol-3-phosphate dehydrogenase
23176	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
23179	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
23181	AMP-binding enzyme
23186	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
23186	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
23187	Uncharacterized ACR, COG1579
23187	PH domain. PH stands for pleckstrin homology
23187	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
23192	Peptidase family C54
23192	Peptidase family C54
23193	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
23194	F-box domain
23197	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
23200	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
23200	E1-E2 ATPase
23207	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
23210	jmjC domain
23213	Sulfatase
23218	Beige/BEACH domain
23218	WD domain, G-beta repeat
23219	F-box domain
23220	Zinc finger, C3HC4 type (RING finger)
23220	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
23221	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
23224	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
23224	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
23225	Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins
23228	PH domain. PH stands for pleckstrin homology
23232	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
23233	Exocyst complex subunit Sec15-like
23234	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
23235	Protein kinase domain
23236	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
23236	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
23236	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
23236	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
23237	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
23239	PH domain. PH stands for pleckstrin homology
23239	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
23243	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
23249	Thrombospondin type 1 domain
23250	E1-E2 ATPase
23254	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
23256	Sec1 family
23256	Sec1 family
23258	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
23258	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
23258	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
23258	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
23258	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
23259	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
23259	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera
23261	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
23261	CG-1 domain. CG-1 domains are highly conserved domains of about 130 amino-acid residues containing a predicted bipartite NLS and named after a partial cDNA clone isolated from parsley encoding a sequence-specific DNA-binding protein. CG-1 domains are asso
23262	Histidine acid phosphatase
23265	Exo70 exocyst complex subunit. The Exo70 protein forms one subunit of the exocyst complex. First discovered in S. cerevisiae , Exo70 and other exocyst proteins have been observed in several other eukaryotes, including humans. In S. cerevisiae, the exocys
23266	Galactose binding lectin domain
23266	7 transmembrane receptor (Secretin family)
23266	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
23266	Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein cou
23268	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
23268	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
23268	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
23269	Herpesvirus UL6 like. This family consists of various proteins from the herpesviridae that are similar to herpes simplex virus type I UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome
23270	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
23271	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
23273	CDC45-like protein. CDC45 is an essential gene required for initiation of DNA replication in S. cerevisiae, forming a complex with MCM5/CDC46. Homologs of CDC45 have been identified in human, mouse and smut fungus among others
23276	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
23283	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
23284	Galactose binding lectin domain
23284	7 transmembrane receptor (Secretin family)
23284	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
23284	Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein cou
23294	Phosphotyrosine interaction domain (PTB/PID)
23294	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
23294	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
23302	Sulfotransferase protein
23302	WSC domain. This domain may be involved in carbohydrate binding
23303	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
23307	Uncharacterized ACR, COG1579
23307	Intermediate filament protein
23307	Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown
23307	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
23307	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
23307	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
23307	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
23309	Paired amphipathic helix repeat. This family contains the paired amphipathic helix repeat. The family contains the yeast SIN3 gene (also known as SDI1) that is a negative regulator of the yeast HO gene. This repeat may be distantly related to the helix-lo
23314	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
23315	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
23316	Homeobox domain
23316	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
23316	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
23317	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
23318	PAP/25A associated domain
23318	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to
23322	C2 domain
23322	Microtubule associated protein (MAP65/ASE1 family)
23322	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
23324	Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism
23326	Ubiquitin carboxyl-terminal hydrolase family 2
23326	Ubiquitin carboxyl-terminal hydrolases family 2
23326	Zn-finger in ubiquitin-hydrolases and other protein
23327	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
23329	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
23336	Intermediate filament protein
23336	Intermediate filament protein
23339	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
23339	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
23341	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
23341	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
23345	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
23347	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
23350	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
23350	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
23353	Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length
23358	Ubiquitin carboxyl-terminal hydrolase family 2
23358	Ubiquitin carboxyl-terminal hydrolases family 2
23361	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
23362	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
23363	Fibronectin type III domain
23363	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
23368	PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is abou
23380	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
23380	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
23382	S-adenosyl-L-homocysteine hydrolase
23384	Uncharacterized ACR, COG1579
23384	Intermediate filament protein
23384	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
23384	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
23386	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
23396	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal
23399	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
23405	Domain of unknown function. This putative domain is found in members of the Dicer protein family. This protein is a dsRNA nuclease that is involved in RNAi and related processes. This domain of about 100 amino acids has no known function, but does contai
23405	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
23405	Domain of unknown function. This putative domain is found in members of the Dicer protein family. This protein is a dsRNA nuclease that is involved in RNAi and related processes. This domain of about 100 amino acids has no known function, but does contai
23405	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
23408	Sir2 family
23409	Sir2 family
23410	Sir2 family
23411	Sir2 family
23415	Cyclic nucleotide-binding domain
23415	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
23415	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
23416	Cyclic nucleotide-binding domain
23416	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
23416	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
23423	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
23424	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
23426	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
23431	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
23432	7 transmembrane receptor (rhodopsin family)
23432	7 transmembrane receptor (rhodopsin family)
23439	Sodium / potassium ATPase beta chain
23440	Homeobox domain
23443	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
23450	CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p
23458	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
23462	Helix-loop-helix DNA-binding domain
23463	Isoprenylcysteine carboxyl methyltransferase (ICMT) family. The isoprenylcysteine o-methyltransferase (EC:2.1.1.100) family carry out carboyxl methylation of cleaved eukaryotic proteins that terminate in a CaaX motif. In Saccharomyces cerevisiae this met
23463	Isoprenylcysteine carboxyl methyltransferase (ICMT) family. The isoprenylcysteine o-methyltransferase (EC:2.1.1.100) family carry out carboyxl methylation of cleaved eukaryotic proteins that terminate in a CaaX motif. In Saccharomyces cerevisiae this met
23468	'chromo' (CHRromatin Organization MOdifier) domain
23468	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
23471	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
23473	MIT domain
23478	Signal peptidase I
23479	NifU-like N terminal domain. This domain is found in NifU in combination with pfam01106. This domain is found on isolated in several bacterial species. The nif genes are responsible for nitrogen fixation. However this domain is found in bacteria that do
23484	Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protei
23492	'chromo' (CHRromatin Organization MOdifier) domain
23493	Helix-loop-helix DNA-binding domain
23499	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
23499	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
23499	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
23503	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
23504	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
23510	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
23515	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
23516	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
23521	Ribosomal protein L13
23522	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
23526	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
23526	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
23527	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
23529	Ciliary neurotrophic factor
23530	NAD(P) transhydrogenase beta subunit. This family corresponds to the beta subunit of NADP transhydrogenase in prokaryotes, and either the protein N- or C terminal in eukaryotes. The domain is often found in conjunction with pfam01262. Pyridine nucleotide
23538	7 transmembrane receptor (rhodopsin family)
23539	Monocarboxylate transporter
23541	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
23541	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
23542	Phosphotyrosine interaction domain (PTB/PID)
23542	Phosphotyrosine interaction domain (PTB/PID)
23542	Phosphotyrosine interaction domain (PTB/PID)
23544	Sushi domain (SCR repeat)
23545	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
23546	Synaptogyrin. This family of proteins is distantly related to pfam01284
23549	Aminopeptidase I zinc metalloprotease (M18)
23550	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
23553	Hyaluronidase
23554	Tetraspanin family
23555	Tetraspanin family
23558	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
23562	PMP-22/EMP/MP20/Claudin family
23562	PMP-22/EMP/MP20/Claudin family
23564	Amidinotransferase. This family contains glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2) amidinotransferases, enzymes involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, EC:3.5.3.6. These enzyme
23566	7 transmembrane receptor (rhodopsin family)
23569	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
23576	Amidinotransferase. This family contains glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2) amidinotransferases, enzymes involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, EC:3.5.3.6. These enzyme
23581	ICE-like protease (caspase) p10 domain
23583	Uracil DNA glycosylase superfamily
23592	LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1
23596	7 transmembrane receptor (rhodopsin family)
23597	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C
23598	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
23598	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
23598	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
23598	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
23607	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
23613	MYND finger
23613	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
23616	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
23620	7 transmembrane receptor (rhodopsin family)
23621	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
23621	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
23621	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
23621	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
23623	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
23624	CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and
23624	CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily co
23624	CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conser
23627	Prion/Doppel alpha-helical domain. The prion protein is thought to be the infectious agent that causes transmissible spongiform encephalopathies, such as scrapie and BSE. It is thought that the prion protein can exist in two different forms: one is the n
23632	Eukaryotic-type carbonic anhydrase
23643	ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r
23649	DNA polymerase alpha subunit B. The B subunit of the DNA polymerase alpha plays an essential role at the initial stage of DNA replication in S. cerevisiae and is phosphorylated in a cell cycle-dependent manner
23658	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
23659	Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol
23660	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
23660	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
23677	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
23678	PX domain. PX domains bind to phosphoinositides
23678	PX domain. PX domains bind to phosphoinositides
23683	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
23705	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
23710	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
23732	DOMON domain. The DOMON (named after dopamine beta-monooxygenase N-terminal) domain is 110-125 residues long. It is predicted to form an all beta fold with 7-8 strands. The domain may mediate extracellular adhesive interactions
23743	Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13,
23760	Phosphatidylinositol transfer protein
23762	Oxysterol-binding protein
23764	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
23764	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
23768	Fibronectin type III domain
23768	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
23768	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
23770	FKBP-type peptidyl-prolyl cis-trans isomerase
23775	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
23775	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
23779	RhoGAP domain
23786	Apoptosis regulator proteins
23787	Mitochondrial carrier protein
23788	Mitochondrial carrier protein
23792	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
23792	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
23792	Disintegrin
23792	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
23792	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
23793	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
23793	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
23794	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
23794	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
23796	7 transmembrane receptor (rhodopsin family)
23801	Lipoxygenase
23801	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
23802	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
23805	AP endonuclease family 1
23806	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
23807	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
23821	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
23825	Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration
23827	Inositol monophosphatase family
23831	Eukaryotic-type carbonic anhydrase
23832	7 transmembrane receptor (rhodopsin family)
23835	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
23836	Cadherin domain
23836	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
23856	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
23856	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
23857	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
23871	Ets-domain
23871	Sterile alpha motif (SAM)/Pointed domain
23871	Ets-domain
23871	Sterile alpha motif (SAM)/Pointed domain
23872	Ets-domain
23882	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
23885	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
23887	Gamma-glutamyltranspeptidase
23888	Glypican
23890	7 transmembrane receptor (rhodopsin family)
23892	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb
23893	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb
23894	Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex
23917	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
23918	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
23920	Fibronectin type III domain
23920	Furin-like cysteine rich region
23920	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
23925	Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be
23925	Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be
23934	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
23936	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
23937	Mab-21 protein
23942	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
23943	C2 domain
23947	B-box zinc finger
23948	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
23948	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
23948	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
23957	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
23958	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
23958	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
23962	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
23964	Plant PEC family metallothionein
23970	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
23971	Adenylylsulfate kinase. Enzyme that catalyses the phosphorylation of adenylylsulfate to 3'-phosphoadenylylsulfate. This domain contains an ATP binding P-loop motif
23971	ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for P
23972	Adenylylsulfate kinase. Enzyme that catalyses the phosphorylation of adenylylsulfate to 3'-phosphoadenylylsulfate. This domain contains an ATP binding P-loop motif
23972	ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for P
23980	Phosphatidylethanolamine-binding protein
23984	3'5'-cyclic nucleotide phosphodiesterase
23985	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
23986	Acyl CoA binding protein
23988	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
23988	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
23992	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
24012	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
24012	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
24013	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
24014	Protein kinase domain
24017	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
24018	mRNA capping enzyme, catalytic domain. This family represents the ATP binding catalytic domain of the mRNA capping enzyme
24018	mRNA capping enzyme, C-terminal domain
24018	mRNA capping enzyme, catalytic domain. This family represents the ATP binding catalytic domain of the mRNA capping enzyme
24030	Ribosomal protein S12
24044	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
24045	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
24046	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
24050	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
24058	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
24059	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
24059	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
24060	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
24067	SRP54-type protein, helical bundle domain
24067	pfam02881, SRP54_N, SRP54-type protein, helical bundle domain
24067	SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins
24070	PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How
24084	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
24086	Protein kinase domain
24086	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
24087	CUB domain
24087	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
24088	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
24105	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
24108	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
24109	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
24112	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
24113	Homeobox domain
24115	Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown
24132	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
24136	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
24139	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes
24140	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl
24140	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl
24140	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl
24141	Lysosome-associated membrane glycoprotein (Lamp)
24146	PMP-22/EMP/MP20/Claudin family
24153	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
24153	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
24162	Histidine acid phosphatase
24165	Adenosine/AMP deaminase
24166	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
24167	7 transmembrane receptor (Secretin family)
24170	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
24171	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
24173	7 transmembrane receptor (rhodopsin family)
24174	7 transmembrane receptor (rhodopsin family)
24175	7 transmembrane receptor (rhodopsin family)
24176	7 transmembrane receptor (rhodopsin family)
24177	Serum albumin family
24179	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
24180	7 transmembrane receptor (rhodopsin family)
24182	7 transmembrane receptor (rhodopsin family)
24183	Adenylate kinase
24184	Adenylate kinase
24185	PH domain. PH stands for pleckstrin homology
24186	Serum albumin family
24188	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
24189	Fructose-bisphosphate aldolase class-I
24190	Fructose-bisphosphate aldolase class-I
24191	Fructose-bisphosphate aldolase class-I
24192	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
24197	Alkaline phosphatase
24203	Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwe
24203	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta st
24208	Androgen receptor
24208	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
24208	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
24210	Hepatic lectin, N-terminal domain
24210	Lectin C-type domain. This family includes both long and short form C-type
24214	Sodium / potassium ATPase beta chain
24216	E1-E2 ATPase
24216	Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
24216	Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
24216	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
24217	Sodium / potassium ATPase beta chain
24218	Heavy-metal-associated domain
24221	Neurohypophysial hormones, C-terminal Domain. N-terminal Domain is in hormone5
24224	Apoptosis regulator proteins, Bcl-2 family
24225	Nerve growth factor family
24227	Zinc finger, C3HC4 type (RING finger)
24227	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
24230	TspO/MBR family. Tryptophan-rich sensory protein (TspO) is an integral membrane protein that acts as a negative regulator of the expression of specific photosynthesis genes in response to oxygen/light. It is involved in the efflux of porphyrin intermedia
24231	Sushi domain (SCR repeat)
24231	von Willebrand factor type A domain
24231	Trypsin
24232	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
24232	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
24232	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f
24232	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
24233	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
24233	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
24233	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f
24233	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
24233	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
24233	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
24233	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fo
24233	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
24235	Sushi domain (SCR repeat)
24236	Sushi domain (SCR repeat)
24237	Sushi domain (SCR repeat)
24237	Thrombospondin type 1 domain
24237	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
24239	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
24247	7 transmembrane receptor (metabotropic glutamate family)
24247	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24248	Catalase
24249	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
24249	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ
24251	Tetraspanin family
24254	Carboxylesterase
24255	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
24255	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
24258	Granin (chromogranin or secretogranin)
24259	Granin (chromogranin or secretogranin)
24260	7 transmembrane receptor (rhodopsin family)
24261	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
24261	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
24264	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
24264	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
24265	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
24265	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
24268	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
24269	Zinc carboxypeptidase
24269	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
24271	Zinc carboxypeptidase
24271	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
24273	Hsp20/alpha crystallin family
24273	Alpha crystallin A chain, N terminal
24277	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
24278	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
24279	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
24279	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
24282	Somatotropin hormone family
24284	Casein
24293	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
24294	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
24296	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
24297	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
24298	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
24299	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
24303	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
24303	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
24307	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
24310	Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid
24311	Pyridoxal-dependent decarboxylase conserved domain
24312	Dihydrofolate reductase
24314	Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones
24315	Somatotropin hormone family
24316	7 transmembrane receptor (rhodopsin family)
24318	7 transmembrane receptor (rhodopsin family)
24323	Endothelin family
24324	Endothelin family
24326	7 transmembrane receptor (rhodopsin family)
24329	Furin-like cysteine rich region
24329	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
24331	Trypsin
24334	Enolase, N-terminal domain
24334	Enolase, C-terminal TIM barrel domain
24335	Erythropoietin/thrombopoietin
24337	Furin-like cysteine rich region
24337	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
24346	Carboxylesterase
24356	Ets-domain
24360	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
24361	Fanconi anaemia group C protein
24362	Fructose-1-6-bisphosphatase
24366	Fibrinogen beta and gamma chains, C-terminal globular domain
24366	Fibrinogen beta and gamma chains, C-terminal globular domain
24367	Fibrinogen beta and gamma chains, C-terminal globular domain
24367	ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits
24370	Fork head domain
24375	Alpha-L-fucosidase
24379	Pyridoxal-dependent decarboxylase conserved domain
24380	Pyridoxal-dependent decarboxylase conserved domain
24385	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
24387	Intermediate filament protein
24392	Connexin
24392	Gap junction alpha-1 protein (Cx43)
24397	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
24397	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
24404	Glutathione peroxidase
24406	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24407	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24407	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24408	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24409	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24410	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24411	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24412	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24413	Glucocorticoid receptor
24413	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
24413	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
24414	7 transmembrane receptor (metabotropic glutamate family)
24414	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24415	7 transmembrane receptor (metabotropic glutamate family)
24415	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24416	7 transmembrane receptor (metabotropic glutamate family)
24416	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24417	7 transmembrane receptor (metabotropic glutamate family)
24417	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24418	7 transmembrane receptor (metabotropic glutamate family)
24418	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24419	7 transmembrane receptor (metabotropic glutamate family)
24419	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24419	7 transmembrane receptor (metabotropic glutamate family)
24419	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24420	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
24420	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
24420	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
24421	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
24422	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
24423	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
24424	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
24426	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
24434	Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
24434	Glycosyl hydrolases family 2, sugar binding domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities and has a jelly-roll fold
24437	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
24438	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
24439	Metallo-beta-lactamase superfamily
24440	Globin
24443	Pyridoxal-dependent decarboxylase conserved domain
24443	Pyridoxal-dependent decarboxylase conserved domain
24446	Trypsin
24446	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
24448	7 transmembrane receptor (rhodopsin family)
24450	Hydroxymethylglutaryl-coenzyme A synthase
24450	Hydroxymethylglutaryl-coenzyme A synthase
24451	Heme oxygenase
24452	Homeobox domain
24456	Homeobox domain
24457	Homeobox domain
24459	Homeobox domain
24464	Sushi domain (SCR repeat)
24465	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
24468	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
24470	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
24471	Hsp20/alpha crystallin family
24472	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
24473	7 transmembrane receptor (rhodopsin family)
24481	Interferon alpha/beta domain
24482	Insulin/IGF/Relaxin family. Superfamily includes insulins
24484	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
24494	Interleukin-1 propeptide. The Interleukin-1 cytokines are translated as precursor proteins. The N terminal approx. 115 amino acids form a propeptide that is cleaved off to release the active interleukin-1
24495	Interleukin-3
24497	Interleukin 5
24503	Intermediate filament protein
24504	Transforming growth factor beta like domain
24505	Insulin/IGF/Relaxin family. Superfamily includes insulins
24506	Insulin/IGF/Relaxin family. Superfamily includes insulins
24508	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved tryp
24511	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
24516	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
24516	Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases
24517	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
24517	Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases
24518	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
24518	Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases
24520	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
24520	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
24520	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
24520	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
24521	Inward rectifier potassium channel
24523	Trypsin
24528	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzyme
24530	Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol
24538	Lipase
24538	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
24539	Lipase
24539	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
24546	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
24546	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
24547	Myelin basic protein
24548	Lectin C-type domain. This family includes both long and short form C-type
24553	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
24553	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
24553	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
24559	Protein kinase domain
24561	Methylpurine-DNA glycosylase (MPG). Methylpurine-DNA glycosylase is a base excision-repair protein. It is responsible for the hydrolysis of the deoxyribose N-glycosidic bond, excising 3-methyladenine and 3-methylguanine from damaged DNA
24566	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
24566	PAN domain. The PAN domain contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some
24575	Dynamin GTPase effector domain
24575	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
24575	Dynamin family
24575	Dynamin GTPase effector domain
24575	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
24577	Helix-loop-helix DNA-binding domain
24577	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo
24581	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo
24583	Myosin head (motor domain)
24586	Fibronectin type III domain
24587	Intermediate filament protein
24588	Intermediate filament protein
24596	TNFR/NGFR cysteine-rich region
24598	Flavodoxin
24598	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain
24598	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
24599	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain
24600	Flavodoxin
24600	FAD binding domain. This domain is found in sulfite reductase, NADPH cytochrome P450 reductase, Nitric oxide synthase and methionine synthase reductase
24602	Atrial natriuretic peptide
24603	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
24604	Pancreatic hormone peptide
24609	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
24613	7 transmembrane receptor (rhodopsin family)
24614	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
24615	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
24616	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
24617	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
24618	Lectin C-type domain. This family includes both long and short form C-type
24620	Lectin C-type domain. This family includes both long and short form C-type
24624	Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction
24626	3'5'-cyclic nucleotide phosphodiesterase
24627	3'5'-cyclic nucleotide phosphodiesterase
24638	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
24638	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta
24640	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
24640	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta
24644	Phosphoglycerate kinase
24648	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
24649	Protein kinase domain
24651	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
24654	C2 domain
24654	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
24654	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
24655	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
24656	Somatotropin hormone family
24658	Somatotropin hormone family
24660	PMP-22/EMP/MP20/Claudin family
24663	RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family
24664	Corticotropin ACTH domain
24677	Pancreatic hormone peptide
24679	Cyclic nucleotide-binding domain
24681	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
24683	Somatotropin hormone family
24684	Fibronectin type III domain
24686	Prion/Doppel alpha-helical domain. The prion protein is thought to be the infectious agent that causes transmissible spongiform encephalopathies, such as scrapie and BSE. It is thought that the prion protein can exist in two different forms: one is the no
24688	Intermediate filament protein
24693	Animal haem peroxidase
24693	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
24694	Parathyroid hormone family
24695	Parathyroid hormone family
24699	Fibronectin type III domain
24699	Fibronectin type III domain
24699	Protein-tyrosine phosphatase
24701	Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses
24703	Protein kinase domain
24703	Raf-like Ras-binding domain
24703	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
24705	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
24705	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
24708	Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted
24708	Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserved
24710	Lipocalin / cytosolic fatty-acid binding protein family
24711	Interphotoreceptor retinoid-binding protein. Interphotoreceptor retinoid-binding protein (IRBP) mediates retinoid trafficking between the photoreceptors and pigmented epithelium. This Pfam family represents a repeat domain found four times in the mammalia
24715	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
24717	7 transmembrane receptor (rhodopsin family)
24718	Reeler domain
24737	Class I Histocompatibility antigen, domains alpha 1 and 2
24747	Class I Histocompatibility antigen, domains alpha 1 and 2
24748	Class I Histocompatibility antigen, domains alpha 1 and 2
24765	Granin (chromogranin or secretogranin)
24766	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
24767	Amiloride-sensitive sodium channel
24768	Amiloride-sensitive sodium channel
24769	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
24770	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
24772	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
24773	Lectin C-type domain. This family includes both long and short form C-type
24775	Laminin G domain
24777	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f
24778	Sugar (and other) transporter
24779	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
24780	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
24781	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
24782	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
24783	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
24784	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
24785	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
24785	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
24786	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
24791	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand
24794	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
24795	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
24797	Somatostatin/Cortistatin family. Members of this family are hormones. Somatostatin inhibits the release of somatotropin. Cortistatin is a peptide that is related to the Somatostatins that is found to depresses neuronal electrical activity but, unlike som
24800	Sulfatase
24803	Synaptobrevin
24804	Synaptophysin / synaptoporin
24807	7 transmembrane receptor (rhodopsin family)
24808	7 transmembrane receptor (rhodopsin family)
24816	7 transmembrane receptor (rhodopsin family)
24818	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
24822	Uncharacterized protein family UPF0005
24825	Transferrin
24826	Carboxylesterase
24826	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
24831	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
24831	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
24833	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand
24842	P53
24848	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
24848	ACT domain. This family of domains generally have a regulatory role. ACT domains are linked to a wide range of metabolic enzymes that are regulated by amino acid concentration. Pairs of ACT domains bind specifically to a particular amino acid leading to r
24849	Triosephosphate isomerase
24851	Tropomyosin
24852	Tropomyosin
24854	Clusterin
24854	Clusterin
24855	Tuberin
24855	Rap/ran-GAP
24856	Transthyretin precursor (formerly prealbumin). Transthyretin is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. Mutations in the human transthyretin are associated with several genetic disorders
24860	Mitochondrial carrier protein
24861	UDP-glucoronosyl and UDP-glucosyl transferase
24862	UDP-glucoronosyl and UDP-glucosyl transferase
24864	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
24873	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
24873	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
24874	von Hippel-Lindau disease tumor suppressor protein. VHL forms a ternary complex with the elonginB and elonginC proteins. This complex binds Cul2, which then is involved in regulation of vascular endothelial growth factor mRNA
24875	7 transmembrane receptor (Secretin family)
24875	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
24881	wnt family
24882	wnt family
24883	Wilm's tumour protein
24886	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
24887	Apoptosis regulator proteins, Bcl-2 family
24888	Apoptosis regulator proteins, Bcl-2 family
24889	7 transmembrane receptor (rhodopsin family)
24890	Oestrogen receptor
24890	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
24890	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
24894	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
24895	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
24896	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
24898	Sodium:neurotransmitter symporter family
24902	Sulfotransferase protein
24903	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
24904	Sugar (and other) transporter
24906	Phospholipase A2 inhibitor
24907	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
24912	Sulfotransferase protein
24912	Sulfotransferase protein
24913	NLP/P60 family. The function of this domain is unknown. It is found in several lipoproteins
24914	Lysyl oxidase
24915	LIM domain. This family represents two copies of the LIM structural domain
24915	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
24918	SH2 domain
24918	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
24918	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
24918	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
24922	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
24922	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
24923	Syntaxin
24924	PTN/MK heparin-binding protein family
24925	7 transmembrane receptor (rhodopsin family)
24929	7 transmembrane receptor (rhodopsin family)
24933	Lectin C-type domain. This family includes both long and short form C-type
24936	Tetraspanin family
24938	7 transmembrane receptor (rhodopsin family)
24941	Heavy-metal-associated domain
24941	Heavy-metal-associated domain
24942	GDP dissociation inhibitor
24943	Myelin proteolipid protein (PLP or lipophilin)
24944	Lysosome-associated membrane glycoprotein (Lamp)
24946	Trypsin
24947	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
24947	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
24948	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
24949	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
24950	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
24952	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
24953	7 transmembrane receptor (Secretin family)
24953	7 transmembrane receptor (Secretin family)
24954	Fibronectin type III domain
24954	Furin-like cysteine rich region
24954	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
24956	MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also
24957	Glutamine synthetase, catalytic domain
24962	Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OA
24967	Proteasome A-type and B-type
25006	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
25006	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
25007	7 transmembrane receptor (rhodopsin family)
25008	TNF(Tumor Necrosis Factor) family
25009	tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included
25010	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
25011	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25014	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
25017	Myosin head (motor domain)
25021	von Willebrand factor type A domain
25022	Protein kinase domain
25023	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
25024	7 transmembrane receptor (Secretin family)
25026	Adrenomedullin
25028	Adenosine/AMP deaminase
25029	7 transmembrane receptor (Secretin family)
25031	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
25033	7 transmembrane receptor (rhodopsin family)
25035	Oxidoreductase FAD-binding domain
25040	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
25041	ATP P2X receptor
25045	Acyl CoA binding protein
25051	7 transmembrane receptor (Secretin family)
25052	Trypsin
25053	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25055	ab-hydrolase associated lipase region
25056	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
25058	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
25059	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
25060	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
25060	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF03727. Some members of the family have two copies of each of these domains
25061	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
25073	CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion
25075	7 transmembrane receptor (rhodopsin family)
25080	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
25081	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
25082	BRCA2 repeat. The alignment covers only the most conserved region of the repeat
25083	7 transmembrane receptor (rhodopsin family)
25085	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
25086	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25087	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
25095	Adenosine/AMP deaminase
25096	PX domain. PX domains bind to phosphoinositides
25097	PX domain. PX domains bind to phosphoinositides
25098	Fork head domain
25099	Fork head domain
25100	Fork head domain
25101	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
25101	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
25102	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
25102	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
25103	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
25103	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
25105	Atrial natriuretic peptide
25106	Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis
25107	7 transmembrane receptor (rhodopsin family)
25108	7 transmembrane receptor (rhodopsin family)
25111	7 transmembrane receptor (rhodopsin family)
25112	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
25116	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
25117	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
25118	von Willebrand factor type A domain
25121	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
25122	Amiloride-sensitive sodium channel
25124	SH2 domain
25124	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
25124	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
25124	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
25125	SH2 domain
25125	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
25125	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fa
25125	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
25126	SH2 domain
25126	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. T
25126	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STA
25128	Zona pellucida-like domain
25130	Syntaxin
25132	DIL domain. The DIL domain has no known function
25136	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
25139	Sugar (and other) transporter
25146	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25147	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25149	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
25149	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
25150	SH2 domain
25150	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
25151	Cation-independent mannose-6-phosphate receptor repeat. The cation-independent mannose-6-phosphate receptor contains 15 copies of a repeat
25153	7 transmembrane receptor (rhodopsin family)
25154	Progesterone receptor
25154	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
25155	SH2 domain
25156	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
25156	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
25159	GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the d
25162	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
25165	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
25166	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
25169	Tetraspanin family
25176	Mitochondrial carrier protein
25183	GDP dissociation inhibitor
25186	Ephrin
25187	7 transmembrane receptor (rhodopsin family)
25192	Lectin C-type domain. This family includes both long and short form C-type
25193	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
25195	7 transmembrane receptor (rhodopsin family)
25196	Transforming growth factor beta like domain
25196	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
25197	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
25197	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
25202	Adenylate and Guanylate cyclase catalytic domain
25204	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
25216	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
25218	Vps52 / Sac2 family. Vps52 complexes with Vps53 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events
25223	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
25227	Sulfatase
25229	7 transmembrane receptor (rhodopsin family)
25230	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
25231	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
25232	Fibronectin type III domain
25236	Glypican
25237	Protein kinase domain
25237	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
25238	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
25240	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
25241	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
25242	Helix-loop-helix DNA-binding domain
25243	Helix-loop-helix DNA-binding domain
25244	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
25244	Transketolase, thiamine diphosphate binding domain. This family includes transketolase enzymes EC:2.2.1.1. and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit EC:1.2.4.4. Both these enzymes utilise thiamine pyrophosphate as a cofacto
25245	7 transmembrane receptor (rhodopsin family)
25245	7 transmembrane receptor (rhodopsin family)
25248	7 transmembrane receptor (rhodopsin family)
25250	Cytochrome oxidase c subunit VIII. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIII
25251	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25253	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
25254	Sodium:sulfate symporter transmembrane region
25256	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
25259	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
25261	Helix-loop-helix DNA-binding domain
25264	7 transmembrane receptor (rhodopsin family)
25265	7 transmembrane receptor (rhodopsin family)
25266	Platelet-derived growth factor (PDGF)
25266	Platelet-derived growth factor (PDGF)
25268	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
25268	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
25271	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
25271	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
25273	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
25274	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
25275	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme contai
25277	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
25277	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
25278	Cytochrome c oxidase subunit VIa
25280	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
25281	Zn-finger in Ran binding protein and others
25283	Glutamate-cysteine ligase. This family represents the catalytic subunit of glutamate-cysteine ligase (E.C. 6.3.2.2), also known as gamma-glutamylcysteine synthetase (GCS). This enzyme catalyses the rate limiting step in the biosynthesis of glutathione. T
25285	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
25289	Adenylate and Guanylate cyclase catalytic domain
25290	Lipoxygenase
25291	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
25293	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
25294	Class I Histocompatibility antigen, domains alpha 1 and 2
25295	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
25296	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
25297	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
25298	Gastrin/cholecystokinin family
25302	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
25302	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
25307	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
25308	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
25311	Fibronectin type III domain
25314	PMP-22/EMP/MP20/Claudin family
25315	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
25316	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
25317	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
25319	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
25320	Gastrin/cholecystokinin family
25321	7 transmembrane receptor (Secretin family)
25322	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
25323	7 transmembrane receptor (rhodopsin family)
25324	7 transmembrane receptor (rhodopsin family)
25325	Interleukin 10
25327	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
25327	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
25328	Lysosome-associated membrane glycoprotein (Lamp)
25334	Helix-loop-helix DNA-binding domain
25335	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
25336	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
25339	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
25340	7 transmembrane receptor (rhodopsin family)
25342	7 transmembrane receptor (rhodopsin family)
25343	Phosducin
25344	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
25344	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
25346	Fibronectin type III domain
25350	Lectin C-type domain. This family includes both long and short form C-type
25351	Sugar (and other) transporter
25352	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
25353	Osteopontin
25354	7 transmembrane receptor (rhodopsin family)
25355	Sulfotransferase protein
25356	Fibronectin type III domain
25359	LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1
25360	7 transmembrane receptor (rhodopsin family)
25364	Amiloride-sensitive sodium channel
25365	Actin
25366	Protein kinase domain
25366	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with t
25367	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
25368	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es
25369	7 transmembrane receptor (rhodopsin family)
25370	7 transmembrane receptor (rhodopsin family)
25372	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
25373	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
25374	Delta-aminolevulinic acid dehydratase
25375	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
25377	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
25380	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
25382	Amyloid A4 extracellular domain
25382	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
25383	Cytidine and deoxycytidylate deaminase zinc-binding region
25385	TNF(Tumor Necrosis Factor) family
25386	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
25387	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
25387	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
25388	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
25388	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
25390	Sodium / potassium ATPase beta chain
25393	Extracellular link domain
25394	Intermediate filament protein
25397	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
25397	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
25397	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fo
25398	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
25399	Cache domain
25401	OAR domain
25401	Homeobox domain
25403	Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins
25404	Caveolin
25405	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
25406	Extracellular link domain
25409	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
25411	Cyclic nucleotide-binding domain
25411	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
25413	Choline/Carnitine o-acyltransferase
25415	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
25416	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
25416	N-acetylglucosamine-6-phosphate deacetylase. This family are predicted to adopt a TIM barrel fold. This is family of N-acetylglucosamine-6-phosphate deacetylases, EC:3.5.1.25 These enzymes catalyse the reaction N-acetyl-D-glucosamine 6-phosphate + H2O <=>
25418	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
25420	Hsp20/alpha crystallin family
25420	Alpha crystallin A chain, N terminal
25421	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
25422	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
25423	Papain family cysteine protease
25424	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
25425	Papain family cysteine protease
25426	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25427	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25428	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25429	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25430	Iodothyronine deiodinase
25431	Homeobox domain
25432	7 transmembrane receptor (rhodopsin family)
25434	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand
25435	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a
25437	LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1
25439	7 transmembrane receptor (rhodopsin family)
25440	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
25444	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
25447	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
25449	7 transmembrane receptor (rhodopsin family)
25450	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
25450	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
25451	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
25451	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
25453	Transforming growth factor beta like domain
25454	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
25456	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
25456	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
25457	7 transmembrane receptor (rhodopsin family)
25458	Eukaryotic glutathione synthase
25458	Eukaryotic glutathione synthase, ATP binding domain
25461	7 transmembrane receptor (rhodopsin family)
25462	Chaperonin 10 Kd subunit
25464	Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes,
25466	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
25467	PTB domain (IRS-1 type)
25468	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
25468	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
25469	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
25469	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
25470	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
25470	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
25471	Slow voltage-gated potassium channel
25472	Inward rectifier potassium channel
25473	Laminin N-terminal (Domain VI)
25473	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
25474	Galactoside-binding lectin
25477	7 transmembrane receptor (rhodopsin family)
25479	Vacuolar protein sorting-associated protein 35. Vacuolar protein sorting-associated protein (Vps) 35 is one of around 50 proteins involved in protein trafficking. In particular, Vps35 assembles into a retromer complex with at least four other proteins Vps
25480	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
25481	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
25481	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
25483	Homeobox domain
25484	Myosin head (motor domain)
25484	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
25486	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
25489	C2 domain
25489	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
25489	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
25490	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
25491	Intermediate filament protein
25492	CTF/NF-I family
25493	Coatomer WD associated domain
25493	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
25497	RanBP1 domain
25498	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
25502	Ornithine decarboxylase antizyme
25504	Neurohypophysial hormones, C-terminal Domain. N-terminal Domain is in hormone5
25505	ATP P2X receptor
25506	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
25507	Giardia variant-specific surface protein
25507	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
25508	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
25508	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
25509	Homeobox domain
25509	'Paired box' domain
25511	Phospholipid methyltransferase. The S. cerevisiae phospholipid methyltransferase (EC:2.1.1.16) has a broad substrate specificity of unsaturated phospholipids
25513	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
25514	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
25515	POLO box duplicated region
25516	Vertebrate endogenous opioids neuropeptide
25518	Cyclophilin type peptidyl-prolyl cis-trans isomerase
25522	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
25524	Saposin A-type domain
25525	Transforming growth factor beta like domain
25526	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
25527	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25527	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25529	Fibronectin type III domain
25534	Tetraspanin family
25538	Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes
25539	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
25539	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
25540	Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein
25541	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
25542	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
25544	Lectin C-type domain. This family includes both long and short form C-type
25546	Homeobox domain
25547	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
25548	Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor
25550	Sodium:dicarboxylate symporter family
25551	Sugar (and other) transporter
25552	Sodium:solute symporter family
25553	Sodium:neurotransmitter symporter family
25553	Serotonin (5-HT) neurotransmitter transporter, N-terminus
25555	7 transmembrane receptor (rhodopsin family)
25558	Sec1 family
25563	Trefoil (P-type) domain
25564	TSC-22/dip/bun family
25565	Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation
25566	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
25567	Fibronectin type III domain
25570	7 transmembrane receptor (rhodopsin family)
25571	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
25572	TNFR/NGFR cysteine-rich region
25574	HMG (high mobility group) box
25575	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
25576	14-3-3 protein
25577	14-3-3 protein
25578	14-3-3 protein
25579	Protein kinase domain
25583	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
25584	Tissue factor
25586	Alkaline phosphatase
25587	Helix-loop-helix DNA-binding domain
25588	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
25590	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
25590	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
25591	Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc
25591	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
25591	pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini
25592	Laminin G domain
25592	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
25592	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
25592	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand
25592	Laminin G domain
25592	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
25592	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
25592	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
25593	7 transmembrane receptor (rhodopsin family)
25595	Tau and MAP protein, tubulin-binding repeat
25597	PH domain. PH stands for pleckstrin homology
25597	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
25597	BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region
25598	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
25600	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
25601	7 transmembrane receptor (rhodopsin family)
25604	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
25605	Orexin receptor type 2
25605	7 transmembrane receptor (rhodopsin family)
25607	Homeobox domain
25608	Leptin
25609	Tub family
25610	Interleukin-6/G-CSF/MGF family
25613	Fibronectin type III domain
25613	Eukaryotic-type carbonic anhydrase
25614	Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta
25615	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
25616	Insulin/IGF/Relaxin family. Superfamily includes insulins
25617	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
25619	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
25620	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
25620	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
25621	Tetraspanin family
25622	SH2 domain
25623	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari
25624	Synaptobrevin
25625	TNFR/NGFR cysteine-rich region
25625	TNFR/NGFR cysteine-rich region
25626	Intermediate filament protein
25626	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
25626	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
25627	Trypsin
25629	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
25631	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
25631	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
25632	Globin
25635	7 transmembrane receptor (rhodopsin family)
25637	7 transmembrane receptor (rhodopsin family)
25638	3'5'-cyclic nucleotide phosphodiesterase
25641	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
25642	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
25643	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
25644	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
25644	Transforming growth factor beta like domain
25644	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
25645	7 transmembrane receptor (rhodopsin family)
25646	Homeobox domain
25646	Otx1 transcription factor
25650	Sodium / potassium ATPase beta chain
25651	Sushi domain (SCR repeat)
25651	Lectin C-type domain. This family includes both long and short form C-type
25652	7 transmembrane receptor (rhodopsin family)
25654	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
25655	Connexin
25656	Guanylin precursor
25657	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
25659	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an es
25661	Fibronectin type I domain
25661	Fibronectin type III domain
25662	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
25663	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
25664	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
25664	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
25665	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
25666	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
25668	ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion
25669	Zinc carboxypeptidase
25670	Interleukin 15. Interleukin-15 (IL-15) is a cytokine that possesses a variety of biological functions, including stimulation and maintenance of cellular immune responses
25671	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
25671	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
25672	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
25672	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
25673	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
25674	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
25674	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
25675	Hydroxymethylglutaryl-coenzyme A reductase
25675	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
25676	SH2 domain
25676	PH domain. PH stands for pleckstrin homology
25676	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
25678	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
25679	MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function
25679	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
25681	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
25682	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
25682	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
25684	MAM domain. An extracellular domain found in many receptors
25684	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
25684	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
25684	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
25685	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
25686	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
25687	Caldesmon
25688	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
25689	7 transmembrane receptor (rhodopsin family)
25690	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
25690	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
25691	Fibronectin type III domain
25691	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
25692	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
25696	Low-density lipoprotein receptor domain class A
25696	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
25698	Arginosuccinate synthase. This family contains a PP-loop motif
25699	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
25699	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
25701	Helix-loop-helix DNA-binding domain
25702	Lipase
25702	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
25703	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
25704	Sushi domain (SCR repeat)
25705	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
25706	7 transmembrane receptor (rhodopsin family)
25707	Ciliary neurotrophic factor
25708	Mitochondrial carrier protein
25711	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
25712	Interferon gamma
25713	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
25714	Helix-loop-helix DNA-binding domain
25714	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates
25715	Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined
25716	C2 domain
25717	Transforming growth factor beta like domain
25717	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
25718	Fibronectin type III domain
25718	Furin-like cysteine rich region
25718	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
25720	Helix-loop-helix DNA-binding domain
25722	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
25723	Prepro-orexin
25724	Fibronectin type III domain
25724	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
25725	Cyclic nucleotide-binding domain
25726	7 transmembrane receptor (rhodopsin family)
25727	MAM domain. An extracellular domain found in many receptors
25727	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
25727	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
25728	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
25730	Nerve growth factor family
25731	C2 domain
25732	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte
25734	SH2 domain
25734	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
25735	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
25735	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
25736	Kv2 voltage-gated K+ channel
25736	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
25736	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
25737	Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA
25738	SH2 domain
25738	PH domain. PH stands for pleckstrin homology
25738	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
25738	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
25739	Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses
25740	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
25741	Phosphofructokinase
25742	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
25743	Inward rectifier potassium channel
25744	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
25744	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
25745	Myosin head (motor domain)
25745	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
25747	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
25747	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
25748	Aminotransferase class I and II
25749	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
25750	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
25751	Dynamin family
25751	PH domain. PH stands for pleckstrin homology
25752	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
25753	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
25753	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
25756	Choline/Carnitine o-acyltransferase
25757	Choline/Carnitine o-acyltransferase
25759	SH2 domain
25759	Phosphotyrosine interaction domain (PTB/PID)
25769	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
25778	Protein kinase domain
25780	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
25780	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
25794	Fibroblast growth factor
25797	Glutaminyl cyclase. Glutaminyl cyclase catalyses the formation of the pyroglutamyl residue present at the amino terminus of numerous secretory peptides and proteins. Glutaminyl cyclase posses a zinc aminopeptidase domain in which the four functionally im
25800	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
25802	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
25806	Homeobox domain
25807	UBA domain
25809	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
25816	Glucose inhibited division protein A
25819	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
25820	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
25821	Glucose inhibited division protein A
25821	Glucose inhibited division protein A
25823	Trypsin
25825	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
25825	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
25825	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
25828	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
25829	Domain of unknown function
25830	Sulfotransferase protein
25830	Sulfotransferase protein
25831	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
25833	Homeobox domain
25836	Caldesmon
25836	Caldesmon
25841	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
25843	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
25851	PH domain. PH stands for pleckstrin homology
25851	PH domain. PH stands for pleckstrin homology
25861	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
25862	Zn-finger in ubiquitin-hydrolases and other protein
25865	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
25870	Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown
25873	Ribosomal protein L36e
25873	Ribosomal protein L36e
25874	Uncharacterised protein family (UPF0041)
25878	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
25879	Sof1-like domain. Sof1 is essential for cell growth and is a component of the nucleolar rRNA processing machinery
25884	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
25885	RNA polymerase alpha subunit
25885	RNA polymerase A/beta'/A" subunit
25888	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
25893	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
25894	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
25897	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
25900	Intermediate filament protein
25900	Intermediate filament protein
25902	Formate--tetrahydrofolate ligase
25903	Olfactomedin-like domain
25912	6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each
25913	Telomere-binding protein alpha subunit, N-terminal domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. A
25917	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
25921	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
25923	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
25924	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
25930	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
25943	Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans
25947	Sugar (and other) transporter
25948	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
25948	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
25953	Metallo-beta-lactamase superfamily
25953	Metallo-beta-lactamase superfamily
25959	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
25960	7 transmembrane receptor (Secretin family)
25960	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
25966	C2 domain
25975	MAM domain. An extracellular domain found in many receptors
25976	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
25976	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
25976	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
25976	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
25978	Eukaryotic protein of unknown function, DUF279
25981	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
25983	Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown
25984	Intermediate filament protein
25984	Intermediate filament protein
25989	MIT domain
25989	Protein kinase domain
25992	Fibronectin type III domain
25992	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
26002	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
26002	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
26009	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
26013	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
26013	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
26019	Up-frameshift suppressor 2. Transcripts harboring premature signals for translation termination are recognized and rapidly degraded by eukaryotic cells through a pathway known as nonsense-mediated mRNA decay. In Saccharomyces cerevisiae, three trans-acti
26019	Up-frameshift suppressor 2. Transcripts harboring premature signals for translation termination are recognized and rapidly degraded by eukaryotic cells through a pathway known as nonsense-mediated mRNA decay. In Saccharomyces cerevisiae, three trans-acti
26031	Oxysterol-binding protein
26031	Oxysterol-binding protein
26031	Oxysterol-binding protein
26031	Oxysterol-binding protein
26031	Oxysterol-binding protein
26031	Oxysterol-binding protein
26032	Extracellular link domain
26033	Lectin C-type domain. This family includes both long and short form C-type
26033	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
26037	Rap/ran-GAP
26038	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
26043	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
26047	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
26047	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
26048	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
26048	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
26048	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
26050	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
26052	Dynamin family
26052	Dynamin GTPase effector domain
26052	PH domain. PH stands for pleckstrin homology
26054	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
26060	PH domain. PH stands for pleckstrin homology
26063	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
26088	VHS domain. Domain present in VPS-27, Hrs and STAM
26088	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
26088	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
26103	Fibronectin type III domain
26108	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
26108	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
26130	Vacuolar sorting protein 9 (VPS9) domain
26130	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
26136	LIM domain. This family represents two copies of the LIM structural domain
26136	LIM domain. This family represents two copies of the LIM structural domain
26137	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
26137	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
26140	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
26147	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
26153	Kinesin motor domain
26155	Uncharacterised protein family (UPF0120)
26156	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
26157	GTPase of unknown function
26168	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
26184	7 transmembrane receptor (rhodopsin family)
26188	7 transmembrane receptor (rhodopsin family)
26189	7 transmembrane receptor (rhodopsin family)
26205	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
26207	Phosphatidylinositol transfer protein
26207	Phosphatidylinositol transfer protein
26211	7 transmembrane receptor (rhodopsin family)
26219	7 transmembrane receptor (rhodopsin family)
26223	F-box domain
26224	F-box domain
26225	ADP-ribosylation factor family
26225	ADP-ribosylation factor family
26229	Glycosyltransferase family 43
26232	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate
26234	F-box domain
26234	F-box domain
26245	7 transmembrane receptor (rhodopsin family)
26249	BTB/POZ domain. The BTB (for BR-C
26249	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Kelch is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Sev
26251	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
26251	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
26257	Homeobox domain
26259	F-box domain
26261	F-box domain
26261	F-box domain
26263	F-box domain
26263	F-box domain
26266	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
26267	F-box domain
26268	F-box domain
26268	F-box domain
26268	F-box domain
26270	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate
26272	F-box domain
26272	F-box domain
26273	F-box domain
26273	F-box domain
26276	Sec1 family
26279	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
26280	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
26281	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
26285	PMP-22/EMP/MP20/Claudin family
26286	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
26289	Adenylate kinase
26289	Adenylate kinase
26290	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
26295	TPR Domain
26295	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
26301	Glycosyltransferase family 6
26333	7 transmembrane receptor (rhodopsin family)
26338	7 transmembrane receptor (rhodopsin family)
26341	7 transmembrane receptor (rhodopsin family)
26353	Hsp20/alpha crystallin family
26354	GTPase of unknown function
26358	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
26359	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
26361	7 transmembrane receptor (rhodopsin family)
26362	Fibronectin type III domain
26363	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
26364	7 transmembrane receptor (Secretin family)
26364	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
26369	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ
26372	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
26373	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
26373	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate de
26377	SH2 domain
26377	PH domain. PH stands for pleckstrin homology
26378	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
26379	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
26380	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
26380	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
26381	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
26381	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
26384	Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase
26385	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
26386	HSF-type DNA-binding
26388	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
26388	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
26388	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
26394	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
26399	Protein kinase domain
26403	Protein kinase domain
26403	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
26404	Protein kinase domain
26406	Protein kinase domain
26409	Protein kinase domain
26411	Protein kinase domain
26412	Protein kinase domain
26412	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
26421	Somatotropin hormone family
26422	Beige/BEACH domain
26423	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
26423	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
26424	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
26424	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
26431	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
26434	Prion/Doppel alpha-helical domain. The prion protein is thought to be the infectious agent that causes transmissible spongiform encephalopathies, such as scrapie and BSE. It is thought that the prion protein can exist in two different forms: one is the n
26440	Proteasome A-type and B-type
26445	Proteasome A-type and B-type
26446	Proteasome A-type and B-type
26447	impB/mucB/samB family. These proteins are involved in UV protection
26462	Pyridine nucleotide-disulphide oxidoreductase, dimerisation domain. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases
26462	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
26464	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
26468	Homeobox domain
26468	LIM domain. This family represents two copies of the LIM structural domain
26470	Sushi domain (SCR repeat)
26476	7 transmembrane receptor (rhodopsin family)
26492	7 transmembrane receptor (rhodopsin family)
26493	7 transmembrane receptor (rhodopsin family)
26496	7 transmembrane receptor (rhodopsin family)
26499	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
26502	pfam02906, Fe_hyd_lg_C, Iron only hydrogenase large subunit, C-terminal domain
26502	pfam02906, Fe_hyd_lg_C, Iron only hydrogenase large subunit, C-terminal domain
26508	Helix-loop-helix DNA-binding domain
26509	C2 domain
26509	C2 domain
26515	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
26517	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
26519	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
26520	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
26521	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
26523	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
26523	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
26526	Tetraspanin family
26529	7 transmembrane receptor (rhodopsin family)
26531	7 transmembrane receptor (rhodopsin family)
26532	7 transmembrane receptor (rhodopsin family)
26538	7 transmembrane receptor (rhodopsin family)
26539	7 transmembrane receptor (rhodopsin family)
26554	Cullin family
26556	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
26556	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
26557	Dor1-like family. Dor1 is involved in vesicle targeting to the yeast Golgi apparatus and complexes with a number of other trafficking proteins, which include Sec34 and Sec35
26557	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
26557	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl
26558	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl
26561	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
26563	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
26564	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
26565	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
26568	AMP-binding enzyme
26569	AMP-binding enzyme
26575	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
26577	CUB domain
26581	Homeobox domain
26582	Homeobox domain
26583	Homeobox domain
26584	Homeobox domain
26585	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb
26658	7 transmembrane receptor (rhodopsin family)
26659	7 transmembrane receptor (rhodopsin family)
26689	7 transmembrane receptor (rhodopsin family)
26692	7 transmembrane receptor (rhodopsin family)
26696	7 transmembrane receptor (rhodopsin family)
26707	7 transmembrane receptor (rhodopsin family)
26707	7 transmembrane receptor (rhodopsin family)
26707	7 transmembrane receptor (rhodopsin family)
26716	7 transmembrane receptor (rhodopsin family)
26750	MIT domain
26750	PX domain. PX domains bind to phosphoinositides
26757	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
26759	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C
26873	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
26873	Hydantoinase B/oxoprolinase. This family includes N-methylhydaintoinase B which converts hydantoin to N-carbamyl-amino acids, and 5-oxoprolinase EC:3.5.2.9 which catalyses the formation of L-glutamate from 5-oxo-L-proline. These enzymes are part of the ox
26877	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
26878	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
26879	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
26887	Sulfotransferase protein
26893	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
26903	C2 domain
26910	Helix-loop-helix DNA-binding domain
26914	Core histone H2A/H2B/H3/H4
26914	Appr-1"-p processing enzyme family. This domain is found in a number of otherwise unrelated proteins. This domain is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA vi
26926	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
26931	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
26932	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
26934	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
26938	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
26942	TPR Domain
26943	TMS membrane protein/tumour differentially expressed protein (TDE)
26944	Papain family cysteine protease
26946	Transient receptor
26949	Zinc-binding dehydrogenase
26955	DIL domain. The DIL domain has no known function
26955	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
26955	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
26956	Adenylate kinase
26960	Beige/BEACH domain
26962	Ribosomal Proteins L2, C-terminal domain
26962	Ribosomal Proteins L2, RNA binding domain
26965	Cullin family
26968	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
26970	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
26972	SPO11 homologue
26984	Synaptobrevin
26985	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
26986	Poly-adenylate binding protein, unique domain
26987	Eukaryotic initiation factor 4E
26992	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
26998	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
27000	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
27000	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
27010	Thiamin pyrophosphokinase, catalytic domain. Family of thiamin pyrophosphokinase (EC:2.7.6.2). Thiamin pyrophosphokinase (TPK) catalyzes the transfer of a pyrophosphate group from ATP to vitamin B1 (thiamin) to form the coenzyme thiamin pyrophosphate (TP
27012	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
27012	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
27022	Fork head domain
27023	Fork head domain
27027	Tetraspanin family
27029	Sulfatase
27033	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
27035	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
27035	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
27035	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
27044	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
27045	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
27049	Ets-domain
27052	Lipase/Acylhydrolase with GDSL-like motif
27053	Glutamine amidotransferases class-II
27053	Asparagine synthase. This family is always found associated with pfam00310. Members of this family catalyse the conversion of aspartate to asparagine
27054	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/2
27055	FKBP-type peptidyl-prolyl cis-trans isomerase
27060	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
27067	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
27068	Inorganic pyrophosphatase
27068	Inorganic pyrophosphatase
27068	Inorganic pyrophosphatase
27068	Inorganic pyrophosphatase
27071	SH2 domain
27071	PH domain. PH stands for pleckstrin homology
27074	Lysosome-associated membrane glycoprotein (Lamp)
27075	Tetraspanin family
27087	Glycosyltransferase family 43
27087	Glycosyltransferase family 43
27089	pfam02939, UcrQ, UcrQ family. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This family represents the 9.5 kDa subunit of the complex
27090	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
27090	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
27090	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
27091	PMP-22/EMP/MP20/Claudin family
27091	PMP-22/EMP/MP20/Claudin family
27092	PMP-22/EMP/MP20/Claudin family
27094	Calcium-activated potassium channel, beta subunit
27094	Calcium-activated potassium channel, beta subunit
27094	Calcium-activated potassium channel, beta subunit
27094	Calcium-activated potassium channel, beta subunit
27095	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
27096	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
27102	Protein kinase domain
27103	Protein kinase domain
27103	tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt
27104	Protein kinase domain
27104	tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt
27106	Arrestin (or S-antigen)
27107	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
27115	3'5'-cyclic nucleotide phosphodiesterase
27125	Monocarboxylate transporter
27126	HMG (high mobility group) box
27128	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
27129	Hsp20/alpha crystallin family
27131	PX domain. PX domains bind to phosphoinositides
27131	PX domain. PX domains bind to phosphoinositides
27132	C2 domain
27132	C2 domain
27133	Cyclic nucleotide-binding domain
27133	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
27133	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
27133	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
27133	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
27133	Cyclic nucleotide-binding domain
27133	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
27133	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
27136	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
27139	Ribosomal protein S26e
27140	Homeobox domain
27141	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
27145	Intermediate filament protein
27145	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
27147	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
27147	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
27147	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
27150	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
27150	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
27151	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
27151	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
27154	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
27154	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
27154	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
27161	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
27161	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
27163	Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in
27163	Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in
27173	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
27176	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
27177	Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18
27177	Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18
27197	7 transmembrane receptor (rhodopsin family)
27198	7 transmembrane receptor (rhodopsin family)
27199	7 transmembrane receptor (rhodopsin family)
27201	7 transmembrane receptor (rhodopsin family)
27202	7 transmembrane receptor (rhodopsin family)
27204	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
27206	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
27207	Ribosomal protein S17
27216	7 transmembrane receptor (rhodopsin family)
27217	Homeobox domain
27223	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
27226	Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl
27229	Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin
27233	Sulfotransferase protein
27239	7 transmembrane receptor (rhodopsin family)
27243	Eukaryotic protein of unknown function, DUF279
27246	Zinc finger, C3HC4 type (RING finger)
27252	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
27255	Fibronectin type III domain
27256	Fibronectin type III domain
27257	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
27260	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
27261	PTB domain (IRS-1 type)
27263	Protein kinase domain
27263	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
27273	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
27283	Papain family cysteine protease
27284	Sulfotransferase protein
27285	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
27286	HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unc
27287	Homeobox domain
27295	LIM domain. This family represents two copies of the LIM structural domain
27299	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
27299	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
27302	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
27306	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
27319	Helix-loop-helix DNA-binding domain
27324	HMG (high mobility group) box
27328	Cadherin domain
27328	Cadherin domain
27328	Cadherin domain
27328	Cadherin domain
27334	7 transmembrane receptor (rhodopsin family)
27336	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
27342	Vacuolar sorting protein 9 (VPS9) domain
27343	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
27345	Calcium-activated potassium channel, beta subunit
27350	Cytidine and deoxycytidylate deaminase zinc-binding region
27352	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
27354	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
27354	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
27359	C2 domain
27359	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
27360	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
27366	Mitosis protein DIM1
27367	Ribosomal protein L3
27369	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
27370	Ribosomal protein S26e
27372	Somatotropin hormone family
27373	Protein kinase domain
27376	Mitochondrial carrier protein
27383	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
27384	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
27385	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
27388	Phosphatidyl serine synthase. Phosphatidyl serine synthase is also known as serine exchange enzyme. This family represents eukaryotic PSS I and II which are membrane bound proteins which catalyses the replacement of the head group of a phospholipid (phos
27395	Ribosomal protein L15 amino terminal region. This family is always associated with pfam00256. This family is diagnostic of ribosomal L15 proteins
27398	Ribosomal Proteins L2, C-terminal domain
27399	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
27400	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
27400	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
27402	2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain
27402	Biotin-requiring enzyme. This alignment covers two families, the conserved lysine residue binds biotin in one group and lipoic acid in the other
27402	e3 binding domain. This family represents a small domain of the E2 subunit of 2-oxo-acid dehydrogenases responsible for the binding of the E3 subunit
27402	2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain
27402	Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins
27406	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
27410	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
27411	Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue
27418	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
27418	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
27421	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
27421	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
27433	ATPase family associated with various cellular activities (AAA)
27436	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes
27445	C2 domain
27445	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
27643	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
27681	EAP30. EAP30 is a subunit of the ELL complex. The ELL is an 80-kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of t
27756	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
27756	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
27756	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
27967	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
27979	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
27993	IMP4 family. This family includes IMP4, which is involved ribosomal RNA processing
28078	Somatotropin hormone family
28080	ATP synthase delta (OSCP) subunit. The ATP D subunit from E. coli is the same as the OSCP subunit which is this family. The ATP D subunit from metazoa are found in family pfam00401
28088	Uncharacterized protein family UPF0027
28105	B-box zinc finger
28185	TPR Domain
28193	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
28200	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
28231	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28231	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28232	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28232	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28234	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28234	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28240	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
28240	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
28240	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
28240	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
28240	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
28240	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
28248	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28248	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28250	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28250	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28253	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28253	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28253	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28253	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28254	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28254	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
28298	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
28316	Cadherin domain
28316	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
28511	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
28512	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
28513	Cadherin domain
28513	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
28514	Delta serrate ligand
28956	Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn
28966	PX domain. PX domains bind to phosphoinositides
28968	Sodium:neurotransmitter symporter family
28978	Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised
28983	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
28986	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
29065	Lipid-A-disaccharide synthetase. This is a family of lipid-A-disaccharide synthetases
29068	BTB/POZ domain
29072	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
29072	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
29074	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
29095	ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER
29097	Cornichon protein
29103	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
29108	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
29108	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
29108	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
29116	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
29117	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
29119	Vinculin family
29124	Galactoside-binding lectin
29127	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
29128	YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named
29128	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
29128	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
29130	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
29135	Trypsin
29139	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
29141	Galanin
29144	Calreticulin family
29146	Delta serrate ligand
29147	Delta serrate ligand
29148	Helix-loop-helix DNA-binding domain
29148	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates
29152	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
29153	Calpain family cysteine protease
29153	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
29154	Calpain family cysteine protease
29154	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
29155	Calpain family cysteine protease
29155	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
29157	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
29160	pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in biomineralization. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organization and m
29161	Caveolin
29162	Cadherin domain
29162	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
29163	Cadherin domain
29163	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
29165	Trypsin
29167	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
29168	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
29169	Somatomedin B domain
29169	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
29170	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
29172	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
29173	Casein
29179	Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown
29181	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
29182	Cadherin domain
29184	CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion
29185	Tetraspanin family
29186	Tetraspanin family
29188	Kappa casein. Kappa-casein is a mammalian milk protein involved in a number of important physiological processes. In the gut, the ingested protein is split into an insoluble peptide (para kappa-casein) and a soluble hydrophilic glycopeptide (caseinomacro
29190	Vertebrate endogenous opioids neuropeptide
29191	Neurokinin B
29192	Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans
29194	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
29197	Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18
29199	Class I Histocompatibility antigen, domains alpha 1 and 2
29200	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
29202	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
29203	Fork head domain
29205	Synaptogyrin. This family of proteins is distantly related to pfam01284
29209	Calsequestrin
29210	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
29211	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
29215	Arginase family
29216	Low-density lipoprotein receptor domain class A
29216	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
29219	Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid
29221	Arginase family
29223	Adenylate kinase
29224	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
29225	Carboxylesterase
29227	CTF/NF-I family
29228	CTF/NF-I family
29231	WD domain, G-beta repeat
29232	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
29233	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
29233	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate de
29234	7 transmembrane receptor (rhodopsin family)
29235	7 transmembrane receptor (rhodopsin family)
29236	Ribosomal protein S2
29237	Vertebrate endogenous opioids neuropeptide
29238	7 transmembrane receptor (rhodopsin family)
29241	Adenylate and Guanylate cyclase catalytic domain
29242	Eukaryotic-type carbonic anhydrase
29243	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
29246	Stathmin family
29248	Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom
29251	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
29251	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
29252	Trypsin
29253	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
29256	7 transmembrane receptor (rhodopsin family)
29257	Ribosomal protein L6
29258	Ribosomal protein S7e
29259	Lectin C-type domain. This family includes both long and short form C-type
29260	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
29261	Thymidylate synthase
29264	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
29264	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
29270	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
29272	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
29276	LIM domain. This family represents two copies of the LIM structural domain
29277	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
29278	LIM domain. This family represents two copies of the LIM structural domain
29279	Homeobox domain
29282	Ribosomal protein L14p/L23e
29283	Ribosomal L29e protein family
29286	Ribosomal S17
29287	Ribosomal protein S19e
29288	Ribosomal S3Ae family
29290	7 transmembrane receptor (rhodopsin family)
29292	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
29295	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
29296	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
29298	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
29301	Phenylalanine and histidine ammonia-lyase
29302	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
29304	Ribosomal protein S6e
29307	7 transmembrane receptor (rhodopsin family)
29308	ADP-ribosylation factor family
29310	7 transmembrane receptor (rhodopsin family)
29313	Lectin C-type domain. This family includes both long and short form C-type
29316	7 transmembrane receptor (rhodopsin family)
29317	LIM domain. This family represents two copies of the LIM structural domain
29318	Macrophage migration inhibitory factor (MIF)
29319	Eukaryotic cobalamin-binding protein
29321	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
29322	C2 domain
29322	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
29322	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
29323	Sodium:neurotransmitter symporter family
29324	F-actin capping protein alpha subunit
29331	Extracellular link domain
29332	Stathmin family
29334	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
29335	7 transmembrane receptor (rhodopsin family)
29337	SH2 domain
29337	PH domain. PH stands for pleckstrin homology
29337	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
29340	Protein kinase domain
29340	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
29340	Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans
29345	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
29347	Amidase
29348	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
29349	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
29352	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
29355	Hr1 repeat
29356	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
29357	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
29357	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
29358	7 transmembrane receptor (rhodopsin family)
29358	7TM chemoreceptor. This large family of proteins are related to pfam00001. They are 7 transmembrane receptors. This family does not include all known members, as there are problems with overlapping specificity with pfam00001. This family is greatly expand
29359	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
29361	HMG (high mobility group) box
29363	7 transmembrane receptor (rhodopsin family)
29366	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
29367	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
29368	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
29369	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
29371	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
29373	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
29374	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
29374	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
29375	Homeobox domain
29381	Protein kinase domain
29381	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with t
29383	Fumarylacetoacetate (FAA) hydrolase family. This family consists of fumarylacetoacetate (FAA) hydrolase, or fumarylacetoacetate hydrolase (FAH) and it also includes HHDD isomerase/OPET decarboxylase from E. coli strain W. FAA is the last enzyme in the ty
29385	7 transmembrane receptor (rhodopsin family)
29388	Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom
29389	Troponin. Troponin (Tn) contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). this Pfam contains members of the TnT subunit. Troponin is a complex of three proteins, Ca2+ binding (TnC), inhibitory (TnI), and tropom
29392	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
29394	GCM motif protein
29395	HMG (high mobility group) box
29403	Hepatic lectin, N-terminal domain
29403	Lectin C-type domain. This family includes both long and short form C-type
29410	Helix-loop-helix DNA-binding domain
29411	Palmitoyl protein thioesterase
29412	7 transmembrane receptor (rhodopsin family)
29413	7 transmembrane receptor (rhodopsin family)
29415	7 transmembrane receptor (rhodopsin family)
29421	Uroporphyrinogen decarboxylase (URO-D)
29422	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
29422	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
29423	IQ calmodulin-binding motif. Calmodulin-binding motif
29425	Proteasome A-type and B-type
29426	Ribosomal family S4e
29431	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
29432	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
29433	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
29434	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
29437	Actin
29438	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
29440	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
29443	S-adenosyl-L-homocysteine hydrolase
29445	pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium
29446	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
29449	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
29454	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
29458	Helix-loop-helix DNA-binding domain
29460	Protein kinase domain
29462	7 transmembrane receptor (rhodopsin family)
29464	Sodium:neurotransmitter symporter family
29465	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
29466	Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation
29468	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
29469	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
29471	7 transmembrane receptor (rhodopsin family)
29475	Iodothyronine deiodinase
29476	Cysteine rich repeat. This cysteine rich repeat contains four cysteines. It is found in multiple copies in a protein that binds to fibroblast growth factors. The repeat is also found in MG160 and E-selectin ligand (ESL-1)
29477	Tau and MAP protein, tubulin-binding repeat
29480	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
29482	Sodium:dicarboxylate symporter family
29483	Sodium:dicarboxylate symporter family
29489	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
29492	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
29492	Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved
29496	Furin-like cysteine rich region
29496	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
29497	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
29498	6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP.
29499	Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module
29499	Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation
29500	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f
29501	K-Cl Co-transporter type 1 (KCC1)
29502	Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor
29503	Sugar (and other) transporter
29504	Sugar (and other) transporter
29507	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
29509	Sugar (and other) transporter
29511	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
29512	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
29515	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
29516	3'5'-cyclic nucleotide phosphodiesterase
29520	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
29521	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cy
29524	LIM domain. This family represents two copies of the LIM structural domain
29525	Phosphatidylinositol transfer protein
29526	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
29527	Animal haem peroxidase
29527	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
29528	Synaptobrevin
29530	Protein kinase domain
29530	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
29533	Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is
29534	Zinc finger, C3HC4 type (RING finger)
29535	Homeobox domain
29539	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
29540	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
29541	HhH-GPD superfamily base excision DNA repair protein. This family contains a diverse range of structurally related DNA repair proteins. The superfamily is called the HhH-GPD family after its hallmark Helix-hairpin-helix and Gly/Pro rich loop followed by a
29542	Phosphatidylethanolamine-binding protein
29543	Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species
29544	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
29545	Ubiquitin carboxyl-terminal hydrolase, family 1
29546	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
29547	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
29548	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
29552	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
29553	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
29555	7 transmembrane receptor (Secretin family)
29556	Myosin head (motor domain)
29557	Myosin head (motor domain)
29558	Class I Histocompatibility antigen, domains alpha 1 and 2
29559	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
29563	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
29566	Gamma-glutamyltranspeptidase
29567	Helix-loop-helix DNA-binding domain
29569	CUB domain
29571	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
29577	Helix-loop-helix DNA-binding domain
29580	Squalene/phytoene synthase
29581	7 transmembrane receptor (rhodopsin family)
29584	Connexin
29585	Connexin
29586	Connexin
29591	Protein kinase domain
29591	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
29592	Aminotransferase class IV. The D-amino acid transferases (D-AAT) are required by bacteria to catalyse the synthesis of D-glutamic acid and D-alanine, which are essential constituents of bacterial cell wall and are the building block for other D-amino aci
29593	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
29593	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
29595	7 transmembrane receptor (rhodopsin family)
29597	7 transmembrane receptor (rhodopsin family)
29600	E1-E2 ATPase
29600	Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
29600	Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
29600	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
29601	Connexin
29601	Gap junction alpha-8 protein (Cx50)
29603	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
29603	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
29605	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
29606	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
29609	OAR domain
29609	Homeobox domain
29610	Zona pellucida-like domain
29611	Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins
29614	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b
29614	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin
29615	Fibronectin type III domain
29616	Protein-tyrosine phosphatase
29617	Fibronectin type III domain
29618	BTG1 family. A novel family of anti-proliferative proteins
29619	BTG1 family. A novel family of anti-proliferative proteins
29622	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
29623	UDP-glucoronosyl and UDP-glucosyl transferase
29623	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
29624	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
29627	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
29628	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
29629	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
29630	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex b
29630	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bin
29632	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
29633	Sulfotransferase protein
29635	Mitochondrial import inner membrane, translocase subunit TIM44. Tim44 is an essential component of the machinery that mediates the translocation of nuclear-encoded proteins across the mitochondrial inner membrane. Tim44 is thought to bind phospholipids o
29637	Hydroxymethylglutaryl-coenzyme A synthase
29639	ATP1G1/PLM/MAT8 family
29639	ATP1G1/PLM/MAT8 family
29641	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
29641	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
29642	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
29642	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
29643	Sugar (and other) transporter
29647	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
29647	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
29648	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
29651	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
29657	Helix-loop-helix DNA-binding domain
29659	ATP P2X receptor
29662	GDP dissociation inhibitor
29663	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
29663	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
29665	ATP P2X receptor
29667	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
29674	Proteasome A-type and B-type
29675	Proteasome A-type and B-type
29676	Proteasome A-type and B-type
29678	SART-1 family. This family of proteins appear to contain a leucine zipper and may therefore be a family of transcription factors
29680	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
29681	Mitochondrial glycoprotein. This mitochondrial matrix protein family contains members of the MAM33 family which bind to the globular 'heads' of C1Q. It is thought to be involved in mitochondrial oxidative phosphorylation and in nucleus-mitochondrion inte
29682	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
29682	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
29683	Lectin C-type domain. This family includes both long and short form C-type
29685	MCM2/3/5 family
29685	Sigma-54 interaction domain
29686	Immunoglobulin domain
29688	Histidine acid phosphatase
29689	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
29689	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
29691	3'5'-cyclic nucleotide phosphodiesterase
29694	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
29694	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
29695	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
29695	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
29699	Cyclic nucleotide-binding domain
29699	Cyclic nucleotide-binding domain
29700	Pterin 4 alpha carbinolamine dehydratase. Pterin 4 alpha carbinolamine dehydratase is also known as DCoH (dimerisation cofactor of hepatocyte nuclear factor 1-alpha)
29701	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
29707	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
29707	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
29708	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
29708	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
29710	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
29711	Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site
29711	Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases
29712	Inward rectifier potassium channel
29713	Inward rectifier potassium channel
29715	Calx-beta domain
29715	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
29716	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
29717	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
29718	Inward rectifier potassium channel
29719	Inward rectifier potassium channel
29721	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
29722	Phosphotyrosine interaction domain (PTB/PID)
29722	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
29723	Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog
29726	Sugar (and other) transporter
29731	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
29731	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
29733	7 transmembrane receptor (rhodopsin family)
29734	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
29740	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
29743	Mitochondrial carrier protein
29747	Calcium-activated potassium channel, beta subunit
29750	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
29751	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
29752	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
29753	14-3-3 protein
29754	ATP synthase subunit C
29757	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
29758	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
29760	Occludin/ELL family
29761	Ubiquitin carboxyl-terminal hydrolase family 2
29765	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
29766	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
29767	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
29780	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
29785	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
29799	Yippee putative zinc-binding protein
29800	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
29801	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
29807	Thiamin pyrophosphokinase, catalytic domain. Family of thiamin pyrophosphokinase (EC:2.7.6.2). Thiamin pyrophosphokinase (TPK) catalyzes the transfer of a pyrophosphate group from ATP to vitamin B1 (thiamin) to form the coenzyme thiamin pyrophosphate (TP
29809	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
29810	BAG domain. Domain present in Hsp70 regulators
29811	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
29812	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
29818	Hsp20/alpha crystallin family
29819	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
29842	Sterile alpha motif (SAM)/Pointed domain
29845	7 transmembrane receptor (rhodopsin family)
29845	7 transmembrane receptor (rhodopsin family)
29845	7 transmembrane receptor (rhodopsin family)
29846	7 transmembrane receptor (rhodopsin family)
29846	7 transmembrane receptor (rhodopsin family)
29847	7 transmembrane receptor (rhodopsin family)
29847	7 transmembrane receptor (rhodopsin family)
29849	7 transmembrane receptor (rhodopsin family)
29849	7 transmembrane receptor (rhodopsin family)
29849	7 transmembrane receptor (rhodopsin family)
29850	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
29856	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
29858	Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions
29858	Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions
29859	Sulfotransferase protein
29861	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
29861	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
29863	3'5'-cyclic nucleotide phosphodiesterase
29871	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
29877	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
29880	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
29881	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
29882	Cullin family
29884	Apoptosis regulator proteins, Bcl-2 family
29886	PX domain. PX domains bind to phosphoinositides
29887	PX domain. PX domains bind to phosphoinositides
29889	GTPase of unknown function
29890	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
29894	CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p
29901	SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1
29904	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a
29906	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
29907	MIT domain
29907	PX domain. PX domains bind to phosphoinositides
29907	PX domain. PX domains bind to phosphoinositides
29909	7 transmembrane receptor (rhodopsin family)
29911	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB
29916	PX domain. PX domains bind to phosphoinositides
29916	PX domain. PX domains bind to phosphoinositides
29920	Delta 1-pyrroline-5-carboxylate reductase
29922	Nucleoside diphosphate kinases
29928	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim
29929	ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum
29933	7 transmembrane receptor (rhodopsin family)
29934	PX domain. PX domains bind to phosphoinositides
29937	Steroid binding domain
29941	Hr1 repeat
29943	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
29952	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
29953	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
29956	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
29956	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
29956	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
29957	Mitochondrial carrier protein
29957	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
29958	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
29964	LIM domain. This family represents two copies of the LIM structural domain
29967	CUB domain
29978	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
29979	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
29979	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
29985	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
29986	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
29988	Sugar (and other) transporter
29989	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
29991	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
29994	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
29995	LIM domain. This family represents two copies of the LIM structural domain
30001	Endoplasmic Reticulum Oxidoreductin 1 (ERO1). Members of this family are required for the formation of disulphide bonds in the ER
30012	Homeobox domain
30044	7 transmembrane receptor (rhodopsin family)
30045	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
30053	Lectin C-type domain. This family includes both long and short form C-type
30055	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
30056	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
30057	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
30058	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
30059	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
30060	Transferrin
30062	OAR domain
30062	Homeobox domain
30794	LIM domain. This family represents two copies of the LIM structural domain
30794	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
30795	FKBP-type peptidyl-prolyl cis-trans isomerase
30800	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
30800	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
30805	Sugar (and other) transporter
30806	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
30806	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
30812	HMG (high mobility group) box
30813	Homeobox domain
30814	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with
30815	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
30816	ENV polyprotein (coat polyprotein)
30817	7 transmembrane receptor (Secretin family)
30817	7 transmembrane receptor (Secretin family)
30817	7 transmembrane receptor (Secretin family)
30817	7 transmembrane receptor (Secretin family)
30817	7 transmembrane receptor (Secretin family)
30817	7 transmembrane receptor (Secretin family)
30817	7 transmembrane receptor (Secretin family)
30827	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
30832	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
30837	SH2 domain
30841	F-box domain
30841	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
30843	F-box domain
30877	GTPase of unknown function
30923	Fork head domain
30924	Uncharacterized ACR, COG1579
30924	Fibrinogen beta and gamma chains, C-terminal globular domain
30928	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
30930	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp
30937	LIM domain. This family represents two copies of the LIM structural domain
30940	Ubiquitin carboxyl-terminal hydrolase family 2
30942	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
30942	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
30945	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
30945	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
30948	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
30951	'chromo' (CHRromatin Organization MOdifier) domain
30952	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
30955	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
30955	Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation
30956	Alanine dehydrogenase/pyridine nucleotide transhydrogenase. This family now also contains the lysine 2-oxoglutarate reductases
30956	Saccharopine dehydrogenase. This family comprised of three structural domains that can not be separated in the linear sequence. In some organisms this enzyme is found as a bifunctional polypeptide with lysine ketoglutarate reductase (PF). The saccharopin
30957	Intermediate filament protein
30960	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
30968	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
49854	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
49860	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
50484	Ribonucleotide reductase, small chain
50484	Ribonucleotide reductase, small chain
50485	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
50490	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
50496	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
50497	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
50500	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
50500	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
50506	Animal haem peroxidase
50506	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
50507	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
50508	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
50515	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
50522	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
50527	Endoplasmic Reticulum Oxidoreductin 1 (ERO1). Members of this family are required for the formation of disulphide bonds in the ER
50528	Trypsin
50529	Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes
50540	TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway
50545	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
50549	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
50554	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
50554	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
50555	UDP-glucoronosyl and UDP-glucosyl transferase
50556	Exocyst complex subunit Sec15-like
50560	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
50563	Connexin
50564	Connexin
50566	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb
50570	Ribosomal protein L5
50570	PH domain. PH stands for pleckstrin homology
50570	ribosomal L5P family C-terminus. This region is found associated with pfam00281
50572	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
50572	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
50577	7 transmembrane receptor (rhodopsin family)
50592	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
50594	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb
50599	Inward rectifier potassium channel
50604	Interleukin 10
50613	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
50614	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
50616	Interleukin 10
50617	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
50617	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
50617	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
50621	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic
50622	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic
50640	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of
50645	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
50646	Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion ta
50654	Papain family cysteine protease
50659	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
50659	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
50662	Runt domain
50664	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
50665	Thymosin beta-4 family
50669	C2 domain
50669	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
50671	Zinc-binding dehydrogenase
50671	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
50671	Thioesterase domain. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid th
50672	7 transmembrane receptor (rhodopsin family)
50672	7 transmembrane receptor (rhodopsin family)
50674	Helix-loop-helix DNA-binding domain
50676	Sodium:neurotransmitter symporter family
50677	Fibronectin type III domain
50678	3'5'-cyclic nucleotide phosphodiesterase
50681	Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA
50682	ab-hydrolase associated lipase region
50687	Fibronectin type III domain
50688	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
50690	Sodium:neurotransmitter symporter family
50694	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
50702	Sushi domain (SCR repeat)
50706	Fasciclin domain. This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria
50708	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
50709	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
50721	Sir2 family
50755	F-box domain
50755	UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near
50759	F-box domain
50760	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate
50762	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate
50765	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
50765	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
50766	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
50767	Cyclic nucleotide-binding domain
50776	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
50779	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
50779	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
50779	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
50779	GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins
50779	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
50780	C2 domain
50782	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
50782	GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins
50782	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
50784	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
50786	FlhB HrpN YscU SpaS Family
50789	F-box domain
50789	F-box domain
50791	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
50791	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
50791	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
50797	Coatomer WD associated domain
50797	WD domain, G-beta repeat
50797	Coatomer WD associated domain
50798	ROK family
50798	FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain
50798	UDP-N-acetylglucosamine 2-epimerase. This family consists of UDP-N-acetylglucosamine 2-epimerases EC:5.1.3.14 this enzyme catalyses the production of UDP-ManNAc from UDP-GlcNAc. Note that some of the enzymes is this family are bifunctional, in this instan
50799	Mitochondrial carrier protein
50801	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
50808	Adenylate kinase
50810	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
50814	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
50846	Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module
50846	Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation
50850	MIT domain
50850	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
50859	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
50861	Stathmin family
50868	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
50873	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
50873	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
50874	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
50875	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
50876	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
50878	Stromal antigen (SA/STAG) protein
50908	CUB domain
50908	Sushi domain (SCR repeat)
50909	CUB domain
50912	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi
50912	HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease
50912	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termin
50913	Helix-loop-helix DNA-binding domain
50914	Helix-loop-helix DNA-binding domain
50917	Sulfatase
50929	Interleukin 10
50932	Protein kinase domain
50932	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
50933	Ubiquitin carboxyl-terminal hydrolase, family 1
50933	Ubiquitin carboxyl-terminal hydrolase, family 1
50935	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
50937	Fibronectin type III domain
50938	Fibronectin type III domain
50939	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
50940	3'5'-cyclic nucleotide phosphodiesterase
50944	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
50964	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
50993	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
50995	Repeat in ubiquitin-activating (UBA) protein
50995	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
50997	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
50999	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
51001	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
51014	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
51015	Isochorismatase family. This family are hydrolase enzymes
51016	Uncharacterised protein family (UPF0172)
51025	Uncharacterised protein family (UPF0108)
51026	Got1-like family. Traffic through the yeast Golgi complex depends on a member of the syntaxin family of SNARE proteins, Sed5, present in early Golgi cisternae. Got1 is thought to facilitate Sed5-dependent fusion events
51028	Vacuolar protein sorting 36. Vps36 is involved in Golgi to endosome trafficking
51031	Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily
51032	Trypsin
51043	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
51046	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
51053	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either asso
51058	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
51061	Intermediate filament protein
51061	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
51062	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
51062	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
51064	2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways . This family also contains members with functions other than HC
51065	Ribosomal protein S27
51072	Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown
51074	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
51075	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
51082	RNA polymerases L / 13 to 16 kDa subunit
51084	3-hydroxyacyl-CoA dehydrogenase, NAD binding domain. This family also includes lambda crystallin
51085	Helix-loop-helix DNA-binding domain
51085	Helix-loop-helix DNA-binding domain
51087	'Cold-shock' DNA-binding domain
51094	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h
51109	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
51118	Utp11 protein. This protein is found to be part of a large ribonucleoprotein complex containing the U3 snoRNA. Depletion of the Utp proteins impedes production of the 18S rRNA, indicating that they are part of the active pre-rRNA processing complex. This
51119	Uncharacterized protein family UPF0023
51127	B-box zinc finger
51127	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
51130	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
51130	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
51133	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
51135	Protein kinase domain
51148	Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of va
51156	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
51157	Zinc finger
51160	VPS28 protein
51163	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte
51167	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
51170	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
51171	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
51176	HMG (high mobility group) box
51179	FMN-dependent dehydrogenase
51181	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
51182	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
51184	Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity
51196	C2 domain
51200	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
51205	Histidine acid phosphatase
51208	PMP-22/EMP/MP20/Claudin family
51209	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
51226	Clathrin adaptor complex small chain
51230	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
51231	Protein kinase domain
51232	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
51232	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
51236	Domain of unknown function (DUF383). domo keywords :chromosome c26f1.12c 41.3
51236	Domain of unknown function (DUF384). domo keywords :chromosome c26f1.12c 41.3
51240	ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER
51270	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
51276	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
51277	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
51281	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
51284	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
51289	7 transmembrane receptor (rhodopsin family)
51292	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
51296	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
51300	SURF4 family
51301	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
51302	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
51305	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
51308	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
51310	Sugar (and other) transporter
51310	Sugar (and other) transporter
51311	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
51311	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
51312	Mitochondrial carrier protein
51312	Mitochondrial carrier protein
51314	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
51317	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
51332	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
51338	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
51338	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
51340	HAT (Half-A-TPR) repeat. The HAT (Half A TPR) repeat is found in several RNA processing proteins
51341	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
51350	Intermediate filament protein
51360	Peptidase family M50
51363	Sulfotransferase protein
51363	Sulfotransferase protein
51364	MYND finger
51365	Lipase
51367	Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40
51375	PX domain. PX domains bind to phosphoinositides
51375	PX domain. PX domains bind to phosphoinositides
51377	Ubiquitin carboxyl-terminal hydrolase, family 1
51380	Pyridoxal-dependent decarboxylase conserved domain
51382	ATP synthase subunit D. This is a family of subunit D form various ATP synthases including V-type H+ transporting and Na+ dependent. Subunit D is suggested to be an integral part of the catalytic sector of the V-ATPase
51384	wnt family
51384	wnt family
51385	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
51398	Uncharacterised protein family (UPF0139)
51399	Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses
51402	HSF-type DNA-binding
51411	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
51411	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
51429	PX domain. PX domains bind to phosphoinositides
51438	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
51441	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment
51447	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
51449	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
51450	Homeobox domain
51451	Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.- . This family may need divides a the full alignment contains a significantly shorter mouse sequence
51451	Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.- . This family may need divides a the full alignment contains a significantly shorter mouse sequence
51454	Phosphotyrosine interaction domain (PTB/PID)
51454	Herpesvirus UL6 like. This family consists of various proteins from the herpesviridae that are similar to herpes simplex virus type I UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome
51455	impB/mucB/samB family. These proteins are involved in UV protection
51455	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
51458	Ammonium Transporter Family
51465	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
51465	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
51466	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
51471	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
51474	LIM domain. This family represents two copies of the LIM structural domain
51477	Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play
51478	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
51479	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
51479	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
51493	Uncharacterized protein family UPF0027
51496	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
51504	Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest
51522	Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised
51523	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
51554	7 transmembrane receptor (rhodopsin family)
51554	7 transmembrane receptor (rhodopsin family)
51559	haloacid dehalogenase-like hydrolase
51560	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
51567	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
51569	Uncharacterised protein family (UPF0185). This family contains a number of small uncharacterised proteins including BM-002
51573	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
51580	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
51582	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
51582	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
51588	pfam02891, zf-MIZ, MIZ zinc finger
51592	B-box zinc finger
51592	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
51592	B-box zinc finger
51592	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
51596	CutA1 divalent ion tolerance protein. Several gene loci with a possible involvement in cellular tolerance to copper have been identified. One such locus in eubacteria and archaebacteria, cutA, is thought to be involved in cellular tolerance to a wide var
51604	Gpi16 subunit, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to ne
51605	Eukaryotic initiation factor 3, gamma subunit. eIF-3 is a multisubunit complex that stimulates translation initiation in vitro at several different steps. This family corresponds to the gamma subunit if eIF3
51606	Adaptin N terminal region
51608	Protein of unknown function (DUF410). Family of conserved eukaryotic proteins with undetermined function
51616	Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tra
51629	Mitochondrial carrier protein
51631	Protein of unknown function, DUF259
51634	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
51646	Yippee putative zinc-binding protein
51652	Eukaryotic protein of unknown function, DUF279
51657	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
51659	Domain of unknown function (DUF392). A eukaryotic specific domain of undetermined function.`
51660	Uncharacterised protein family (UPF0041)
51661	FKBP-type peptidyl-prolyl cis-trans isomerase
51661	FKBP-type peptidyl-prolyl cis-trans isomerase
51665	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
51666	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
51666	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
51676	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
51678	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
51678	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
51686	Ornithine decarboxylase antizyme
51691	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
51700	Oxidoreductase FAD-binding domain
51702	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
51704	7 transmembrane receptor (metabotropic glutamate family)
51706	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
51710	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
51715	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
51716	Carboxylesterase
51719	Mo25 protein family
51727	Adenylate kinase
51733	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
51734	SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four deca
51741	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
51741	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
51741	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
51741	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
51742	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
51742	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
51747	Protein of unknown function, DUF259
51752	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
51760	C2 domain
51762	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
51773	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
51776	Protein kinase domain
51780	jmjC domain
51789	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t
51791	Raf-like Ras-binding domain
51792	HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats
51793	Amidinotransferase. This family contains glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2) amidinotransferases, enzymes involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, EC:3.5.3.6. These enzyme
51795	HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unc
51796	PWI domain
51799	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
51800	Apoptosis regulator proteins, Bcl-2 family
51802	Amiloride-sensitive sodium channel
51802	Amiloride-sensitive sodium channel
51807	Tubulin/FtsZ family
51809	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
51810	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
51811	Lectin C-type domain. This family includes both long and short form C-type
51812	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
51816	Adenosine/AMP deaminase
51816	Adenosine/AMP deaminase
51885	Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin
51960	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
51960	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
52024	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
52033	Protein kinase domain
52040	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
52064	ubiE/COQ5 methyltransferase family
52250	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
52323	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
52323	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
52331	Starch binding domain
52331	Starch binding domain
52389	7 transmembrane receptor (Secretin family)
52428	Hr1 repeat
52428	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
52443	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
52468	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
52530	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
52538	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
52552	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
52563	Anaphase promoting complex subunit 8 / cdc23. The anaphase-promoting complex is composed of eight protein subunits, including BimE (APC1), CDC27 (APC3), CDC16 (APC6), and CDC23 (APC8)
52575	Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue
52585	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
52588	Tetraspanin family
52609	'chromo' (CHRromatin Organization MOdifier) domain
52614	7 transmembrane receptor (Secretin family)
52614	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
52633	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
52635	C2 domain
52653	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
52679	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
52679	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
52685	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
52700	Thioredoxin
52822	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
52837	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
52850	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
52857	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
52858	CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond
52858	CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond
52906	WD domain, G-beta repeat
52906	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
52915	MIZ zinc finger
52915	pfam02891, zf-MIZ, MIZ zinc finger
53315	Sulfotransferase protein
53318	LIM domain. This family represents two copies of the LIM structural domain
53319	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
53319	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl
53320	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
53320	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
53321	Fibrinogen beta and gamma chains, C-terminal globular domain
53321	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
53328	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
53330	Synaptobrevin
53335	Zinc finger, C2H2 type
53339	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
53345	Extracellular link domain
53349	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
53349	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
53349	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
53349	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
53349	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
53353	Low-density lipoprotein receptor domain class A
53354	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
53354	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
53354	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
53357	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o
53358	Phosphotyrosine interaction domain (PTB/PID)
53373	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
53382	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
53412	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase,
53412	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
53413	Exo70 exocyst complex subunit. The Exo70 protein forms one subunit of the exocyst complex. First discovered in S. cerevisiae , Exo70 and other exocyst proteins have been observed in several other eukaryotes, including humans. In S. cerevisiae, the exocys
53418	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
53419	Low-density lipoprotein receptor domain class A
53419	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
53420	C2 domain
53424	Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen
53605	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
53606	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
53606	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
53607	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
53610	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
53615	Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD ha
53616	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
53616	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
53616	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
53616	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
53616	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
53616	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
53616	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
53616	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
53616	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
53616	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
53617	Intermediate filament protein
53620	Synaptobrevin
53622	Intermediate filament protein
53623	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
53624	PMP-22/EMP/MP20/Claudin family
53625	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
53627	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
53627	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
53627	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
53627	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
53630	Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc
53637	7 transmembrane receptor (rhodopsin family)
53791	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
53814	Ornithine decarboxylase antizyme
53817	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
53817	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
53817	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
53817	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
53822	ATP1G1/PLM/MAT8 family
53826	ATP1G1/PLM/MAT8 family
53827	ATP1G1/PLM/MAT8 family
53829	7 transmembrane receptor (rhodopsin family)
53829	7 transmembrane receptor (rhodopsin family)
53831	7 transmembrane receptor (rhodopsin family)
53832	Tissue factor
53836	7 transmembrane receptor (rhodopsin family)
53840	B-box zinc finger
53840	Zinc finger, C3HC4 type (RING finger)
53840	B-box zinc finger
53840	Zinc finger, C3HC4 type (RING finger)
53860	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
53861	Zn-finger in Ran binding protein and others
53867	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
53870	Fibronectin type III domain
53871	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
53881	Sodium:solute symporter family
53883	7 transmembrane receptor (Secretin family)
53883	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
53883	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
53885	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
53893	NUDIX domain
53901	RNA recognition motif. (a.k.a. RRM
53905	Animal haem peroxidase
53905	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
53905	Animal haem peroxidase
53905	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
53917	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
53918	eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known
53919	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
53919	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
53938	Cyclophilin type peptidyl-prolyl cis-trans isomerase
53940	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
53942	Fibronectin type III domain
53942	Fibronectin type III domain
53944	Protein kinase domain
53949	7 transmembrane receptor (rhodopsin family)
53950	Somatotropin hormone family
53970	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
53973	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
53978	7 transmembrane receptor (rhodopsin family)
53978	7 transmembrane receptor (rhodopsin family)
53981	Metallo-beta-lactamase superfamily
54003	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
54006	MYND finger
54006	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
54059	Uncharacterized protein family UPF0054
54084	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function,
54101	Protein kinase domain
54101	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
54106	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
54106	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
54112	7 transmembrane receptor (rhodopsin family)
54124	Cyclin-dependent kinase regulatory subunit
54125	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
54127	Ribosomal protein S28e
54128	Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions
54132	LIM domain. This family represents two copies of the LIM structural domain
54132	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
54133	LIM domain. This family represents two copies of the LIM structural domain
54133	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
54140	7 transmembrane receptor (rhodopsin family)
54145	Core histone H2A/H2B/H3/H4
54150	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
54151	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result
54152	Dynein light chain type 1
54159	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
54165	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
54169	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
54189	Rabaptin
54190	Rabaptin
54192	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
54193	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
54195	Adenylate and Guanylate cyclase catalytic domain
54199	7 transmembrane receptor (rhodopsin family)
54200	Sulfotransferase protein
54204	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
54207	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
54207	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
54207	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
54214	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
54216	Cadherin domain
54217	Ribosomal protein L36e
54218	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
54223	Adenylate and Guanylate cyclase catalytic domain
54226	Beta-amyloid peptide (beta-APP)
54226	Beta-amyloid peptide (beta-APP)
54230	BTG1 family. A novel family of anti-proliferative proteins
54231	Eukaryotic-type carbonic anhydrase
54232	Eukaryotic-type carbonic anhydrase
54233	Eukaryotic-type carbonic anhydrase
54234	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
54236	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
54239	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
54239	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
54240	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
54240	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
54241	Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain
54241	Clathrin propeller repeat. Clathrin is the scaffold protein of the basket-like coat that surrounds coated vesicles. The soluble assembly unit, a triskelion, contains three heavy chains and three light chains in an extended three-legged structure. Each le
54243	Sushi domain (SCR repeat)
54245	SH2 domain
54246	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
54248	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
54248	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
54249	Trypsin
54250	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
54256	Connexin
54257	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
54258	7 transmembrane receptor (rhodopsin family)
54260	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
54261	Calcium-activated SK potassium channel
54261	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
54262	Calcium-activated SK potassium channel
54262	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
54263	Calcium-activated SK potassium channel
54263	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
54264	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
54264	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
54267	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
54269	Trypsin
54274	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
54276	Helix-loop-helix DNA-binding domain
54278	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
54278	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
54279	Fibronectin type III domain
54280	Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins
54281	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
54282	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
54283	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
54283	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta
54284	Homeobox domain
54286	Protein kinase domain
54290	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
54292	Raf-like Ras-binding domain
54292	Phosphotyrosine interaction domain (PTB/PID)
54293	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
54295	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
54295	GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins
54295	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
54296	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
54296	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
54301	Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue
54302	Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue
54302	Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue
54304	Single-strand binding protein family
54305	7 transmembrane receptor (rhodopsin family)
54309	C2 domain
54311	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim
54314	Animal haem peroxidase
54315	Mitochondrial carrier protein
54319	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
54319	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
54321	Calponin family repeat
54322	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
54323	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
54324	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
54324	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
54325	GNS1/SUR4 family
54326	GNS1/SUR4 family
54328	7 transmembrane receptor (rhodopsin family)
54329	7 transmembrane receptor (rhodopsin family)
54338	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic
54345	HMG (high mobility group) box
54349	Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain
54350	Sec1 family
54354	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
54355	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
54357	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
54361	wnt family
54363	FMN-dependent dehydrogenase
54364	RNase P subunit p30. This protein is part of the RNase P complex that is involved in tRNA maturation
54366	Vinculin family
54368	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
54371	Sulfotransferase protein
54373	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
54375	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
54376	PMP-22/EMP/MP20/Claudin family
54377	PMP-22/EMP/MP20/Claudin family
54381	Peptidase family M28D
54381	Peptidase family M28D
54383	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
54387	SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1
54391	Riboflavin kinase / FAD synthetase. This family consists part of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activitys. They catalyse the 5'-pho
54392	Domain of unknown function (DUF315). Family of plant hypothetical proteins
54392	Chromosome condensation protein 3, C-terminal region. Cnd3 is a Member of the five subunit condensin complex. Each subunit is essential for mitotic condensation
54393	7 transmembrane receptor (metabotropic glutamate family)
54393	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
54397	Palmitoyl protein thioesterase
54398	Palmitoyl protein thioesterase
54401	14-3-3 protein
54403	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
54403	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
54407	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
54410	Somatomedin B domain
54410	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
54413	Carboxylesterase
54419	PMP-22/EMP/MP20/Claudin family
54420	PMP-22/EMP/MP20/Claudin family
54422	Homeobox domain
54426	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
54431	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
54434	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
54437	Thrombospondin type 1 domain
54437	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
54437	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
54439	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
54439	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
54441	Stromal antigen (SA/STAG) protein
54442	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
54453	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
54454	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
54454	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
54455	F-box domain
54455	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
54464	Putative 5'-3' exonuclease domain. This family aligns residues towards the N-terminus of several proteins with multiple functions. The members of this family all appear to possess 5'-3' exonuclease activity EC:3.1.11.-. Thus, the aligned region may be nec
54466	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members
54467	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
54467	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
54469	AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i
54471	Mab-21 protein
54474	Intermediate filament protein
54480	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
54483	B-box zinc finger
54483	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
54485	Delta serrate ligand
54486	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
54490	UDP-glucoronosyl and UDP-glucosyl transferase
54498	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
54498	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
54498	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
54498	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
54498	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
54504	Serine carboxypeptidase
54504	Serine carboxypeptidase
54510	Cadherin domain
54511	HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase
54513	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
54517	Uncharacterized protein family UPF0024
54519	PH domain. PH stands for pleckstrin homology
54519	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
54522	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
54524	C2 domain
54525	C2 domain
54526	C2 domain
54527	C2 domain
54531	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
54531	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL
54536	Exocyst complex subunit Sec15-like
54542	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
54549	Fibronectin type III domain
54550	Uncharacterized ACR, YneC family COG1359
54551	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
54552	GTPase of unknown function
54561	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
54563	Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins
54563	Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins
54566	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
54566	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
54567	Delta serrate ligand
54575	UDP-glucoronosyl and UDP-glucosyl transferase
54576	UDP-glucoronosyl and UDP-glucosyl transferase
54577	UDP-glucoronosyl and UDP-glucosyl transferase
54578	UDP-glucoronosyl and UDP-glucosyl transferase
54596	L1 transposable element
54598	7 transmembrane receptor (Secretin family)
54600	UDP-glucoronosyl and UDP-glucosyl transferase
54609	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
54610	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
54611	3'5'-cyclic nucleotide phosphodiesterase
54613	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
54616	Uncharacterized ACR, COG1579
54616	Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo
54617	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
54619	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termi
54620	F-box domain
54622	ADP-ribosylation factor family
54623	Paf1. Members of this family are components of the RNA polymerase II associated Paf1 complex. The Paf1 complex functions during the elongation phase of transcription in conjunction with Spt4-Spt5 and Spt16-Pob3i
54624	Paf1. Members of this family are components of the RNA polymerase II associated Paf1 complex. The Paf1 complex functions during the elongation phase of transcription in conjunction with Spt4-Spt5 and Spt16-Pob3i
54624	Paf1. Members of this family are components of the RNA polymerase II associated Paf1 complex. The Paf1 complex functions during the elongation phase of transcription in conjunction with Spt4-Spt5 and Spt16-Pob3i
54625	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
54625	Appr-1"-p processing enzyme family. This domain is found in a number of otherwise unrelated proteins. This domain is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA vi
54626	Helix-loop-helix DNA-binding domain
54630	LIM domain. This family represents two copies of the LIM structural domain
54631	Fibronectin type III domain
54632	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl
54633	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
54644	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
54646	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase,
54651	Ubiquitin carboxyl-terminal hydrolase family 2
54651	Ubiquitin carboxyl-terminal hydrolases family 2
54652	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
54652	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
54657	UDP-glucoronosyl and UDP-glucosyl transferase
54658	UDP-glucoronosyl and UDP-glucosyl transferase
54659	UDP-glucoronosyl and UDP-glucosyl transferase
54673	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
54674	Fibronectin type III domain
54674	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
54677	Choline/Carnitine o-acyltransferase
54704	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
54705	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
54707	Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity
54710	Sulfotransferase protein
54712	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
54712	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
54714	Cyclic nucleotide-binding domain
54714	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
54716	Sodium:neurotransmitter symporter family
54716	Sodium:neurotransmitter symporter family
54718	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
54725	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
54726	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
54729	Homeobox domain
54732	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
54734	ADP-ribosylation factor family
54734	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
54737	'chromo' (CHRromatin Organization MOdifier) domain
54737	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
54738	Ets-domain
54739	TRAF-type zinc finger
54741	Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protei
54751	LIM domain. This family represents two copies of the LIM structural domain
54751	LIM domain. This family represents two copies of the LIM structural domain
54752	Fibronectin type III domain
54760	Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic
54763	Regulatory subunit of type II PKA R-subunit
54765	B-box zinc finger
54766	BTG1 family. A novel family of anti-proliferative proteins
54795	Transient receptor
54798	Cadherin domain
54799	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino te
54800	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
54800	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
54807	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
54812	Presenilin
54813	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
54814	Glutaminyl cyclase. Glutaminyl cyclase catalyses the formation of the pyroglutamyl residue present at the amino terminus of numerous secretory peptides and proteins. Glutaminyl cyclase posses a zinc aminopeptidase domain in which the four functionally im
54822	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a
54822	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
54825	Cadherin domain
54826	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic d
54831	Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown
54840	HIT family
54840	HIT family
54840	HIT family
54840	HIT family
54840	HIT family
54843	C2 domain
54843	C2 domain
54850	F-box domain
54852	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h
54857	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
54860	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
54865	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
54865	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
54865	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
54869	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
54869	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
54873	Paralemmin
54877	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members invol
54878	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
54879	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
54879	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
54879	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
54879	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
54886	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
54896	PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How
54898	GNS1/SUR4 family
54899	PX domain. PX domains bind to phosphoinositides
54904	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
54904	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
54904	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
54905	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
54906	O-methyltransferase
54910	Sema domain
54910	Plexin repeat
54915	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment
54916	SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1
54920	Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrou
54922	DIL domain. The DIL domain has no known function
54931	Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue
54932	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi
54932	Protein of unknown function DUF82. This prokaryotic protein family has no known function. The protein contains four conserved cysteines that may be involved in metal binding or disulphide bridges
54933	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
54936	ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisi
54937	Helix-loop-helix DNA-binding domain
54946	Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples ha
54948	Ribosomal protein L16
54949	Domain of unknown function (DUF339). This family of small proteins are uncharacterised
54952	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
54957	Mitosis protein DIM1
54961	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
54961	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
54977	Mitochondrial carrier protein
54993	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
54993	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
54993	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
54996	MOSC domain. The MOSC (MOCO sulfurase C-terminal) domain is a superfamily of beta-strand-rich domains identified in the molybdenum cofactor sulfurase and several other proteins from both prokaryotes and eukaryotes. These MOSC domains contain an absolutel
55032	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
55034	MOSC N-terminal beta barrel domain. This domain is found to the N-terminus of pfam03473. The function of this domain is unknown, however it is predicted to adopt a beta barrel fold
55036	Ezrin/radixin/moesin family
55061	Sushi domain (SCR repeat)
55063	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
55066	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
55074	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
55089	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
55095	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
55103	PH domain. PH stands for pleckstrin homology
55105	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
55108	BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function
55109	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
55110	Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the
55111	PH domain. PH stands for pleckstrin homology
55117	Sodium:neurotransmitter symporter family
55117	Sodium:neurotransmitter symporter family
55119	PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is abou
55124	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
55124	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
55128	B-box zinc finger
55130	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
55130	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
55148	Putative zinc finger in N-recognin
55154	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacteria
55156	Armadillo/beta-catenin-like repeat
55161	Uncharacterised protein family (UPF0121). Uncharacterised integral membrane protein family
55175	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
55175	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
55186	Mitochondrial carrier protein
55187	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t
55190	NUDIX domain
55193	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
55193	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
55198	PH domain. PH stands for pleckstrin homology
55203	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function,
55203	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
55207	ADP-ribosylation factor family
55208	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
55210	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
55217	Gamma-butyrobetaine hydroxylase. Members of this family are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1
55218	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi
55219	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
55224	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
55229	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
55230	Ubiquitin carboxyl-terminal hydrolase family 2
55231	Microtubule associated protein (MAP65/ASE1 family)
55233	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
55236	Repeat in ubiquitin-activating (UBA) protein
55236	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
55237	Transposase. Transposase proteins are necessary for efficient DNA transposition. This family consists of various E. coli insertion elements and other bacterial transposases some of which are members of the IS3 family
55238	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
55245	Ubiquinol-cytochrome C chaperone
55251	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
55253	Flavodoxin
55256	ARD/ARD' family. The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+. ARD and ARD' can be
55274	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
55274	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
55275	Vps53-like, N-terminal domain. Vps53 complexes with Vps52 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events
55284	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
55289	Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle
55291	Dehydrogenase E1 component
55293	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also membe
55295	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
55295	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
55300	Phosphatidylinositol 3- and 4-kinase
55303	GTPase of unknown function
55312	Riboflavin kinase / FAD synthetase. This family consists part of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activitys. They catalyse the 5'-pho
55315	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their
55327	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
55334	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
55341	GTPase of unknown function
55342	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
55344	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
55349	GMC oxidoreductase. This family of proteins bind FAD as a cofactor
55350	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
55350	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1
55353	Golgi 4-transmembrane spanning transporter
55356	Sugar (and other) transporter
55362	Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length
55364	Uncharacterized protein family UPF0029
55366	7 transmembrane receptor (rhodopsin family)
55367	Death domain
55367	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
55367	Death domain
55367	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
55367	Death domain
55367	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
55370	HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats
55422	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
55486	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
55500	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
55501	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
55502	Helix-loop-helix DNA-binding domain
55503	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
55503	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
55505	Nucleolar RNA-binding protein, Nop10p family. Nop10p is a nucleolar protein that is specifically associated with H/ACA snoRNAs. It is essential for normal 18S rRNA production and rRNA pseudouridylation by the ribonucleoprotein particles containing H/ACA
55506	Core histone H2A/H2B/H3/H4
55506	Appr-1"-p processing enzyme family
55507	7 transmembrane receptor (metabotropic glutamate family)
55512	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
55515	Amiloride-sensitive sodium channel
55515	Amiloride-sensitive sodium channel
55521	B-box zinc finger
55530	Sugar (and other) transporter
55553	HMG (high mobility group) box
55558	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
55558	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
55567	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
55575	MSP (Major sperm protein) domain
55582	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
55584	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
55584	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
55585	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
55593	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
55593	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
55605	Kinesin motor domain
55605	WD domain, G-beta repeat
55605	Intermediate filament protein
55605	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
55605	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
55605	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
55614	Kinesin motor domain
55614	Kinesin motor domain
55614	PX domain. PX domains bind to phosphoinositides
55616	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
55621	N2,N2-dimethylguanosine tRNA methyltransferase. This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA. The TRM1 gene of Saccharomyces cerevisiae is necessary for the N2,N2-dimethylguanosine modification of both mitochondrial and cytoplasmic tRNAs. The
55623	pfam02926, THUMP, THUMP domain. The THUMP domain is named after after thiouridine synthases, methylases and PSUSs. The THUMP domain consists of about 110 amino acid residues. It is predicted that this domain is an RNA-binding domain that adopts an alpha/
55624	GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis o
55630	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
55630	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
55636	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
55636	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
55636	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
55643	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
55647	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
55650	Protein of unknown function (DUF409). Family of eukaryotic membrane proteins with unknown function
55651	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
55659	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
55662	jmjC domain
55663	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
55664	Cdc37 family. In the budding yeast Saccharomyces cerevisiae, Cdc37 is required for the productive formation of Cdc28-cyclin complexes. Cdc37 may be a kinase targeting subunit of Hsp90
55667	Sulfotransferase proteins
55679	LIM domain. This family represents two copies of the LIM structural domain
55680	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
55680	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
55681	Protein kinase domain
55684	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
55687	tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs
55691	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
55692	Protein of unknown function, DUF259
55693	jmjC domain
55698	DIL domain. The DIL domain has no known function
55698	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
55709	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
55709	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
55711	Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included
55722	Intermediate filament proteins
55727	BTB/POZ domain. The BTB (for BR-C
55728	Smr domain. This family includes the Smr (Small MutS Related) proteins, and the C-terminal region of the MutS2 protein. It has been suggested that this domain interacts with the MutS1 protein in the case of Smr proteins and with the N-terminal MutS relat
55733	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
55737	Vacuolar protein sorting-associated protein 35. Vacuolar protein sorting-associated protein (Vps) 35 is one of around 50 proteins involved in protein trafficking. In particular, Vps35 assembles into a retromer complex with at least four other proteins Vp
55738	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
55738	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
55741	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
55742	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
55743	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
55745	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
55746	Nup133 nucleoporin. RNA undergoing nuclear export first encounters the basket of the nuclear pore. Nup133 is a nucleoporin accessible on the basket side of the pore
55749	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
55751	Domain of unknown function
55752	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
55754	LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in resp
55757	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl
55758	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
55762	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
55768	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
55770	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
55775	Alpha amylase
55783	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl
55786	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
55790	Galactosyltransferase
55794	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
55799	Cache domain
55801	Interleukin 10
55803	PH domain. PH stands for pleckstrin homology
55803	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
55806	jmjC domain
55806	jmjC domain
55808	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
55818	jmjC domain
55819	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
55821	Allantoicase repeat. This family is found in pairs in Allantoicases, forming the majority of the protein. These proteins allow the use of purines as secondary nitrogen sources in nitrogen-limiting conditions through the reaction: allantoate + H(2)0 = (-)
55823	Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain
55825	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
55830	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl
55830	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl
55832	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
55833	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t
55833	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t
55833	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t
55846	FG-GAP repeat
55849	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain conta
55867	Sugar (and other) transporter
55869	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
55870	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
55870	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
55870	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
55871	Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans
55872	Protein kinase domain
55879	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
55879	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
55885	LIM domain. This family represents two copies of the LIM structural domain
55888	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
55890	7 transmembrane receptor (metabotropic glutamate family)
55890	7 transmembrane receptor (metabotropic glutamate family)
55892	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
55897	Helix-loop-helix DNA-binding domain
55904	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
55904	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
55907	Cytidylyltransferase. This family consists of two main Cytidylyltransferase activities: 1) 3-deoxy-manno-octulosonate cytidylyltransferase,, EC:2.7.7.38 catalysing the reaction:- CTP + 3-deoxy-D-manno-octulosonate <=> diphosphate + CMP-3-deoxy-D-manno-oc
55927	Helix-loop-helix DNA-binding domain
55930	DIL domain. The DIL domain has no known function
55930	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
55930	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
55932	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
55932	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
55938	Lipocalin / cytosolic fatty-acid binding protein family
55939	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
55946	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
55948	14-3-3 protein
55951	Uncharacterised protein family (UPF0041)
55954	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
55958	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
55959	Sulfatase
55959	Sulfatase
55961	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
55963	Sodium:dicarboxylate symporter family
55964	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
55964	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
55968	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
55968	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
55968	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
55972	Mitochondrial carrier protein
55974	MtN3/saliva family. This family includes proteins such as drosophila saliva, MtN3 involved in root nodule development and a protein involved in activation and expression of recombination activation genes (RAGs). Although the molecular function of these p
55975	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
55975	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
55979	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
55980	Inositol monophosphatase family
55981	Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability
55982	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
55983	TRAF-type zinc finger
55987	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
55988	PX domain. PX domains bind to phosphoinositides
55990	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
55992	B-box zinc finger
55992	Filamin/ABP280 repeat
55992	Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine
55992	NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in at least one member, proteolysis has shown that the Peptid
55992	NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in some members, proteolysis has shown that the Peptidyl-alph
55994	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
55994	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
56000	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
56001	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
56001	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl
56001	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
56001	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl
56003	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
56010	14-3-3 protein
56011	14-3-3 protein
56014	7 transmembrane receptor (rhodopsin family)
56015	7 transmembrane receptor (rhodopsin family)
56017	Sugar (and other) transporter
56031	Cyclophilin type peptidyl-prolyl cis-trans isomerase
56032	Uncharacterised protein family (UPF0171)
56033	Homeobox domain
56036	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
56036	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
56043	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
56043	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
56045	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
56046	Ubiquinol-cytochrome C chaperone
56050	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
56052	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari
56057	BTG1 family. A novel family of anti-proliferative proteins
56062	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
56062	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
56066	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
56068	Protein of unknown function DUF51. This family contains members in prokaryotes and eukaryotes. None of the members has a known function
56070	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
56077	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
56077	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
56078	Eukaryotic-type carbonic anhydrase
56078	Eukaryotic-type carbonic anhydrase
56083	Transglutaminase family
56083	Transglutaminase family, C-terminal ig like domain
56083	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
56084	VHS domain. Domain present in VPS-27, Hrs and STAM
56085	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
56086	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
56096	Zona pellucida-like domain
56109	Cadherin domain
56109	Cadherin domain
56110	Cadherin domain
56110	Cadherin domain
56111	Cadherin domain
56111	Cadherin domain
56112	Cadherin domain
56112	Cadherin domain
56113	Cadherin domain
56113	Cadherin domain
56129	Cadherin domain
56137	Cadherin domain
56137	Cadherin domain
56139	Cadherin domain
56142	Cadherin domain
56144	Cadherin domain
56144	Cadherin domain
56147	Cadherin domain
56150	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
56160	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
56163	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
56164	Protein kinase domain
56164	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
56164	Protein kinase domain
56164	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
56165	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
56171	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
56173	PMP-22/EMP/MP20/Claudin family
56175	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
56180	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
56183	Neuromedin U
56184	Helix-loop-helix DNA-binding domain
56185	FMN-dependent dehydrogenase
56188	ATP1G1/PLM/MAT8 family
56189	Proline dehydrogenase
56191	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
56191	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
56193	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
56195	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
56196	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
56198	Helix-loop-helix DNA-binding domain
56203	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
56207	Ubiquitin carboxyl-terminal hydrolase, family 1
56208	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either asso
56209	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
56210	impB/mucB/samB family. These proteins are involved in UV protection
56210	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
56214	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
56215	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
56216	Syntaxin
56218	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
56218	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
56219	Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo
56220	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
56224	Tetraspanin family
56228	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
56248	Adenylate kinase
56252	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
56257	O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine
56258	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
56261	Starch binding domain
56261	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
56264	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
56265	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
56266	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
56275	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
56284	Ribosomal protein L19
56287	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
56288	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
56289	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
56293	Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region
56294	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
56294	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
56296	DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai
56297	ADP-ribosylation factor family
56299	TPR Domain
56299	TPR Domain
56299	TPR Domain
56302	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
56305	Phosphatidylinositol transfer protein
56307	metallopeptidase family M24
56314	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
56315	Ammonium Transporter Family
56318	Histidine acid phosphatase
56320	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
56322	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim
56324	VHS domain. Domain present in VPS-27, Hrs and STAM
56327	ADP-ribosylation factor family
56330	Protein of unknown function DUF122. This protein family has no known function
56334	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
56336	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
56342	Second step splicing factor 1. SSF is thought to bind rRNA and similarity with peter pan in Drosophila suggests a role in cell proliferation
56350	ADP-ribosylation factor family
56358	Clathrin adaptor complex small chain
56360	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C
56362	Sulfotransferase protein
56365	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
56365	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate de
56369	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
56373	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
56375	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
56376	LIM domain. This family represents two copies of the LIM structural domain
56378	P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function
56379	Inward rectifier potassium channel
56380	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
56382	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
56386	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
56388	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
56388	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
56389	Syntaxin
56401	V-type ATPase 116kDa subunit family
56405	Protein-tyrosine phosphatase
56405	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
56413	7 transmembrane receptor (rhodopsin family)
56417	Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this r
56421	Phosphofructokinase
56424	TPR Domain
56428	Mitochondrial carrier protein
56430	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
56431	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
56434	Tetraspanin family
56440	PX domain. PX domains bind to phosphoinositides
56440	Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2
56441	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
56442	TMS membrane protein/tumour differentially expressed protein (TDE)
56447	Clathrin adaptor complex small chain
56448	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
56453	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
56455	Dynein light chain type 1
56457	tRNA pseudouridine synthase
56459	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
56462	Mitochondrial carrier protein
56463	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
56469	pfam02891, zf-MIZ, MIZ zinc finger
56469	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
56470	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
56471	Stathmin family
56473	Fatty acid desaturase
56473	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
56479	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
56479	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
56485	Sugar (and other) transporter
56486	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
56490	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
56490	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
56491	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
56492	PMP-22/EMP/MP20/Claudin family
56495	Anion-transporting ATPase. This Pfam family represents a conserved domain, which is sometimes repeated, in an anion-transporting ATPase. The ATPase is involved in the removal of arsenate, antimonite, and arsenate from the cell
56496	Tetraspanin family
56508	Cyclic nucleotide-binding domain
56508	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
56517	Sugar (and other) transporter
56521	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
56524	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
56525	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
56526	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
56529	Signal peptidase I
56531	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
56532	Protein kinase domain
56533	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
56534	Hsp20/alpha crystallin family
56543	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
56543	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
56544	7 transmembrane receptor (metabotropic glutamate family)
56544	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
56546	Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes
56547	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
56551	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
56552	7 transmembrane receptor (metabotropic glutamate family)
56552	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
56554	Class I Histocompatibility antigen, domains alpha 1 and 2
56603	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
56605	Endoplasmic Reticulum Oxidoreductin 1 (ERO1). Members of this family are required for the formation of disulphide bonds in the ER
56606	Sugar (and other) transporter
56611	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
56612	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
56615	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
56619	Lectin C-type domain. This family includes both long and short form C-type
56622	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
56622	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
56626	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
56628	Class I Histocompatibility antigen, domains alpha 1 and 2
56629	Deoxyribonuclease II
56635	Somatotropin hormone family
56640	Trypsin
56643	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
56646	Galactoside-binding lectin
56648	Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold
56650	PMP-22/EMP/MP20/Claudin family
56656	7 transmembrane receptor (rhodopsin family)
56659	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
56666	Innexin. This family includes the drosophila proteins Ogre and shaking-B, and the C. elegans proteins Unc-7 and Unc-9. Members of this family are integral membrane proteins which are involved in the formation of gap junctions. This family has been named
56667	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
56670	7 transmembrane receptor (rhodopsin family)
56676	Helix-loop-helix DNA-binding domain
56695	Metallo-beta-lactamase superfamily
56696	7 transmembrane receptor (rhodopsin family)
56702	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
56704	MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000
56708	Ciliary neurotrophic factor
56715	Vacuolar sorting protein 9 (VPS9) domain
56717	Phosphatidylinositol 3- and 4-kinase
56717	FAT domain. The FAT domain is named after FRAP, ATM and TRRAP
56717	FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal
56717	Phosphatidylinositol 3- and 4-kinase
56717	FAT domain. The FAT domain is named after FRAP, ATM and TRRAP
56717	FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal
56720	Tryptophan 2,3-dioxygenase
56720	Tryptophan 2,3-dioxygenase
56734	Tub family
56735	Intermediate filament protein
56736	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
56737	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari
56738	Radical SAM superfamily
56738	moaA / nifB / pqqE family
56739	Uncharacterized ACR, COG1354
56741	Fibronectin type III domain
56741	Fibronectin type III domain
56741	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
56741	Fibronectin type III domain
56741	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
56744	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
56745	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
56745	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
56747	CUB domain
56747	Sushi domain (SCR repeat)
56749	Dihydroorotate dehydrogenase
56751	Homeobox domain
56752	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
56753	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
56761	Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved
56766	Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protei
56773	Sulfotransferase protein
56774	Sodium:neurotransmitter symporter family
56777	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
56777	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
56780	Histidine acid phosphatase
56783	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
56784	Rap/ran-GAP
56785	Rap/ran-GAP
56787	Helix-loop-helix DNA-binding domain
56791	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
56795	ADP-ribosylation factor family
56805	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
56806	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
56806	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
56807	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
56808	Cache domain
56809	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
56813	7 transmembrane receptor (Secretin family)
56814	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
56816	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
56816	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
56817	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
56819	Uncharacterized ACR, COG1579
56819	Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mo
56821	7 transmembrane receptor (rhodopsin family)
56825	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
56826	PH domain. PH stands for pleckstrin homology
56826	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
56827	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
56832	Interferon alpha/beta domain
56839	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function,
56839	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
56843	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
56843	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
56843	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
56847	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
56856	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
56858	7 transmembrane receptor (rhodopsin family)
56859	7 transmembrane receptor (rhodopsin family)
56860	7 transmembrane receptor (rhodopsin family)
56861	7 transmembrane receptor (rhodopsin family)
56863	PMP-22/EMP/MP20/Claudin family
56874	IQ calmodulin-binding motif. Calmodulin-binding motif
56889	Endomembrane protein 70
56891	Galactoside-binding lectin
56893	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
56898	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
56899	Phosphotyrosine interaction domain (PTB/PID)
56899	Phosphotyrosine interaction domain (PTB/PID)
56911	ATP synthase B/B' CF(0)
56913	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
56917	Homeobox domain
56918	Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog
56923	7 transmembrane receptor (rhodopsin family)
56924	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
56928	Pox virus Ag35 surface protein
56928	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
56928	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
56934	Eukaryotic-type carbonic anhydrase
56938	Helix-loop-helix DNA-binding domain
56946	ENT domain. This presumed domain is named after Emsy N Terminus (ENT). Emsy is a protein that is amplified in breast cancer and interacts with BRCA2. The N terminus of this protein is found to be similar to other vertebrate and plant proteins of unknown
56950	MYND finger
56954	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
56956	Homeobox domain
56956	LIM domain. This family represents two copies of the LIM structural domain
56956	Homeobox domain
56956	LIM domain. This family represents two copies of the LIM structural domain
56959	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
56959	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
56959	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
56961	SH2 domain
56986	DTW domain. This presumed domain is found in bacterial and eukaryotic proteins. Its function is unknown. The domain contains multiple conserved motifs including a DTXW motif that this domain has been named after
56987	HMG (high mobility group) box
56995	Tub family
56999	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
56999	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
56999	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
56999	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
56999	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
56999	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
57006	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
57007	7 transmembrane receptor (rhodopsin family)
57007	7 transmembrane receptor (rhodopsin family)
57014	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
57014	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
57016	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
57020	D-Ala-D-Ala carboxypeptidase 3 (S13) family
57021	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
57021	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
57023	Disintegrin
57023	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
57023	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
57025	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
57025	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
57027	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
57029	Disintegrin
57029	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
57037	MYND finger
57047	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve
57048	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve
57050	Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown
57053	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
57053	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
57057	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
57060	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
57080	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
57080	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
57088	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve
57089	GDA1/CD39 (nucleoside phosphatase) family
57094	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
57104	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o
57104	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of
57105	7 transmembrane receptor (rhodopsin family)
57113	Transient receptor
57116	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
57117	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
57119	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structur
57120	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
57121	7 transmembrane receptor (rhodopsin family)
57122	Nuclear pore protein 84 / 107. Nup84p forms a complex with five proteins, of which Nup120p, Nup85p, Sec13p, and a Sec13p homolog. This Nup84p complex in conjunction with Sec13-type proteins is required for correct nuclear pore biogenesis
57127	Ammonium Transporter Family
57132	Eukaryotic protein of unknown function, DUF279
57134	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
57136	Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine
57139	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
57139	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
57142	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
57142	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
57142	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
57147	Protein kinase domain
57152	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
57154	C2 domain
57154	C2 domain
57154	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
57154	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
57156	Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length
57165	Connexin
57167	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
57170	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
57171	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
57178	pfam02891, zf-MIZ, MIZ zinc finger
57181	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
57182	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
57186	Rap/ran-GAP
57188	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
57191	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
57209	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
57211	7 transmembrane receptor (Secretin family)
57211	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
57218	Phage maturation protein
57231	PX domain. PX domains bind to phosphoinositides
57231	PX domain. PX domains bind to phosphoinositides
57232	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
57233	LIM domain. This family represents two copies of the LIM structural domain
57248	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
57249	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
57249	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
57250	7 transmembrane receptor (rhodopsin family)
57251	7 transmembrane receptor (rhodopsin family)
57252	7 transmembrane receptor (rhodopsin family)
57255	PMP-22/EMP/MP20/Claudin family
57257	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
57257	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
57259	BTG1 family. A novel family of anti-proliferative proteins
57260	7 transmembrane receptor (rhodopsin family)
57260	7 transmembrane receptor (rhodopsin family)
57265	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
57266	Small cytokines (intecrine/chemokine)
57267	Phosphotyrosine interaction domain (PTB/PID)
57267	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
57269	7 transmembrane receptor (rhodopsin family)
57269	7 transmembrane receptor (rhodopsin family)
57270	7 transmembrane receptor (rhodopsin family)
57270	7 transmembrane receptor (rhodopsin family)
57271	7 transmembrane receptor (rhodopsin family)
57271	7 transmembrane receptor (rhodopsin family)
57272	7 transmembrane receptor (rhodopsin family)
57274	Monocarboxylate transporter
57277	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
57279	Mitochondrial carrier protein
57282	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
57294	Ribosomal protein S27
57295	Isoprenylcysteine carboxyl methyltransferase (ICMT) family. The isoprenylcysteine o-methyltransferase (EC:2.1.1.100) family carry out carboyxl methylation of cleaved eukaryotic proteins that terminate in a CaaX motif. In Saccharomyces cerevisiae this meth
57298	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
57301	7 transmembrane receptor (rhodopsin family)
57302	7 transmembrane receptor (rhodopsin family)
57305	7 transmembrane receptor (rhodopsin family)
57315	WD domain, G-beta repeat
57315	WD domain, G-beta repeat
57320	DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117
57321	Phosphoenolpyruvate carboxykinase
57325	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
57325	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
57332	'chromo' (CHRromatin Organization MOdifier) domain
57335	Zinc finger
57335	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
57337	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
57338	MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000
57339	MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000
57340	MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000
57341	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
57342	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
57349	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
57357	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
57369	Connexin
57370	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
57376	HMG (high mobility group) box
57377	Mannosyl oligosaccharide glucosidase. This is a family of eukaryotic enzymes belonging to glycosyl hydrolase family 63. They catalyse the specific cleavage of the non-reducing terminal glucose residue from Glc(3)Man(9)GlcNAc(2). Mannosyl oligosaccharide
57385	7 transmembrane receptor (rhodopsin family)
57402	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
57403	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
57404	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
57404	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
57409	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA
57414	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
57418	WD domain, G-beta repeat
57419	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
57419	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells
57426	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
57429	Sulfotransferase protein
57430	Sulfotransferase protein
57436	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
57442	Slow voltage-gated potassium channel
57443	F-box domain
57446	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
57446	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
57447	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
57448	Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat
57448	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
57449	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
57451	Plant PEC family metallothionein
57451	Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up int
57452	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
57453	Fibronectin type III domain
57460	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
57466	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
57467	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
57474	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
57475	Intermediate filament protein
57475	PH domain. PH stands for pleckstrin homology
57475	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
57477	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
57478	Ubiquitin carboxyl-terminal hydrolase family 2
57480	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
57484	Zinc finger, C3HC4 type (RING finger)
57488	C2 domain
57492	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
57492	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
57497	Fibronectin type III domain
57502	Carboxylesterase
57502	Carboxylesterase
57504	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
57504	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL
57504	Early E1A protein. This is a family of adenovirus early E1A proteins. The E1A protein is 32 kDa it can however be cleaved to yield the 28 kDa protein. The E1A protein is responsible for the transcriptional activation of the early genes with in the viral g
57504	GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do
57505	tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases
57512	7 transmembrane receptor (metabotropic glutamate family)
57512	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
57512	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
57512	Dolichyl-phosphate-mannose-protein mannosyltransferase. This is a family of Dolichyl-phosphate-mannose-protein mannosyltransferase proteins EC:2.4.1.109. These proteins are responsible for O-linked glycosylation of proteins, they catalyse the reaction:- D
57513	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
57514	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
57515	TMS membrane protein/tumour differentially expressed protein (TDE)
57516	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
57519	START domain
57520	C2 domain
57520	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
57520	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
57522	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
57522	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
57522	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
57524	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
57526	Cadherin domain
57528	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
57529	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
57530	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
57531	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
57531	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
57534	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
57534	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
57536	Integral membrane protein DUF106. This archaebacterial protein family has no known function. Members are predicted to be integral membrane proteins
57537	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
57538	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a
57540	AcrB/AcrD/AcrF family. Members of this family are integral membrane proteins. Some are involved in drug resistance
57540	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
57542	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
57542	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
57544	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
57546	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
57549	Fibronectin type III domain
57551	Protein kinase domain
57551	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
57552	Carboxylesterase
57553	LIM domain. This family represents two copies of the LIM structural domain
57553	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
57555	Carboxylesterase
57555	Carboxylesterase
57558	Ubiquitin carboxyl-terminal hydrolases family 2
57559	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has
57563	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
57565	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
57565	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
57568	Rap/ran-GAP
57568	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
57569	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
57570	Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat
57573	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
57576	Kinesin motor domain
57586	C2 domain
57591	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
57593	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
57604	ubiE/COQ5 methyltransferase family
57605	Phosphatidylinositol transfer protein
57605	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester
57609	AMP-binding enzyme
57610	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
57611	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
57612	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
57612	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
57621	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
57622	Fibronectin type III domain
57622	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
57622	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
57626	BTB/POZ domain. The BTB (for BR-C
57626	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Kelch is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Sev
57628	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
57629	Doublecortin
57630	Zinc finger, C3HC4 type (RING finger)
57630	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
57634	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
57636	PH domain. PH stands for pleckstrin homology
57636	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
57642	Fibronectin type III domain
57642	von Willebrand factor type A domain
57642	Thrombospondin N-terminal -like domain
57645	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
57646	Ubiquitin carboxyl-terminal hydrolase family 2
57649	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
57649	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
57655	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
57657	Cyclic nucleotide-binding domain
57657	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
57659	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
57659	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
57661	Zinc finger, C3HC4 type (RING finger)
57661	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
57663	Ubiquitin carboxyl-terminal hydrolase family 2
57665	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
57669	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
57671	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
57671	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
57674	Zinc finger, C3HC4 type (RING finger)
57677	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
57679	Vacuolar sorting protein 9 (VPS9) domain
57680	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
57680	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
57684	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
57684	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
57685	Cache domain
57685	Cache domain
57689	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
57690	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t
57690	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
57690	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
57692	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
57695	Ubiquitin carboxyl-terminal hydrolase family 2
57695	Ubiquitin carboxyl-terminal hydrolases family 2
57696	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
57697	ERCC4 domain. This domain is predicted to be a nuclease domain
57697	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
57699	C2 domain
57703	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
57703	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
57705	Beige/BEACH domain
57705	WD domain, G-beta repeat
57706	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
57706	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
57706	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
57708	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
57709	Amino acid permease
57713	Sterile alpha motif (SAM)/Pointed domain
57713	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
57713	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
57722	Fibronectin type III domain
57727	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
57729	Protein kinase domain
57729	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
57731	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
57732	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
57733	Glycosyl hydrolase family 1
57737	Homeobox domain
57738	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
57741	Uncharacterised protein family (UPF0120)
57746	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
57746	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
57749	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
57749	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
57756	LIM domain. This family represents two copies of the LIM structural domain
57764	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
57775	Ubiquitin carboxyl-terminal hydrolase family 2
57775	Ubiquitin carboxyl-terminal hydrolase family 2
57775	Ubiquitin carboxyl-terminal hydrolases family 2
57780	ATP1G1/PLM/MAT8 family
57782	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
57794	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
57801	Helix-loop-helix DNA-binding domain
57804	DNA polymerase delta, subunit 4
57808	Ribosomal protein L35Ae
57809	Ribosomal protein L35Ae
57810	Fibronectin type III domain
57811	7 transmembrane receptor (rhodopsin family)
57813	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
57815	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
57815	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
57815	Parvovirus non-structural protein NS1. This family also contains the NS2 protein. Parvoviruses encode two non-structural proteins, NS1 and NS2. The mRNA for NS2 contains the coding sequence for the first 87 amino acids of NS1, then by an alternative splic
57819	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
57829	Zona pellucida-like domain
57834	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
57835	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
57835	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
57835	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
57835	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
57876	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
57908	Type IV secretion system CagX conjugation protein
57913	Death domain
58155	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
58157	Globin
58158	Helix-loop-helix DNA-binding domain
58168	7 transmembrane receptor (rhodopsin family)
58170	Amiloride-sensitive sodium channel
58175	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
58176	Ammonium Transporter Family
58177	Mitochondrial carrier protein
58178	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
58180	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
58180	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
58182	7 transmembrane receptor (rhodopsin family)
58184	Cell differentiation family, Rcd1-like. Two of the members in this family have been characterised as being involved in regulation of Ste11 regulated sex genes
58185	Radical SAM superfamily
58187	PMP-22/EMP/MP20/Claudin family
58187	PMP-22/EMP/MP20/Claudin family
58190	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
58190	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
58193	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
58198	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
58198	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
58206	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
58208	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
58222	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
58223	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
58226	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
58226	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
58227	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
58227	Rifin/stevor family. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmem
58230	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
58240	PX domain. PX domains bind to phosphoinositides
58242	NUDIX domain
58242	NUDIX domain
58243	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
58250	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
58475	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
58477	ADP-ribosylation factor family
58484	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
58485	Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses
58489	Dienelactone hydrolase family
58492	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
58492	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
58497	DHHA2 domain. This domain is often found adjacent to the DHH domain pfam01368 and is called DHHA2 for DHH associated domain. This domain is diagnostic of DHH subfamily 2 members. The domain is about 120 residues long and contains a conserved DXK motif at
58500	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
58508	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
58508	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
58508	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
58508	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
58510	Proline dehydrogenase
58511	Deoxyribonuclease II
58511	Deoxyribonuclease II
58512	Guanylate-kinase-associated protein (GKAP) protein
58517	PWI domain
58517	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
58520	Uncharacterised protein family (UPF0143). This family of uncharacterised proteins are integral membrane proteins. They may contain 4 transmembrane helices. The family contains a conserved arginine and histidine that may be functionally important
58522	B-box zinc finger
58524	DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown
58533	PX domain. PX domains bind to phosphoinositides
58538	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
58800	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a
58800	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
58802	Calcium-activated potassium channel, beta subunit
58802	Calcium-activated potassium channel, beta subunit
58803	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
58805	Helix-loop-helix DNA-binding domain
58807	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
58807	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
58808	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
58810	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
58813	pfam02872, 5_nucleotidaseC, 5'-nucleotidase, C-terminal domain
58813	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte
58814	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
58817	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24
58820	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
58822	Protein kinase domain
58845	TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway
58851	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
58851	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
58852	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
58852	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
58853	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
58853	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
58854	Proteasome A-type and B-type
58857	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
58860	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
58860	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
58861	7 transmembrane receptor (rhodopsin family)
58866	Trehalase
58867	Synaptogyrin. This family of proteins is distantly related to pfam01284
58868	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
58868	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
58868	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are
58911	Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown
58918	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
58919	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
58920	Glypican
58922	Homeobox domain
58923	B-box zinc finger
58923	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
58924	Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes
58927	Ribosomal protein L36e
58934	MYND finger
58934	Programmed cell death protein 2, C-terminal domain
58935	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carbox
58936	Protein kinase domain
58936	POLO box duplicated region
58937	Somatotropin hormone family
58945	Dynein light chain type 1
58946	Fibronectin type III domain
58947	Trypsin
58949	7 transmembrane receptor (rhodopsin family)
58952	Peptidase family M28D
58953	Sulfotransferase protein
58958	Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins
58959	7 transmembrane receptor (Secretin family)
58961	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
58962	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
58963	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
58963	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
58964	Nucleoside diphosphate kinase
58968	Extracellular link domain
58970	Fatty acid desaturase
58971	ATP1G1/PLM/MAT8 family
58974	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
58978	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
58978	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
58980	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
58980	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
58982	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
58985	Tissue factor
58992	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
58992	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
58994	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
59004	pfam02891, zf-MIZ, MIZ zinc finger
59007	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
59009	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
59012	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
59012	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
59013	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
59025	Ubiquitin carboxyl-terminal hydrolase family 2
59028	RNA 3'-terminal phosphate cyclase
59030	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
59031	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
59033	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
59044	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
59049	Sugar (and other) transporter
59053	Domain of unknown function (DUF383). domo keywords :chromosome c26f1.12c 41.3
59053	Domain of unknown function (DUF384). domo keywords :chromosome c26f1.12c 41.3
59058	Helix-loop-helix DNA-binding domain
59069	Tropomyosin
59075	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
59076	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
59077	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
59082	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
59083	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
59084	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
59086	Transforming growth factor beta like domain
59086	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
59087	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
59087	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl
59088	Somatotropin hormone family
59095	ATP1G1/PLM/MAT8 family
59103	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
59106	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
59107	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
59108	Globin
59113	Monooxygenase. This family includes diverse enzymes that utilise FAD
59116	Acyl CoA binding protein
59117	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
59117	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
59264	7 transmembrane receptor (rhodopsin family)
59265	PH domain. PH stands for pleckstrin homology
59266	Cyclic nucleotide-binding domain
59266	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
59268	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
59269	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
59271	Galactose binding lectin domain
59272	Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid
59277	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
59283	PMP-22/EMP/MP20/Claudin family
59284	PMP-22/EMP/MP20/Claudin family
59286	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
59289	7 transmembrane receptor (rhodopsin family)
59293	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
59296	Deoxyribonuclease II
59300	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
59303	Uncharacterised protein family (UPF0121). Uncharacterised integral membrane protein family
59305	Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes,
59307	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
59312	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
59317	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
59317	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
59323	Furin-like cysteine rich region
59323	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
59324	TASK K+ channel
59326	Cyclic nucleotide-binding domain
59326	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
59327	Helix-loop-helix DNA-binding domain
59328	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
59328	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
59340	7 transmembrane receptor (rhodopsin family)
59341	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
59341	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
59341	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
59341	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
59342	Serine carboxypeptidase
59344	Lipoxygenase
59344	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
59347	Translationally controlled tumor protein
59349	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
59350	7 transmembrane receptor (rhodopsin family)
60312	PH domain. PH stands for pleckstrin homology
60314	Uncharacterised protein family (UPF0160). This family of proteins contains a large number of metal binding residues. The patterns are suggestive of a phosphoesterase function. The conserved DHH motif may mean this family is related to pfam01368
60315	Uncharacterised protein family (UPF0160). This family of proteins contains a large number of metal binding residues. The patterns are suggestive of a phosphoesterase function. The conserved DHH motif may mean this family is related to pfam01368
60326	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
60326	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
60327	Transglutaminase family
60327	Transglutaminase family, C-terminal ig like domain
60327	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
60328	Kinase associated domain 1
60329	Helix-loop-helix DNA-binding domain
60331	Josephin
60335	Transglutaminase family
60335	Transglutaminase family, C-terminal ig like domain
60335	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
60338	ATP1G1/PLM/MAT8 family
60349	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
60349	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
60351	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
60351	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
60353	Peridinin-chlorophyll A binding protein
60355	Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist o
60356	Pyridoxal-dependent decarboxylase conserved domain
60357	Dolichyl-phosphate-mannose-protein mannosyltransferase
60361	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
60363	PMP-22/EMP/MP20/Claudin family
60367	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
60371	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
60372	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
60374	Intermediate filament protein
60374	Intermediate filament tail domain
60377	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
60379	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
60381	Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen
60383	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
60384	Coatomer WD associated domain
60386	Mitochondrial carrier protein
60390	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
60393	Sulfotransferase protein
60394	Helix-loop-helix DNA-binding domain
60395	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
60398	HIT family
60399	7 transmembrane receptor (rhodopsin family)
60409	Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses
60412	Sec8 exocyst complex component specific domain
60414	Ammonium Transporter Family
60418	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
60423	Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin
60425	C2 domain
60427	Stem cell factor. Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. The crystal structure of human SCF has been resolved and a potential receptor-binding site i
60430	Apoptosis regulator proteins, Bcl-2 family
60431	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
60432	7 transmembrane receptor (Secretin family)
60432	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
60433	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
60434	Apoptosis regulator proteins, Bcl-2 family
60437	Cadherin domain
60441	Phosphatidylethanolamine-binding protein
60442	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
60444	Calcium-activated BK potassium channel alpha subunit
60444	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
60446	Homeobox domain
60448	7 transmembrane receptor (rhodopsin family)
60450	SH2 domain
60451	7 transmembrane receptor (rhodopsin family)
60460	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
60463	7 transmembrane receptor (rhodopsin family)
60464	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
60465	Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018
60468	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
60481	GNS1/SUR4 family
60484	Extracellular link domain
60485	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
60495	Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular
60510	C2 domain
60527	Fatty acid desaturase
60527	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
60527	Fatty acid desaturase
60529	OAR domain
60529	Homeobox domain
60560	Mak10 subunit, NatC N(alpha)-terminal acetyltransferase. NatC N(alpha)-terminal acetyltransferases contains Mak10p, Mak31p and Mak3p subunits. All three subunits are associated with each other to form the active complex
60563	jmjC domain
60564	C2 domain
60565	C2 domain
60566	C2 domain
60567	C2 domain
60568	C2 domain
60575	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
60576	Trehalase
60577	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
60580	HIT family
60581	Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction
60584	LIF / OSM family
60586	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
60587	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
60589	Protein kinase domain
60589	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
60590	7 transmembrane receptor (metabotropic glutamate family)
60590	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
60591	Calcium-activated potassium channel, beta subunit
60595	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
60597	Phosphotyrosine interaction domain (PTB/PID)
60598	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
60613	KCNQ1 voltage-gated potassium channel
60613	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
60613	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
60628	7 transmembrane receptor (rhodopsin family)
60661	Helix-loop-helix DNA-binding domain
60662	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
60663	Furin-like cysteine rich region
60663	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
60664	SH2 domain
60665	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
60666	NAD-dependent glycerol-3-phosphate dehydrogenase
60667	7 transmembrane receptor (rhodopsin family)
60668	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
60669	7 transmembrane receptor (rhodopsin family)
60670	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
60671	FAD binding domain. This family consists of various enzymes that use FAD as a co-factor, most of the enzymes are similar to oxygen oxidoreductase. One of the enzymes Vanillyl-alcohol oxidase (VAO) has a solved structure, the alignment includes the FAD bi
60681	FKBP-type peptidyl-prolyl cis-trans isomerase
60682	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
60685	AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i
63827	Extracellular link domain
63827	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
63836	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
63839	LIM domain. This family represents two copies of the LIM structural domain
63843	7 transmembrane receptor (rhodopsin family)
63847	ATP1G1/PLM/MAT8 family
63848	ATP1G1/PLM/MAT8 family
63849	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
63849	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
63849	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
63852	Phosphotriesterase family
63857	Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc
63859	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
63864	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
63865	Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this
63866	Amiloride-sensitive sodium channel
63867	UDP-glucoronosyl and UDP-glucosyl transferase
63873	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
63873	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
63878	Stanniocalcin family
63882	Amiloride-sensitive sodium channel
63883	Slow voltage-gated potassium channel
63885	3'5'-cyclic nucleotide phosphodiesterase
63889	7 transmembrane receptor (rhodopsin family)
63897	Adaptin N terminal region. This family consists of the N terminal region of various alpha
63904	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
63917	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
63923	Fibronectin type III domain
63923	Fibrinogen beta and gamma chains, C-terminal globular domain
63931	Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes
63935	Hexon-associated protein (IIIa)
63938	3-hydroxyacyl-CoA dehydrogenase, NAD binding domain. This family also includes lambda crystallin
63938	Prephenate dehydrogenase. Members of this family are prephenate dehydrogenases EC:1.3.1.12 involved in tyrosine biosynthesis
63938	NAD binding domain of 6-phosphogluconate dehydrogenase. The NAD binding domain of 6-phosphogluconate dehydrogenase adopts a Rossman fold
63938	TrkA-N domain. This domain is found in a wide variety of proteins. These protein include potassium channels, phosphoesterases, and various other transporters. This domain binds to NAD
63941	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ
63943	TPR Domain
63950	DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown
63951	DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown
63951	DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai
63953	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
63954	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
63954	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
63954	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
63959	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their
63973	Helix-loop-helix DNA-binding domain
63973	Helix-loop-helix DNA-binding domain
63974	Helix-loop-helix DNA-binding domain
63976	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
63985	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
63986	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
63995	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
63997	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their
64001	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
64005	Myosin head (motor domain)
64009	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
64010	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
64013	Phosducin
64017	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
64017	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
64020	7 transmembrane receptor (rhodopsin family)
64021	7 transmembrane receptor (rhodopsin family)
64023	CUB domain
64028	Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen
64032	Thrombospondin type 1 domain
64035	Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses
64036	Sushi domain (SCR repeat)
64036	Sushi domain (SCR repeat)
64037	Sugar (and other) transporter
64040	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
64042	7 transmembrane receptor (rhodopsin family)
64043	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
64044	Death effector domain
64045	Glutaredoxin
64046	Mer11 DNA-binding domain. The Mre11 complex is a multisubunit nuclease that is composed of Mre11, Rad50 and Nbs1/Xrs2, and is involved in checkpoint signalling and DNA replication. Mre11 has an intrinsic DNA-binding activity that is stimulated by Rad50 o
64050	YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity
64051	Sugar (and other) transporter
64055	Guanylin precursor
64057	Extracellular link domain
64057	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
64060	Sec1 family
64061	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
64066	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
64066	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
64066	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
64072	Cadherin domain
64072	Cadherin domain
64074	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
64075	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
64076	Sulfate transporter family. Mutations may lead to several human diseases
64078	Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin
64081	Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family appears to be distantly related to pfam01678, including containing a weak internal
64086	Protein kinase domain
64088	PX domain. PX domains bind to phosphoinositides
64089	PX domain. PX domains bind to phosphoinositides
64089	PX domain. PX domains bind to phosphoinositides
64089	PX domain. PX domains bind to phosphoinositides
64093	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
64094	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
64095	7 transmembrane receptor (rhodopsin family)
64096	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
64096	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
64096	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
64097	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
64098	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
64099	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
64102	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
64103	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
64104	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
64106	7 transmembrane receptor (rhodopsin family)
64107	7 transmembrane receptor (rhodopsin family)
64110	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
64114	Uncharacterized protein family UPF0005
64116	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
64118	Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrou
64119	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
64120	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
64124	7 transmembrane receptor (Secretin family)
64129	Papain family cysteine protease
64130	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
64131	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
64132	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
64133	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
64134	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
64138	Papain family cysteine protease
64144	YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity
64145	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
64147	Kinesin motor domain
64147	Kinesin motor domain
64147	Kinesin motor domain
64151	Chromosome condensation protein 3, C-terminal region. Cnd3 is a Member of the five subunit condensin complex. Each subunit is essential for mitotic condensation
64155	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
64156	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
64156	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
64157	Amidinotransferase. This family contains glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2) amidinotransferases, enzymes involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, EC:3.5.3.6. These enzyme
64159	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
64159	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
64167	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
64174	Renal dipeptidase
64176	Sugar (and other) transporter
64177	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
64180	Renal dipeptidase
64185	CAP protein
64187	ADP-ribosylation factor family
64188	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
64189	Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellu
64190	ATP1G1/PLM/MAT8 family
64191	Ergosterol biosynthesis ERG4/ERG24 family
64195	Hepatic lectin, N-terminal domain
64195	Lectin C-type domain. This family includes both long and short form C-type
64198	Insulinase (Peptidase family M16)
64201	Mitochondrial carrier protein
64202	Calreticulin family
64203	Aminotransferase class IV. The D-amino acid transferases (D-AAT) are required by bacteria to catalyse the synthesis of D-glutamic acid and D-alanine, which are essential constituents of bacterial cell wall and are the building block for other D-amino aci
64204	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
64204	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
64205	Ribosomal protein L10
64206	Tryptophan 2,3-dioxygenase
64211	Homeobox domain
64211	LIM domain. This family represents two copies of the LIM structural domain
64213	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
64218	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
64225	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
64231	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
64231	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
64231	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
64232	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
64236	LIM domain. This family represents two copies of the LIM structural domain
64242	Globin
64282	PAP/25A associated domain
64284	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
64285	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
64290	Fork head domain
64291	Oxysterol-binding protein
64292	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
64293	Protein kinase domain
64294	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
64297	7 transmembrane receptor (metabotropic glutamate family)
64298	Ribosomal protein L31e
64300	Formate--tetrahydrofolate ligase
64302	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
64305	Sulfotransferase protein
64306	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
64307	Ribosomal protein L24e
64314	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
64315	Tetraspanin family
64316	HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unc
64317	Glutathione peroxidase
64321	HMG (high mobility group) box
64324	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
64324	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
64324	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
64324	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
64324	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
64324	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
64333	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
64340	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
64340	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
64342	PX domain. PX domains bind to phosphoinositides
64342	PX domain. PX domains bind to phosphoinositides
64343	Intermediate filament protein
64347	Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid
64348	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
64349	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
64350	von Willebrand factor type A domain
64352	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
64353	LIM domain. This family represents two copies of the LIM structural domain
64354	7 transmembrane receptor (rhodopsin family)
64360	Ribosomal protein L23
64361	Somatotropin hormone family
64362	Intermediate filament protein
64363	Protein kinase domain
64363	Raf-like Ras-binding domain
64363	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
64365	Eukaryotic cobalamin-binding protein
64367	Cyclophilin type peptidyl-prolyl cis-trans isomerase
64368	Somatotropin hormone family
64369	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
64370	metallopeptidase family M24
64372	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
64374	Eukaryotic aspartyl protease
64377	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
64378	7 transmembrane receptor (rhodopsin family)
64380	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
64381	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
64382	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
64385	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
64386	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
64386	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
64386	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
64386	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
64386	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
64386	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
64388	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb
64392	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
64395	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
64396	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
64403	Cadherin domain
64403	Cadherin domain
64403	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
64403	Cadherin domain
64403	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
64405	Cadherin domain
64405	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
64409	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
64410	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
64411	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
64412	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
64418	Glucose inhibited division protein A
64420	Sushi domain (SCR repeat)
64422	Autophagocytosis associated protein, C-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole
64422	Autophagocytosis associated protein, N-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole
64424	Uncharacterized BCR, YceG family COG1559
64428	pfam02906, Fe_hyd_lg_C, Iron only hydrogenase large subunit, C-terminal domain
64434	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
64434	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
64440	C2 domain
64441	HSF-type DNA-binding
64442	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
64443	7 transmembrane receptor (rhodopsin family)
64444	LIM domain. This family represents two copies of the LIM structural domain
64445	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
64450	7 transmembrane receptor (rhodopsin family)
64451	AMP-binding enzyme
64451	AMP-binding enzyme
64452	Sodium:solute symporter family
64452	Sodium:solute symporter family
64452	Sodium:solute symporter family
64454	Sodium:solute symporter family
64454	Sodium:solute symporter family
64454	Sodium:solute symporter family
64455	ADP-ribosylation factor family
64455	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
64456	Reeler domain
64456	Thrombospondin type 1 domain
64457	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
64459	Trypsin
64459	CUB domain
64459	Sushi domain (SCR repeat)
64461	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
64462	Protein kinase domain
64463	SURF1 family
64464	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
64467	GDA1/CD39 (nucleoside phosphatase) family
64468	Hyaluronidase
64468	Hyaluronidase
64470	CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p
64472	Mak10 subunit, NatC N(alpha)-terminal acetyltransferase. NatC N(alpha)-terminal acetyltransferases contains Mak10p, Mak31p and Mak3p subunits. All three subunits are associated with each other to form the active complex
64473	Inositol monophosphatase family
64473	Inositol monophosphatase family
64478	CUB domain
64478	Sushi domain (SCR repeat)
64478	Sushi domain (SCR repeat)
64508	Adenylate and Guanylate cyclase catalytic domain
64512	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
64513	V-type ATPase 116kDa subunit family
64514	Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac
64516	Sec1 family
64518	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
64519	GDA1/CD39 (nucleoside phosphatase) family
64520	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
64521	Tetraspanin family
64522	Sodium:solute symporter family
64529	Papain family cysteine protease
64532	Adenylate and Guanylate cyclase catalytic domain
64534	Protein kinase domain
64537	Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular
64540	Tetraspanin family
64543	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
64543	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
64544	Glycosyltransferase family 43
64551	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
64553	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
64555	Adenylate kinase
64557	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
64558	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
64560	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
64563	Fibronectin type III domain
64564	Gamma-butyrobetaine hydroxylase. Members of this family are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1
64566	wnt family
64567	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
64567	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
64567	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
64570	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
64571	Homeobox domain
64572	Homeobox domain
64573	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
64574	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
64574	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
64576	Fibronectin type III domain
64577	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
64577	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
64579	Sulfotransferase protein
64580	Sulfotransferase protein
64582	7 transmembrane receptor (rhodopsin family)
64582	7 transmembrane receptor (rhodopsin family)
64591	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
64598	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
64619	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
64620	Golgi 4-transmembrane spanning transporter
64621	Alkaline phosphatase
64623	Prenylated rab acceptor (PRA1)
64624	Cullin family
64626	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
64632	Exo70 exocyst complex subunit. The Exo70 protein forms one subunit of the exocyst complex. First discovered in S. cerevisiae , Exo70 and other exocyst proteins have been observed in several other eukaryotes, including humans. In S. cerevisiae, the exocys
64633	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
64634	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
64635	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
64636	7 transmembrane receptor (rhodopsin family)
64637	Helix-loop-helix DNA-binding domain
64638	Ribosomal L28e protein family
64641	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
64652	PX domain. PX domains bind to phosphoinositides
64657	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
64659	Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes
64664	ADP-ribosylation factor family
64665	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
64668	Lectin C-type domain. This family includes both long and short form C-type
64670	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
64672	Phospholamban. The regulation of calcium levels across the membrane of the sarcoplasmic reticulum involves the interplay of many membrane proteins. Phospholamban is a 52 residue integral membrane protein that is involved in reversibly inhibiting the Ca(2
64677	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
64678	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
64678	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
64679	Transglutaminase family
64679	Transglutaminase family, C-terminal ig like domain
64679	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
64680	7 transmembrane receptor (Secretin family)
64681	Major Vault Protein repeat. The vault is a ubiquitous and highly conserved ribonucleoprotein particle of approximately 13 mDa of unknown function. This family corresponds to a repeat found in the amino terminal half of the major vault protein
64702	B-box zinc finger
64707	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
64714	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
64718	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
64744	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
64746	Acyl CoA binding protein
64748	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
64754	MYND finger
64772	Glycosyl hydrolase family 85. Family of endo-beta-N-acetylglucosaminidases. These enzymes work on a broad spectrum of substrates
64778	Fibronectin type III domain
64780	LIM domain. This family represents two copies of the LIM structural domain
64780	FAD binding domain. This domain is involved in FAD binding in a number of enzymes
64780	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
64794	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
64801	Arv1-like family. Arv1 is a transmembrane protein with potential zinc-binding motifs. ARV1 is a novel mediator of eukaryotic sterol homeostasis
64803	7 transmembrane receptor (rhodopsin family)
64805	7 transmembrane receptor (rhodopsin family)
64805	7 transmembrane receptor (rhodopsin family)
64816	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
64816	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
64816	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
64817	Sushi domain (SCR repeat)
64817	von Willebrand factor type A domain
64817	Pentaxin family. Pentaxins are also known as pentraxins
64817	HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unc
64822	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
64823	Protein kinase domain
64828	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
64829	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
64830	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
64833	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C
64834	GNS1/SUR4 family
64840	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
64841	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
64843	LIM domain. This family represents two copies of the LIM structural domain
64846	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
64848	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment
64849	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
64855	PH domain
64855	Acetaldehyde dehydrogenase
64856	Fibronectin type III domain
64857	PH domain. PH stands for pleckstrin homology
64857	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
64862	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
64865	Cadherin domain
64881	Cadherin domain
64918	Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13,
64919	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
64919	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
64922	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
64926	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
64928	Ribosomal protein L14p/L23e
64931	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
64931	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
64933	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
64940	Stromal antigen (SA/STAG) protein
64960	Ribosomal protein S15
64965	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
64978	Phosphatidylethanolamine-binding protein
65008	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
65009	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
65009	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
65010	Sulfate transporter family. Mutations may lead to several human diseases
65010	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
65010	Sulfate transporter family. Mutations may lead to several human diseases
65010	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
65010	Sulfate transporter family. Mutations may lead to several human diseases
65010	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
65012	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
65018	Protein kinase domain
65020	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
65020	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
65022	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
65024	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
65024	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
65029	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
65029	Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
65029	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog
65030	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
65031	C2 domain
65032	7 transmembrane receptor (rhodopsin family)
65035	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
65035	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
65036	Carboxylesterase
65037	Fibronectin type III domain
65039	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
65040	Guanylate-kinase-associated protein (GKAP) protein
65042	Tricarboxylate carrier
65043	Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14
65044	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
65045	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
65046	Clathrin adaptor complex small chain
65047	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
65048	Cadherin domain
65048	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
65050	Homeobox domain
65051	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
65052	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
65055	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
65081	Sec1 family
65082	Sec1 family
65083	Nrap protein. Members of this family are nucleolar RNA-associated proteins (Nrap) which are highly conserved from yeast (Saccharomyces cerevisiae) to human. In the mouse, Nrap is ubiquitously expressed and is specifically localized in the nucleolus. Nrap
65083	Nrap protein. Members of this family are nucleolar RNA-associated proteins (Nrap) which are highly conserved from yeast (Saccharomyces cerevisiae) to human. In the mouse, Nrap is ubiquitously expressed and is specifically localized in the nucleolus. Nrap
65083	Nrap protein. Members of this family are nucleolar RNA-associated proteins (Nrap) which are highly conserved from yeast (Saccharomyces cerevisiae) to human. In the mouse, Nrap is ubiquitously expressed and is specifically localized in the nucleolus. Nrap
65086	7 transmembrane receptor (rhodopsin family)
65094	jmjC domain
65096	Galactose binding lectin domain
65096	7 transmembrane receptor (Secretin family)
65096	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
65096	Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein cou
65098	AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i
65106	Prenylated rab acceptor (PRA1)
65109	Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3
65109	Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3
65110	Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3
65110	Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3
65114	Vacuolar protein sorting-associated protein 35. Vacuolar protein sorting-associated protein (Vps) 35 is one of around 50 proteins involved in protein trafficking. In particular, Vps35 assembles into a retromer complex with at least four other proteins Vp
65116	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy
65129	PMP-22/EMP/MP20/Claudin family
65130	PMP-22/EMP/MP20/Claudin family
65131	PMP-22/EMP/MP20/Claudin family
65132	PMP-22/EMP/MP20/Claudin family
65134	Syntaxin
65136	Ribosomal protein S8e
65139	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
65140	7 transmembrane receptor (rhodopsin family)
65142	ADP-ribosylation factor family
65144	Tuberin
65151	Endoribonuclease L-PSP. Endoribonuclease active on single-stranded mRNA. Inhibits protein synthesis by cleavage of mRNA. Previously thought to inhibit protein synthesis initiation. This protein may also be involved in the regulation of purine biosynthesi
65152	Phosphofructokinase
65154	Thrombospondin type 1 domain
65156	Dihydroorotate dehydrogenase
65160	Nucleotide-sensitive chloride conductance regulator (ICln)
65162	Iodothyronine deiodinase
65165	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
65166	Signal peptidase I
65168	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cy
65169	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cy
65170	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
65171	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
65172	Protein kinase domain
65172	LIM domain. This family represents two copies of the LIM structural domain
65172	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
65178	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
65180	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
65180	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
65183	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
65185	Sulfotransferase protein
65187	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
65187	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
65190	Radical SAM superfamily
65191	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
65191	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
65193	Homeobox domain
65194	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their
65195	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
65195	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
65196	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
65197	Sugar (and other) transporter
65198	Cyclic nucleotide-binding domain
65198	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
65198	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
65200	Monocarboxylate transporter
65201	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
65202	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
65204	Calponin family repeat
65205	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
65205	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
65205	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
65206	Calcium-activated SK potassium channel
65206	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
65207	Ammonium Transporter Family
65209	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
65210	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
65217	Cadherin domain
65218	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
65220	ATP-NAD kinase. Members of this family are ATP-NAD kinases EC:2.7.1.23. Catalyses the phosphorylation of NAD to NADP utilizing ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus
65221	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
65241	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
65256	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
65256	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
65262	ATP synthase alpha/beta chain, C terminal domain
65262	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
65262	pfam02874, ATP-synt_ab_N, ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
65270	Glycosyltransferase family 6
65272	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
65275	7 transmembrane receptor (rhodopsin family)
65276	7 transmembrane receptor (rhodopsin family)
65278	Protein kinase domain
65279	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
65279	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
65793	Ribosomal protein S7e
65945	Cadherin domain
65961	Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown
65961	Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown
65969	CUB domain
65970	LIM domain. This family represents two copies of the LIM structural domain
65972	Gamma interferon inducible lysosomal thiol reductase (GILT). This family includes the two characterized human gamma-interferon-inducible lysosomal thiol reductase (GILT) sequences. It also contains several other eukaryotic putative proteins with similari
65986	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
65987	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
65988	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
65989	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
65992	PCI domain
66002	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
66011	Importin-beta N-terminal domain
66012	Adenylate and Guanylate cyclase catalytic domain
66014	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
66016	Calcium-activated potassium channel, beta subunit
66017	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
66019	Homeobox domain
66021	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
66022	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
66022	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
66023	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuro
66024	7 transmembrane receptor (rhodopsin family)
66025	7 transmembrane receptor (rhodopsin family)
66026	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
66026	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
66027	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
66028	Prenylated rab acceptor (PRA1)
66029	Indoleamine 2,3-dioxygenase
66030	Synaptophysin / synaptoporin
66035	Sugar (and other) transporter
66052	Succinate dehydrogenase cytochrome b subunit
66053	Cyclophilin type peptidyl-prolyl cis-trans isomerase
66056	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
66061	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
66072	Domain of unknown function (DUF339). This family of small proteins are uncharacterised
66084	Uncharacterized ACR, YagE family COG1723
66090	Yippee putative zinc-binding protein
66090	Yippee putative zinc-binding protein
66094	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
66101	Cyclophilin type peptidyl-prolyl cis-trans isomerase
66109	Tetraspanin family
66111	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
66112	MOSC N-terminal beta barrel domain. This domain is found to the N-terminus of pfam03473. The function of this domain is unknown, however it is predicted to adopt a beta barrel fold
66112	MOSC domain. The MOSC (MOCO sulfurase C-terminal) domain is a superfamily of beta-strand-rich domains identified in the molybdenum cofactor sulfurase and several other proteins from both prokaryotes and eukaryotes. These MOSC domains contain an absolutely
66113	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
66116	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
66118	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
66120	FKBP-type peptidyl-prolyl cis-trans isomerase
66124	Josephin
66144	ATP synthase (F/14-kDa) subunit. This family includes 14-kDa subunit from vATPases, which is in the peripheral catalytic part of the complex. The family also includes archaebacterial ATP synthase subunit F
66148	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
66153	F-box domain
66154	Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised
66166	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
66168	Uncharacterized protein family UPF0005
66177	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
66181	Nucleolar RNA-binding protein, Nop10p family. Nop10p is a nucleolar protein that is specifically associated with H/ACA snoRNAs. It is essential for normal 18S rRNA production and rRNA pseudouridylation by the ribonucleoprotein particles containing H/ACA
66182	ADP-ribosylation factor family
66192	Peptidase family M3
66194	Delta 1-pyrroline-5-carboxylate reductase
66197	Cyclin-dependent kinase regulatory subunit
66198	Fungal specific domain of unknown function (DUF391). A family of uncharacterised fungal proteins
66204	Acylphosphatase
66208	Steroid binding domain
66211	Ribosomal protein L3
66212	Sec61beta family. This family consists of homologues of Sec61beta - a component of the Sec61/SecYEG protein secretory system. The domain is found in eukaryotes and archaea and is possibly homologous to the bacterial SecG
66222	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
66234	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues.
66235	Eukaryotic initiation factor 1A
66237	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
66248	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
66251	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
66269	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
66283	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
66284	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
66286	Signal peptidase I
66290	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
66307	Isochorismatase family. This family are hydrolase enzymes
66313	C2 domain
66313	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
66313	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
66314	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction
66322	Intermediate filament protein
66328	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
66332	Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene
66333	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
66333	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
66335	V-ATPase subunit C
66354	SKIP/SNW domain. This domain is found in chromatin proteins
66355	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
66368	RNA 3'-terminal phosphate cyclase
66369	Uncharacterized protein family UPF0034
66369	Histidine biosynthesis protein. Proteins involved in steps 4 and 6 of the histidine biosynthesis pathway are contained in this family. Histidine is formed by several complex and distinct biochemical reactions catalysed by eight enzymes. The enzymes in thi
66369	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea
66369	Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrour
66373	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
66373	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
66383	NifU-like N terminal domain. This domain is found in NifU in combination with pfam01106. This domain is found on isolated in several bacterial species. The nif genes are responsible for nitrogen fixation. However this domain is found in bacteria that do
66384	SRP19 protein. The signal recognition particle (SRP) binds to the signal peptide of proteins as they are being translated. The binding of the SRP halts translation and the complex is then transported to the endoplasmic reticulum's cytoplasmic surface. Th
66387	NUDIX domain
66392	Somatotropin hormone family
66399	Elongation factor TS
66401	NUDIX domain
66406	SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1
66407	Ribosomal protein S15
66408	HIT family
66409	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
66410	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
66411	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
66420	RNA polymerase Rpb5, C-terminal domain. The assembly domain of Rpb5. The archaeal equivalent to this domain is subunit H. Subunit H lacks the N-terminal domain
66420	RNA polymerase Rpb5, N-terminal domain. Rpb5 has a bipartite structure which includes a eukaryote-specific N-terminal domain and a C-terminal domain resembling the archaeal RNAP subunit H. The N-terminal domain is involved in DNA binding and is part of th
66427	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
66435	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
66441	Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the
66445	Cytochrome C1 family
66446	3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
66446	3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
66447	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
66449	Uncharacterised protein family (UPF0108)
66454	Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase, Glycerol-3-phosphate cytidylyltransferase
66454	Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase
66454	Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase, Glycerol-3-phosphate cytidylyltransferase
66454	Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase
66459	Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest
66461	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
66464	Transcription initiation factor TFIID subunit A
66480	Ribosomal L15
66481	Ribosomal protein S21e
66482	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
66483	Ribosomal protein L44
66505	MYND finger
66505	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
66513	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
66513	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
66515	Cullin family
66541	Signal peptidase I
66549	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
66552	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
66552	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
66569	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
66576	Ubiquinol-cytochrome C reductase hinge protein. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 'hinge' protein of the complex which is thought to mediate formati
66586	CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond
66588	Adenylate kinase
66589	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
66590	tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain
66592	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
66592	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
66597	B-box zinc finger
66607	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
66612	ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER
66614	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
66615	Peptidase family C54
66616	PX domain. PX domains bind to phosphoinositides
66622	Putative zinc finger in N-recognin
66647	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
66659	Histidine acid phosphatase
66663	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
66667	jmjC domain
66673	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
66676	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
66682	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
66686	LCCL domain
66689	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
66691	Vacuolar sorting protein 9 (VPS9) domain
66696	PX domain. PX domains bind to phosphoinositides
66698	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
66700	Eukaryotic protein of unknown function, DUF279
66701	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
66702	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
66704	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members invol
66711	Uncharacterized protein family UPF0023
66720	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
66733	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
66733	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
66734	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
66745	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction
66747	F-actin capping protein alpha subunit
66767	Fructose-bisphosphate aldolase class-I
66786	7 transmembrane receptor (rhodopsin family)
66795	Autophagocytosis associated protein, C-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole
66799	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
66805	Tetraspanin family
66808	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
66809	Intermediate filament protein
66810	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
66821	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
66824	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
66830	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
66844	ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER
66848	Alpha-L-fucosidase
66853	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o
66854	B-box zinc finger
66854	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
66860	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
66861	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
66863	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
66863	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
66863	Recombination activating protein 2. V-D-J recombination is the combinatorial process by which the huge range of immunoglobulin and T cell binding specificity is generated from a limited amount of genetic material. This process is synergistically activate
66866	NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in at least one member, proteolysis has shown that the Peptid
66866	NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in some members, proteolysis has shown that the Peptidyl-alph
66867	HMG (high mobility group) box
66871	C2 domain
66873	Fibronectin type III domain
66873	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
66877	HAT (Half-A-TPR) repeat. The HAT (Half A TPR) repeat is found in several RNA processing proteins
66889	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
66890	Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-
66892	Eukaryotic initiation factor 4E
66901	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
66902	Phosphorylase family 2
66904	Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction
66905	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
66913	ER lumen protein retaining receptor
66914	VPS28 protein
66921	PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is abou
66921	PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is about
66926	Eukaryotic initiation factor 3, gamma subunit. eIF-3 is a multisubunit complex that stimulates translation initiation in vitro at several different steps. This family corresponds to the gamma subunit if eIF3
66938	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
66943	PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How
66945	Fumarate reductase/succinate dehydrogenase flavoprotein C-terminal domain. This family contains fumarate reductases, succinate dehydrogenases and L-aspartate oxidases
66945	pfam02910, succ_DH_flav_C, Fumarate reductase/succinate dehydrogenase flavoprotein C-terminal domain. This family contains fumarate reductases, succinate dehydrogenases and L-aspartate oxidases
66945	FAD binding domain. This family includes members that bind FAD. This family includes the flavoprotein subunits from succinate and fumarate dehydrogenase, aspartate oxidase and the alpha subunit of adenylylsulfate reductase
66957	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
66958	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
66959	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
66964	Got1-like family. Traffic through the yeast Golgi complex depends on a member of the syntaxin family of SNARE proteins, Sed5, present in early Golgi cisternae. Got1 is thought to facilitate Sed5-dependent fusion events
66968	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
66977	Nuf2 family. Members of this family are components of the mitotic spindle. It has been shown that Nuf2 from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle. An arabidopsis protein h
66978	Protein of unknown function, DUF259
66978	Protein of unknown function, DUF259
66993	BAF60b domain of the SWIB complex
67000	Somatotropin hormone family
67016	PH domain. PH stands for pleckstrin homology
67016	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
67028	Eukaryotic protein of unknown function, DUF279
67038	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
67042	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
67043	BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function
67052	HEC/Ndc80p family. Members of this family are components of the mitotic spindle. It has been shown that Ndc80/HEC from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle
67053	Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40
67054	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
67054	AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this
67064	Eukaryotic protein of unknown function, DUF279
67068	Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn
67071	Protein kinase domain
67071	Protein kinase C terminal domain
67072	Cdc37 family. In the budding yeast Saccharomyces cerevisiae, Cdc37 is required for the productive formation of Cdc28-cyclin complexes. Cdc37 may be a kinase targeting subunit of Hsp90
67073	Phosphatidylinositol 3- and 4-kinase
67075	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
67097	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
67106	Protein of unknown function DUF101. The members of this family are uncharacterised. The alignment of these proteins contains several conserved polar residues that might be potential catalytic residues
67111	Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in
67115	Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14
67117	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
67123	UBA/TS-N domain
67131	Acyl CoA binding protein
67133	Zona pellucida-like domain
67136	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
67160	Elongation factor 1 gamma, conserved domain
67161	DNA gyrase/topoisomerase IV, subunit A
67161	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
67161	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
67166	ADP-ribosylation factor family
67168	7 transmembrane receptor (rhodopsin family)
67169	Death domain
67178	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
67181	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
67187	MYND finger
67199	KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif
67201	Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily
67204	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
67205	Utp11 protein. This protein is found to be part of a large ribonucleoprotein complex containing the U3 snoRNA. Depletion of the Utp proteins impedes production of the 18S rRNA, indicating that they are part of the active pre-rRNA processing complex. This
67224	V-type ATPase 116kDa subunit family
67231	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
67239	Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids
67252	CAP protein
67260	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
67281	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
67283	Mitochondrial carrier protein
67285	Cyclophilin type peptidyl-prolyl cis-trans isomerase
67286	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
67287	Integral membrane protein DUF110. This archaebacterial protein family has no known function. Some members of this family are annotated as FlaJ, however we can find no supporting evidence for this annotation
67287	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
67300	Coatomer WD associated domain
67300	Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain
67300	Clathrin propeller repeat. Clathrin is the scaffold protein of the basket-like coat that surrounds coated vesicles. The soluble assembly unit, a triskelion, contains three heavy chains and three light chains in an extended three-legged structure. Each leg
67302	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
67307	Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family appears to be distantly related to pfam01678, including containing a weak internal
67310	Somatotropin hormone family
67332	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
67338	FYVE zinc finger
67344	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
67366	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
67369	Glutaminyl cyclase. Glutaminyl cyclase catalyses the formation of the pyroglutamyl residue present at the amino terminus of numerous secretory peptides and proteins. Glutaminyl cyclase posses a zinc aminopeptidase domain in which the four functionally im
67370	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
67373	Trypsin
67373	Trypsin
67384	BAG domain. Domain present in Hsp70 regulators
67390	SpoU rRNA Methylase family. This family of proteins probably use S-AdoMet
67393	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
67398	Signal recognition particle, alpha subunit, N-terminal. SRP is a complex of six distinct polypeptides and a 7S RNA that is essential for transferring nascent polypeptide chains that are destined for export from the cell to the translocation apparatus of
67402	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
67412	Uncharacterized ACR, COG1579
67414	GTPase of unknown function
67420	Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included
67427	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
67429	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
67429	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
67441	Isochorismatase family. This family are hydrolase enzymes
67443	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
67444	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
67446	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
67452	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o
67453	Mitochondrial carrier protein
67455	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
67459	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
67459	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
67464	GDA1/CD39 (nucleoside phosphatase) family
67466	Phosducin
67471	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
67484	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
67489	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
67492	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
67492	AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets in
67498	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
67498	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
67498	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
67498	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
67500	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
67505	Somatotropin hormone family
67511	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
67512	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
67516	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
67525	Zinc finger, C3HC4 type (RING finger)
67525	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
67526	Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5
67527	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
67527	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
67527	Gag polyprotein, inner coat protein p12. The retroviral p12 is a virion structural protein. p12 is proline rich. The function carried out by p12 in assembly and replication is unknown. p12 is associated with pathogenicity of the virus
67528	NUDIX domain
67528	NUDIX domain
67530	Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly i
67533	Uncharacterized ACR, COG1579
67533	Intermediate filament protein
67533	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
67533	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
67533	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
67543	Poly-adenylate binding protein, unique domain
67547	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
67553	ubiE/COQ5 methyltransferase family
67553	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
67553	Glucose inhibited division protein. This is a family of bacterial Glucose inhibited division proteins these are probably involved in the regulation of cell devision
67553	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
67554	Mitochondrial carrier protein
67561	WD domain, G-beta repeat
67562	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
67563	pfam02906, Fe_hyd_lg_C, Iron only hydrogenase large subunit, C-terminal domain
67573	Lysyl oxidase
67574	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain conta
67582	Mitochondrial carrier protein
67586	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
67591	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
67597	Protein kinase C terminal domain
67603	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
67604	Protein of unknown function (DUF410). Family of conserved eukaryotic proteins with undetermined function
67607	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
67608	pfam02906, Fe_hyd_lg_C, Iron only hydrogenase large subunit, C-terminal domain
67630	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
67634	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
67666	Extracellular link domain
67666	Extracellular link domain
67666	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
67667	ubiE/COQ5 methyltransferase family
67671	Ribosomal L38e protein family
67674	Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest
67674	Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest
67675	CutA1 divalent ion tolerance protein. Several gene loci with a possible involvement in cellular tolerance to copper have been identified. One such locus in eubacteria and archaebacteria, cutA, is thought to be involved in cellular tolerance to a wide var
67681	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
67684	Protein of unknown function, DUF259
67689	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
67702	Zinc finger, C3HC4 type (RING finger)
67702	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
67703	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
67703	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
67712	Mitochondrial carrier protein
67712	Mitochondrial carrier protein
67713	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
67717	ab-hydrolase associated lipase region
67722	Actin
67728	Putative diphthamide synthesis protein. One member is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein s
67728	Putative diphthamide synthesis protein. One member is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein sy
67728	Putative diphthamide synthesis protein.One member is a candidate tumour suppressor gene. DPH2 from yeast1, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein sy
67738	Cyclophilin type peptidyl-prolyl cis-trans isomerase
67760	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
67760	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
67768	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
67772	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
67784	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
67790	ADP-ribosylation factor family
67790	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
67795	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
67800	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
67803	LIM domain. This family represents two copies of the LIM structural domain
67804	PX domain. PX domains bind to phosphoinositides
67804	Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2
67815	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
67815	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
67821	Sodium / potassium ATPase beta chain
67830	Rer1 family. RER1 family protein are involved in involved in the retrieval of some endoplasmic reticulum membrane proteins from the early golgi compartment. The C terminus of yeast Rer1p interacts with a coatomer complex
67834	Isocitrate/isopropylmalate dehydrogenase
67841	Autophagocytosis associated protein, C-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole
67841	Autophagocytosis associated protein, N-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole
67843	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
67848	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
67848	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
67854	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
67854	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
67855	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
67860	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
67861	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
67863	Mitochondrial carrier protein
67878	Uncharacterised protein family (UPF0121). Uncharacterised integral membrane protein family
67880	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
67890	Uncharacterised protein family (UPF0185). This family contains a number of small uncharacterised proteins including BM-002
67891	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
67891	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
67892	Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase
67895	Inorganic pyrophosphatase
67897	mRNA capping enzyme, large subunit
67909	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
67931	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
67933	Fibronectin type III domain
67941	Ribosomal protein S27
67942	ATP synthase subunit C
67952	MAS20 protein import receptor
67958	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
67958	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
67963	ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r
67972	E1-E2 ATPase
67972	Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
67972	Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
67972	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
67973	Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerev
67974	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
67980	Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase
67981	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
67988	Calsequestrin
67988	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
67991	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
68009	Defensin propeptide
68011	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
68028	Ribosomal L22e protein family
68039	Bombesin-like peptide
68047	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has
68052	Ribosomal protein S15
68053	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
68054	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
68059	Endomembrane protein 70
68066	Mitochondrial carrier protein
68079	Programmed cell death protein 2, C-terminal domain
68080	Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity
68082	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
68087	Dephospho-CoA kinase. This family catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction
68090	Hrf1 family. This family includes a number of eukaryotic proteins. It is an integral membrane protein, conserved in at least 1 copy in all sequenced eukaryotes. The gene name in S. pombe is hrf1+ for Heavy metal Resistance Factor 1
68097	Dynein light chain type 1
68107	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
68133	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl grou
68134	Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3
68137	ER lumen protein retaining receptor
68140	Tc5 transposase
68140	CENP-B protein. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding domain, which binds to a corresponding 17bp CENP-B box sequence. In the C-terminal 59 residues,
68140	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
68146	ADP-ribosylation factor family
68153	TFIIE beta subunit core domain. General transcription factor TFIIE consists of two subunits, TFIIE alpha pfam02002 and TFIIE beta. TFIIE beta has been found to bind to the region where the promoter starts to open to be single-stranded upon transcription
68159	Syntaxin
68178	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
68192	Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protei
68193	Ribosomal protein L24e
68195	Ribonuclease T2 family
68212	Uncharacterized protein family UPF0005
68219	MutT-like domain
68239	Intermediate filament protein
68267	Mitochondrial carrier protein
68273	GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis o
68273	GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis o
68275	Replication factor-A protein 1, N-terminal domain
68278	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
68278	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
68279	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
68280	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
68291	Glucose inhibited division protein A
68292	Oligosaccharyl transferase STT3 subunit. This family consists of the oligsacharyl transferase STT3 subunit and related proteins. The STT3 subunit is part of the oligosccharyl transferase (OTase) complex of proteins and is required for its activity. OTase
68299	Vps53-like, N-terminal domain. Vps53 complexes with Vps52 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events
68304	Filamin/ABP280 repeat
68311	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
68312	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
68312	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
68337	LIM domain. This family represents two copies of the LIM structural domain
68338	Got1-like family. Traffic through the yeast Golgi complex depends on a member of the syntaxin family of SNARE proteins, Sed5, present in early Golgi cisternae. Got1 is thought to facilitate Sed5-dependent fusion events
68346	Sir2 family
68346	NAD(P) transhydrogenase beta subunit. This family corresponds to the beta subunit of NADP transhydrogenase in prokaryotes, and either the protein N- or C terminal in eukaryotes. The domain is often found in conjunction with pfam01262. Pyridine nucleotide
68348	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
68349	Respiratory-chain NADH dehydrogenase, 30 Kd subunit
68352	Domain of unknown function DUF108. This family has no known function. It is found to compose the complete protein in archaebacteria and a single domain in a large C. elegans protein
68360	Electron transfer flavoprotein beta subunit. This protein is distantly related to and forms a heterodimer with pfam00766
68371	Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family appears to be distantly related to pfam01678, including containing a weak internal
68393	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
68396	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
68401	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
68423	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
68423	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
68427	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
68436	Ribosomal protein L34e
68441	ADP-ribosylation factor family
68441	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
68444	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
68453	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
68460	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
68463	Ribosomal protein L14p/L23e
68465	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
68473	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc
68479	Uncharacterised protein family (UPF0123). This family of proteins are uncharacterised, they contain five CXXC motifs
68494	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
68497	Filamin/ABP280 repeat
68497	Filamin/ABP280 repeat
68497	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
68498	Tetraspanin family
68507	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
68509	Pentaxin family. Pentaxins are also known as pentraxins
68511	Doublecortin
68526	Auxin Efflux Carrier
68526	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
68553	von Willebrand factor type A domain
68553	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
68553	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
68572	Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly con
68581	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
68585	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
68588	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
68591	MOSC N-terminal beta barrel domain. This domain is found to the N-terminus of pfam03473. The function of this domain is unknown, however it is predicted to adopt a beta barrel fold
68591	MOSC domain. The MOSC (MOCO sulfurase C-terminal) domain is a superfamily of beta-strand-rich domains identified in the molybdenum cofactor sulfurase and several other proteins from both prokaryotes and eukaryotes. These MOSC domains contain an absolutely
68598	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
68603	Phosphomevalonate kinase. Phosphomevalonate kinase (EC:2.7.4.2) catalyzes the phosphorylation of 5-phosphomevalonate into 5-diphosphomevalonate, an essential step in isoprenoid biosynthesis via the mevalonate pathway. This family represents the animal ty
68604	WD domain, G-beta repeat
68606	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
68607	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
68616	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
68621	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain
68621	Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
68621	pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
68628	WD domain, G-beta repeat
68646	ATP-NAD kinase. Members of this family are ATP-NAD kinases EC:2.7.1.23. Catalyses the phosphorylation of NAD to NADP utilizing ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus
68653	Bacterial surface antigen. This entry includes the following surface antigens
68666	Sugar (and other) transporter
68667	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
68667	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
68668	Trypsin
68678	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
68701	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
68701	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
68703	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p
68705	Transcription initiation factor IIF, beta subunit. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIF (TFIIF) is a tetramer of two beta subun
68723	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
68724	ADP-ribosylation factor family
68729	B-box zinc finger
68729	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
68730	Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrou
68731	Ribosome-binding factor A
68735	Ribosomal protein S18
68750	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
68753	Fibronectin type III domain
68760	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
68764	Cadherin domain
68774	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
68775	V-ATPase subunit C
68792	HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unc
68794	Filamin/ABP280 repeat
68801	GNS1/SUR4 family
68802	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
68810	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
68815	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
68815	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
68828	Intermediate filament protein
68833	Phosducin
68833	Phosducin
68837	Fork head domain
68837	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
68839	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
68839	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
68842	Tub family
68860	Cache domain
68865	Arv1-like family. Arv1 is a transmembrane protein with potential zinc-binding motifs. ARV1 is a novel mediator of eukaryotic sterol homeostasis
68870	Shikimate kinase
68870	Adenylate kinase
68870	Papillomavirus helicase. This protein is a DNA helicase that is required for initiation of viral DNA replication. This protein forms a complex with the E2 protein pfam00508
68895	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
68897	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
68897	7 transmembrane receptor (metabotropic glutamate family)
68897	AcrB/AcrD/AcrF family. Members of this family are integral membrane proteins. Some are involved in drug resistance
68897	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
68897	MMPL family. Members of this family are putative integral membrane proteins from bacteria. Several of the members are mycobacterial proteins. Many of the proteins contain two copies of this aligned region. The function of these proteins is not known, alth
68904	Domain of unknown function (DUF341)
68904	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
68904	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
68911	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
68915	tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included
68917	HIT family
68922	WD domain, G-beta repeat
68926	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t
68929	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
68938	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
68938	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
68939	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
68942	Eukaryotic protein of unknown function, DUF279
68942	Eukaryotic protein of unknown function, DUF279
68943	Protein kinase domain
68943	Protein kinase domain
68947	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
68953	Eukaryotic protein of unknown function, DUF279
68961	Protein kinase domain
68966	Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown
68969	Translation initiation factor SUI1
68972	Metalloenzyme of unknown function DUF136. This family of archaeal proteins has no known function. However they appear to be related to a large superfamily of metalloenzymes
68972	TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme
68976	Alpha-L-fucosidase
68977	Metallo-beta-lactamase superfamily
68980	WD domain, G-beta repeat
68992	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
68999	Anaphase-promoting complex, subunit 10 (APC10)
69008	Mo25 protein family
69014	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
69020	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
69024	MIT domain
69024	PX domain. PX domains bind to phosphoinositides
69024	MIT domain
69024	PX domain. PX domains bind to phosphoinositides
69028	MIT domain
69032	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzyme
69036	Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein
69047	E1-E2 ATPase
69047	Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
69047	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
69049	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
69051	Delta 1-pyrroline-5-carboxylate reductase
69052	SH2 domain
69060	Lipase
69070	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
69077	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
69083	Sulfotransferase protein
69089	60Kd inner membrane protein
69091	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp
69097	B-box zinc finger
69097	B-box zinc finger
69106	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
69106	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
69109	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
69113	2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2
69117	Zinc-binding dehydrogenase
69117	Prephenate dehydrogenase. Members of this family are prephenate dehydrogenases EC:1.3.1.12 involved in tyrosine biosynthesis
69121	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
69123	Acyl CoA binding protein
69144	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
69149	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
69149	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
69149	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
69149	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
69156	O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production
69156	ubiE/COQ5 methyltransferase family
69156	O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production
69168	BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems
69171	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
69178	PX domain. PX domains bind to phosphoinositides
69183	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
69185	DTW domain. This presumed domain is found in bacterial and eukaryotic proteins. Its function is unknown. The domain contains multiple conserved motifs including a DTXW motif that this domain has been named after
69186	Protein of unknown function (DUF423). Potential integral membrane protein
69188	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
69188	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
69191	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
69207	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
69215	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
69219	Amidinotransferase. This family contains glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2) amidinotransferases, enzymes involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, EC:3.5.3.6. These enzymes
69226	PX domain. PX domains bind to phosphoinositides
69228	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
69228	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
69228	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
69228	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
69239	PH domain. PH stands for pleckstrin homology
69239	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
69241	RNA polymerase Rpb4
69241	RNA polymerase Rpb4
69253	Hsp20/alpha crystallin family
69256	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
69256	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
69259	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
69259	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
69259	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
69270	Uncharacterised protein family (UPF0080)
69274	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
69276	Translocation protein Sec62
69277	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
69286	SCP-like extracellular protein
69286	SCP-like extracellular protein. This domain is also found in prokaryotes
69288	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
69288	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
69294	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family
69294	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
69317	HMG (high mobility group) box
69335	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription
69352	Uncharacterized ACR, YneC family COG1359
69354	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
69362	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
69362	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family
69372	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
69379	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
69382	Core histone H2A/H2B/H3/H4
69382	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
69389	Core histone H2A/H2B/H3/H4
69416	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
69416	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
69435	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist
69444	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes
69459	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
69462	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
69511	Trypsin
69538	von Willebrand factor type A domain
69562	Protein kinase domain
69574	Dienelactone hydrolase family
69574	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
69583	TNF(Tumor Necrosis Factor) family
69597	Peptidase family M41
69597	Sigma-54 interaction domain
69597	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
69601	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
69602	Protein of unknown function, DUF270
69608	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24
69632	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
69632	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
69635	Death domain
69656	Uncharacterized BCR, YhhW family COG1741
69660	Uncharacterized protein family UPF0005
69672	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
69674	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA
69675	Animal haem peroxidase
69675	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
69675	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
69677	Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18
69687	Yippee putative zinc-binding protein
69693	Intermediate filament protein
69693	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
69713	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
69716	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
69716	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
69718	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
69719	Aspartate/ornithine carbamoyltransferase, Asp/Orn binding domain
69719	Aspartate/ornithine carbamoyltransferase, carbamoyl-P binding domain
69719	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
69719	D-ala D-ala ligase. This family contains D-alanine--D-alanine ligase enzymes EC:6.3.2.4
69719	Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosyn
69719	Carbamoyl-phosphate synthase L chain, ATP binding domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosy
69719	Carbamoyl-phosphate synthetase large chain, oligomerisation domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. The carbamoyl-phosphate synthase (CPS) enzyme in prokar
69719	MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also i
69721	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
69726	MYND finger
69727	Ubiquitin carboxyl-terminal hydrolase family 2
69727	Ubiquitin carboxyl-terminal hydrolases family 2
69737	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
69745	DNA polymerase delta, subunit 4
69761	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog
69772	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
69774	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
69780	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
69787	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
69814	Trypsin
69826	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
69833	RNA polymerase Rpb6. Rpb6 is an essential subunit in the eukaryotic polymerases Pol I, II and III. The bacterial equivalent to Rpb6 is the omega subunit. Rpb6 and omega are structurally conserved and both function in polymerase assembly
69837	Zinc finger, C3HC4 type (RING finger)
69847	Protein kinase domain
69860	Eukaryotic initiation factor 1A
69864	Phospholipase A2 inhibitor
69865	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
69888	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
69902	Ribosomal protein L10
69903	DIL domain. The DIL domain has no known function
69906	Mitochondrial carrier protein
69908	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
69920	RNA polymerases M/15 Kd subunit
69923	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
69934	Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue
69955	tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain
69965	'Cold-shock' DNA-binding domain
69976	GHMP kinases putative ATP-binding protein
69981	LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in resp
69993	SH2 domain
69993	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
69993	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
70008	Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid
70009	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members
70019	ENV polyprotein (coat polyprotein)
70025	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-C
70026	TspO/MBR family. Tryptophan-rich sensory protein (TspO) is an integral membrane protein that acts as a negative regulator of the expression of specific photosynthesis genes in response to oxygen/light. It is involved in the efflux of porphyrin intermedia
70028	Dopey, N-terminal domain. DopA is the founding member of the Dopey family and is required for correct cell morphology and spatiotemporal organization of multicellular structures in the filamentous fungus Aspergillus nidulans. DopA homologues are found in
70044	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
70047	Poly A polymerase family. This family includes nucleic acid independent RNA polymerases, such as Poly(A) polymerase, which adds the poly (A) tail to mRNA EC:2.7.7.19. This family also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of
70052	WD domain, G-beta repeat
70059	Fatty acid desaturase
70061	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
70061	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
70065	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
70065	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
70065	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
70073	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
70086	7 transmembrane receptor (rhodopsin family)
70093	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
70097	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
70099	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
70099	RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated
70099	SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily
70101	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
70114	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
70120	tRNA synthetases class I (W and Y)
70122	YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity
70152	ubiE/COQ5 methyltransferase family
70160	Vacuolar protein sorting 36. Vps36 is involved in Golgi to endosome trafficking
70160	Vacuolar protein sorting 36. Vps36 is involved in Golgi to endosome trafficking
70163	Phospholipase A2 inhibitor
70166	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
70192	Lectin C-type domain. This family includes both long and short form C-type
70202	Papain family cysteine protease
70207	Domain of unknown function DUF28. This domain is found in bacterial and yeast proteins it compromises the entire length or central region of most of the proteins in the family, all of which are hypothetical with no known function. The average length of th
70218	Kinesin motor domain
70223	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
70223	OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl -tRNA synthetases cata
70233	GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein
70235	WD domain, G-beta repeat
70310	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve
70314	Rabaptin
70315	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
70335	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
70349	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
70355	7 transmembrane receptor (metabotropic glutamate family)
70361	Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-
70363	Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol
70364	Fibronectin type III domain
70369	BAG domain. Domain present in Hsp70 regulators
70370	Sushi domain (SCR repeat)
70370	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
70370	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
70380	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
70381	PH domain. PH stands for pleckstrin homology
70383	UbiA prenyltransferase family
70406	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps
70425	Protein kinase domain
70428	RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase comp
70430	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
70432	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
70432	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
70435	WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin
70439	Zn-finger in Ran binding protein and others
70439	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
70450	C2 domain
70451	NADP oxidoreductase coenzyme F420-dependent
70451	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
70456	Uncharacterised protein family (UPF0041)
70472	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
70472	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
70478	Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but func
70479	PX domain. PX domains bind to phosphoinositides
70503	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
70508	HMG (high mobility group) box
70520	SCP-like extracellular protein
70520	SCP-like extracellular protein. This domain is also found in prokaryotes
70527	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has
70530	Fibronectin type III domain
70533	NAC domain
70536	Glutaminyl cyclase. Glutaminyl cyclase catalyses the formation of the pyroglutamyl residue present at the amino terminus of numerous secretory peptides and proteins. Glutaminyl cyclase posses a zinc aminopeptidase domain in which the four functionally imp
70546	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
70549	I/LWEQ domain. I/LWEQ domains bind to actin. It has been shown that the I/LWEQ domains from mouse talin and yeast Sla2p interact with F-actin. I/LWEQ domains can be placed into four major groups based on sequence similarity: (1) Metazoan talin
70550	Gelsolin repeat
70550	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
70551	TPR Domain
70556	Mitochondrial carrier protein
70559	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
70561	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
70571	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
70571	FF domain. This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions
70574	Zinc carboxypeptidase
70584	Protein kinase domain
70584	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
70604	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
70611	F-box domain
70616	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
70620	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
70620	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
70620	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
70640	NUDIX domain
70646	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
70650	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members invol
70673	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
70673	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
70673	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
70676	Phosphotyrosine interaction domain (PTB/PID)
70676	Phosphotyrosine interaction domain (PTB/PID)
70686	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
70693	7 transmembrane receptor (Secretin family)
70693	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
70693	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
70696	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
70719	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
70719	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
70727	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
70729	Phosphotyrosine interaction domain (PTB/PID)
70737	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
70747	Tetraspanin family
70750	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
70767	PWI domain
70772	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
70772	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
70784	ADP-ribosylation factor family
70784	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
70785	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
70785	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
70785	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
70788	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
70788	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
70796	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
70796	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
70797	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
70797	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
70802	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
70804	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
70810	Intermediate filament protein
70823	HMG (high mobility group) box
70834	Intermediate filament protein
70839	7 transmembrane receptor (rhodopsin family)
70840	Sugar (and other) transporter
70843	Intermediate filament protein
70861	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
70866	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
70866	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
70873	Cyclic nucleotide-binding domain
70882	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
70892	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
70893	Glycosyl hydrolases family 35
70902	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
70902	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
70911	Fibronectin type III domain
70911	Fibronectin type III domain
70927	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
70928	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
70930	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
70945	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
70945	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
70965	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
70974	Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II
70974	Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I
70974	pfam02878, PGM_PMM_I, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I
70974	pfam02879, PGM_PMM_II, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II
70980	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
70980	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
70981	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
70981	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
70990	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
70999	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
71003	Trypsin
71027	Eukaryotic protein of unknown function, DUF284. Members of this family have no known function. They have been predicted to contain transmembrane helices
71027	LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo
71037	Trypsin
71041	Zinc finger, C3HC4 type (RING finger)
71062	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
71063	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
71066	HSF-type DNA-binding
71066	HSF-type DNA-binding domain
71078	Disintegrin
71078	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
71078	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
71085	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
71089	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
71091	Protein kinase domain
71093	Helix-loop-helix DNA-binding domain
71103	Glycosyltransferase family 6
71131	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
71137	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
71147	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
71147	Beta-ketoacyl synthase, C-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains
71147	Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains.
71164	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
71176	F-box domain
71183	Lectin C-type domain. This family includes both long and short form C-type
71207	NUDIX domain
71228	Guanylate kinase
71268	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
71275	Delta 1-pyrroline-5-carboxylate reductase
71279	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their
71287	Serine carboxypeptidase
71302	PH domain. PH stands for pleckstrin homology
71302	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
71302	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
71310	GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
71310	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
71310	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
71330	Regulator of chromosome condensation (RCC1)
71330	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
71351	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
71355	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
71365	Polyprenyl synthetase
71367	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
71375	Fork head domain
71382	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
71389	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
71393	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
71393	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
71395	CUB domain
71395	CUB domain
71395	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
71412	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
71435	PH domain. PH stands for pleckstrin homology
71435	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
71436	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
71436	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
71436	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
71452	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
71468	Homeobox domain
71472	MYND finger
71472	Ubiquitin carboxyl-terminal hydrolase family 2
71472	Ubiquitin carboxyl-terminal hydrolases family 2
71519	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
71520	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
71522	Gamma-glutamyltranspeptidase
71538	F-box domain
71540	PX domain. PX domains bind to phosphoinositides
71544	PH domain. PH stands for pleckstrin homology
71544	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
71567	MCM2/3/5 family
71584	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
71591	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
71592	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
71592	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
71597	Homeobox domain
71599	Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
71599	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
71602	Myosin head (motor domain)
71602	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
71609	Death domain
71619	ADP-ribosylation factor family
71619	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
71619	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
71648	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
71665	Alpha-L-fucosidase
71678	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
71682	WD domain, G-beta repeat
71685	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
71693	Lectin C-type domain. This family includes both long and short form C-type
71699	Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples hav
71709	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
71709	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
71710	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
71710	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
71720	Oxysterol-binding protein
71726	Uracil DNA glycosylase superfamily
71729	Raf-like Ras-binding domain
71729	Phosphotyrosine interaction domain (PTB/PID)
71729	Raf-like Ras-binding domain
71729	Phosphotyrosine interaction domain (PTB/PID)
71729	LGN motif, putative GEF specific for G-alpha GTPase
71729	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
71729	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
71732	Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain
71733	Sushi domain (SCR repeat)
71738	MAM domain. An extracellular domain found in many receptors
71738	MAM domain. An extracellular domain found in many receptors
71742	MIT domain
71742	Protein kinase domain
71743	Dephospho-CoA kinase. This family catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction
71745	Cullin family
71746	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
71749	Zinc-binding dehydrogenase
71754	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
71756	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
71761	Chlorohydrolase. This family consist of chlorohydrolase from the ATZ/TRZ family
71761	Adenine deaminase. Adenine deaminase EC:3.5.4.2 hydrolyses adenine to form hypoxanthine and ammonia. The enzyme is part of a large metal dependent hydrolase superfamily. Adenine deaminases reaction is important for adenine utilization as a purine and also
71766	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
71766	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
71768	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
71770	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
71770	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
71773	UDP-glucoronosyl and UDP-glucosyl transferase
71775	Transferrin
71778	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
71778	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
71778	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
71780	Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play
71787	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
71790	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
71791	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
71791	Zinc carboxypeptidase
71791	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol
71793	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
71795	Phosphatidylinositol transfer protein
71801	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
71801	PH domain. PH stands for pleckstrin homology
71801	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
71803	Mitochondrial carrier protein
71803	Mitochondrial carrier protein
71805	Nucleoporin interacting componentent
71810	RanBP1 domain
71816	Zinc finger, C3HC4 type (RING finger)
71819	Kinesin motor domain
71820	WD domain, G-beta repeat
71820	WD domain, G-beta repeat
71822	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
71822	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
71822	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
71824	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
71827	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
71828	Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep
71830	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
71832	Citrate synthase
71833	WD domain, G-beta repeat
71834	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
71840	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
71847	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
71853	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
71854	Renal dipeptidase
71865	F-box domain
71869	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
71879	AMP-binding enzyme
71881	Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine
71883	UbiA prenyltransferase family
71884	Glycosyl hydrolases family 18
71884	Oxygen-independent Coproporphyrinogen III oxidase. This family of bacterial enzymes catalyses the anaerobic transformation of coproporphyrinogen III to protoporphyrinogen IX required for porphyrin biosynthesis
71887	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
71888	Intermediate filament protein
71889	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
71890	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
71891	Cytidine and deoxycytidylate deaminase zinc-binding region
71893	PX domain. PX domains bind to phosphoinositides
71902	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
71903	Carboxylesterase
71904	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h
71907	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
71908	PMP-22/EMP/MP20/Claudin family
71910	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
71911	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
71916	Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrou
71922	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
71924	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
71924	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
71932	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
71934	Eukaryotic-type carbonic anhydrase
71941	tRNA synthetases class I (K). This family includes only lysyl tRNA synthetases from prokaryotes
71941	tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included
71941	tRNA synthetases class I (C). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only cysteinyl tRNA synthetases
71943	VHS domain. Domain present in VPS-27, Hrs and STAM
71943	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
71949	Homeobox domain
71949	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
71950	Homeobox domain
71951	Glypican
71955	Eukaryotic protein of unknown function, DUF292
71960	Myosin head (motor domain)
71960	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
71966	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
71967	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
71972	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
71972	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
71972	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
71981	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
71982	PX domain. PX domains bind to phosphoinositides
71985	Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold
71985	Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle
71985	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
71985	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
71989	RNA pseudouridylate synthase. Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes RluD, a pseudouridylate synthase that converts specific uracils to
71990	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
71994	Calponin family repeat
71996	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
71998	Mitochondrial carrier protein
72000	Intermediate filament tail domain
72002	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
72002	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
72003	Synaptophysin / synaptoporin
72007	Fibronectin type III domain
72008	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
72014	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
72017	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
72023	Cytochrome b561
72026	tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs
72027	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
72033	TSC-22/dip/bun family
72040	Cadherin domain
72053	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
72054	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
72055	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
72057	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
72061	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
72068	NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5
72076	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
72077	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
72082	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
72090	GDA1/CD39 (nucleoside phosphatase) family
72094	UDP-glucoronosyl and UDP-glucosyl transferase
72096	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
72096	Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity
72106	jmjC domain
72121	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
72121	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
72121	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
72129	Peroxin 13, N-terminal domain. Both termini of the Peroxin-13 are oriented to the cytosol. Peroxin-13 is required for peroxisomal association of peroxin-14
72133	TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that
72135	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
72136	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
72137	WD domain, G-beta repeat
72137	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
72147	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
72151	Rad17 cell cycle checkpoint protein
72151	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
72152	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
72152	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
72154	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
72160	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
72162	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
72162	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
72167	ubiE/COQ5 methyltransferase family
72167	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
72167	Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity
72168	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
72169	B-box zinc finger
72175	Sugar (and other) transporter
72179	F-box domain
72183	PX domain. PX domains bind to phosphoinositides
72184	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
72184	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
72194	F-box domain
72205	HELP domain. The HELP (Hydrophobic ELP) domain is found in EMAP and EMAP-like proteins (ELPs). Although called a domain it contains a predicted transmembrane helix and may not form a globular domain. It is also not clear if these proteins localize to memb
72205	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes.
72238	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
72252	NAD-dependent glycerol-3-phosphate dehydrogenase
72258	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
72265	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
72269	Cytidine and deoxycytidylate deaminase zinc-binding region
72278	Late embryogenesis abundant protein. Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress. The function of these proteins is unknown
72281	SH2 domain
72287	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
72290	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
72294	7 transmembrane receptor (rhodopsin family)
72296	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
72296	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
72297	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
72303	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
72303	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
72305	Tetraacyldisaccharide-1-P 4'-kinase. This family consists of tetraacyldisaccharide-1-P 4'-kinase also known as Lipid-A 4'-kinase or Lipid A biosynthesis protein LpxK, EC:2.7.1.130. This enzyme catalyses the reaction: ATP + 2,3-bis(3-hydroxytetradecanoyl)-
72310	PMP-22/EMP/MP20/Claudin family
72318	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
72318	PH domain. PH stands for pleckstrin homology
72318	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
72325	Vacuolar sorting protein 9 (VPS9) domain
72330	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
72333	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
72344	Ubiquitin carboxyl-terminal hydrolase family 2
72344	Ubiquitin carboxyl-terminal hydrolases family 2
72349	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
72351	Protein prenyltransferase alpha subunit repeat
72361	Carboxylesterase
72373	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
72378	PH domain. PH stands for pleckstrin homology
72378	PH domain. PH stands for pleckstrin homology
72378	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
72388	Protein kinase domain
72393	Uncharacterized protein family UPF0005
72399	Zn-finger in ubiquitin-hydrolases and other protein
72413	Calcium-activated potassium channel, beta subunit
72432	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
72459	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
72461	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
72461	Aminopeptidase I zinc metalloprotease (M18)
72465	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
72469	PH domain. PH stands for pleckstrin homology
72479	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
72479	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
72480	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
72480	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
72482	Acyl CoA binding protein
72500	7 transmembrane receptor (Secretin family)
72508	Protein kinase domain
72508	Protein kinase C terminal domain
72515	WD domain, G-beta repeat
72535	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
72542	Phosphoglycerate mutase family
72542	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
72544	3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
72549	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
72549	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
72552	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
72556	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
72560	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
72562	Pterin 4 alpha carbinolamine dehydratase. Pterin 4 alpha carbinolamine dehydratase is also known as DCoH (dimerisation cofactor of hepatocyte nuclear factor 1-alpha)
72573	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
72584	Cullin family
72590	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
72599	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
72605	Eukaryotic-type carbonic anhydrase
72607	Ubiquitin carboxyl-terminal hydrolase family 2
72607	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
72634	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
72634	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
72650	Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons
72654	Arginyl tRNA synthetase N terminal domain. This domain is found at the amino terminus of Arginyl tRNA synthetase, also called additional domain 1 (Add-1). It is about 140 residues long and it has been suggested that this domain will be involved in tRNA r
72654	Arginyl tRNA synthetase N terminal domain. This domain is found at the amino terminus of Arginyl tRNA synthetase, also called additional domain 1 (Add-1). It is about 140 residues long and it has been suggested that this domain will be involved in tRNA re
72662	RNB-like protein. The function of this region of similarity is uncertain
72667	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
72667	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
72674	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h
72674	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
72685	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
72686	Ubiquitin carboxyl-terminal hydrolase family 2
72727	Glycosyltransferase family 43
72736	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
72739	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
72739	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
72739	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
72749	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
72776	Uncharacterized ACR, COG1579
72776	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
72776	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
72778	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
72790	Single-strand binding protein family
72823	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
72825	SAND family protein. Members of this family are uncharacterised proteins that have been called SAND proteins. These protein do not contain a SAND domain. The members of this family are 500-600 amino acids long and contain several conserved motifs
72828	Phosphoglycerate mutase family
72828	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
72828	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
72828	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
72843	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
72844	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
72852	Metallo-beta-lactamase superfamily
72895	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
72900	Respiratory-chain NADH dehydrogenase 24 Kd subunit
72925	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
72927	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
72938	Anaphase-promoting complex, subunit 10 (APC10)
72949	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
72958	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
72960	Eukaryotic DNA topoisomerase I, DNA binding fragment. Topoisomerase I promotes the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replication, transcription, and rec
72960	Eukaryotic DNA topoisomerase I, catalytic core. Topoisomerase I promotes the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replication, transcription, and recombina
72960	pfam02919, Topoisomer_I_N, Eukaryotic DNA topoisomerase I, DNA binding fragment. Topoisomerase I promotes the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replicat
72961	Sugar (and other) transporter
72978	Cornichon protein
72993	PH domain. PH stands for pleckstrin homology
72993	Phosphotyrosine interaction domain (PTB/PID)
73010	7 transmembrane receptor (rhodopsin family)
73062	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
73062	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
73068	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
73075	Cyclophilin type peptidyl-prolyl cis-trans isomerase
73078	Insulinase (Peptidase family M16)
73086	Protein kinase domain
73086	Protein kinase C terminal domain
73094	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
73095	Mitochondrial carrier protein
73102	Sugar (and other) transporter
73106	Trypsin
73130	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
73132	Mitochondrial carrier protein
73132	Mitochondrial carrier protein
73159	Cadherin domain
73162	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
73166	Ergosterol biosynthesis ERG4/ERG24 family
73173	Cadherin domain
73178	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
73178	WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin
73178	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl
73181	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
73192	Exportin-t. Exportin-t is a specific mediator of tRNA export. RanGTP-binding, importin beta-related factor with predominantly nuclear localization. It shuttles rapidly between nucleus and between nucleus and cytoplasm and interacts with nuclear pore compl
73218	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
73218	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
73218	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
73230	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
73244	Somatotropin hormone family
73246	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
73250	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
73251	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
73254	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
73254	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
73254	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
73296	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
73296	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
73316	Calreticulin family
73316	Calreticulin family
73327	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
73332	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
73333	Mitochondrial carrier protein
73333	Mitochondrial carrier protein
73338	Mab-21 protein
73341	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
73341	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
73353	Actin
73360	Actin
73368	Fibronectin type III domain
73368	Thrombospondin N-terminal -like domain
73368	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
73379	LCCL domain
73379	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
73382	Trypsin
73412	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
73412	Nucleoside diphosphate kinase
73430	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
73430	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
73442	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
73447	WD domain, G-beta repeat
73458	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
73547	Protein-tyrosine phosphatase
73547	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
73626	Trypsin
73626	Trypsin
73642	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
73642	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
73647	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
73649	Cytochrome b561
73656	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
73658	Sugar (and other) transporter
73674	WD domain, G-beta repeat
73680	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
73682	L1 transposable element
73682	L1 transposable element
73683	WD domain, G-beta repeat
73699	HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats
73699	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
73707	Protein kinase domain
73707	Adenylate and Guanylate cyclase catalytic domain
73713	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
73724	Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily
73732	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
73744	Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism
73748	Pyridoxal-dependent decarboxylase conserved domain
73804	Kinesin motor domain
73809	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
73825	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
73826	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
73834	ATP synthase subunit D. This is a family of subunit D form various ATP synthases including V-type H+ transporting and Na+ dependent. Subunit D is suggested to be an integral part of the catalytic sector of the V-ATPase
73859	Inward rectifier potassium channel
73902	Proteasome A-type and B-type
73910	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
73914	Death domain
73914	Protein kinase domain
73940	Extracellular link domain
73983	7 transmembrane receptor (metabotropic glutamate family)
73988	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
73991	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
73991	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
74002	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
74006	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
74007	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
74007	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
74008	Sulfatase
74011	Mitochondrial carrier protein
74018	Vacuolar sorting protein 9 (VPS9) domain
74020	C2 domain
74022	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
74030	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
74032	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
74038	ENV polyprotein (coat polyprotein)
74038	ENV polyprotein (coat polyprotein)
74042	jmjC domain
74042	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
74044	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
74044	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
74044	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u
74051	NADP oxidoreductase coenzyme F420-dependent
74055	C2 domain
74071	Intermediate filament tail domain
74090	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
74102	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
74105	VHS domain. Domain present in VPS-27, Hrs and STAM
74105	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
74105	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
74105	Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found
74108	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
74111	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
74114	Choline/Carnitine o-acyltransferase
74121	Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA
74127	Intermediate filament protein
74127	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
74129	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
74134	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
74136	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
74136	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
74140	Endomembrane protein 70
74142	NB-ARC domain
74142	ATP-dependent protease La (LON) domain
74142	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
74142	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
74143	Dynamin family
74145	Transglutaminase family
74145	Transglutaminase family, C-terminal ig like domain
74145	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
74149	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
74153	Repeat in ubiquitin-activating (UBA) protein
74153	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
74153	Repeat in ubiquitin-activating (UBA) protein
74153	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
74157	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
74157	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl
74158	Josephin
74159	Acyl CoA binding protein
74176	Transglutaminase family
74176	Transglutaminase family, C-terminal ig like domain
74176	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
74180	von Willebrand factor type D domain
74180	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
74180	von Willebrand factor type D domain
74180	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
74180	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
74180	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
74182	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
74182	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
74186	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
74187	WD domain, G-beta repeat
74187	Cytochrome D1 heme domain. Cytochrome cd1 (nitrite reductase) catalyses the conversion of nitrite to nitric oxide in the nitrogen cycle. This family represents the d1 heme binding domain of cytochrome cd1, in which His/Tyr side chains ligate the d1 heme i
74188	Somatotropin hormone family
74191	7 transmembrane receptor (rhodopsin family)
74195	Radical SAM superfamily
74197	TFIIE alpha subunit. The general transcription factor TFIIE has an essential role in eukaryotic transcription initiation together with RNA polymerase II and other general factors. Human TFIIE consists of two subunits TFIIE-alpha and TFIIE-beta and joins
74199	LCCL domain
74199	von Willebrand factor type A domain
74202	LIM domain. This family represents two copies of the LIM structural domain
74205	AMP-binding enzyme
74213	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
74213	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
74213	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
74215	Trypsin
74222	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
74229	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h
74237	Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin
74238	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
74244	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
74245	Glycosyl hydrolases family 18
74246	Glutamate/Leucine/Phenylalanine/Valine dehydrogenase
74246	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
74246	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
74246	Isoflavone reductase. This is a family of isoflavone reductases from plants. Isoflavone reductase enzymes EC:1.3.1.45 catalyse the penultimate step in the synthesis of the phytoalexin medicarpin
74246	Polysaccharide biosynthesis protein. This is a family of diverse bacterial polysaccharide biosynthesis proteins including the CapD protein, WalL protein mannosyl-transferase and several putative epimerases (e.g. WbiI)
74246	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
74251	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
74251	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
74253	Lectin C-type domain. This family includes both long and short form C-type
74254	Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity
74256	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
74257	Tetraspanin family
74270	Ubiquitin carboxyl-terminal hydrolase family 2
74270	Ubiquitin carboxyl-terminal hydrolases family 2
74270	Zn-finger in ubiquitin-hydrolases and other protein
74271	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
74276	PMP-22/EMP/MP20/Claudin family
74279	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
74286	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
74306	Trypsin
74309	Oxysterol-binding protein
74315	Zinc finger, C3HC4 type (RING finger)
74316	HesB-like domain. This family includes HesB which may be involved in nitrogen fixation
74322	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
74325	Clathrin light chain
74326	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
74330	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
74338	Sodium:neurotransmitter symporter family
74352	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
74354	Guanylate kinase
74365	Zinc finger, C3HC4 type (RING finger)
74365	ATP-dependent protease La (LON) domain
74375	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
74377	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
74385	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
74409	Hyaluronidase
74410	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
74411	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
74419	Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site
74419	Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases
74419	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
74419	D-xylulose 5-phosphate/D-fructose 6-phosphate phosphoketolase. Bacterial enzyme splits fructose-6-P and/or xylulose-5-P with the aid of inorganic phosphate into either acetyl-P and erythrose-4-P and/or acetyl-P and glyeraldehyde-3-P EC:4.1.2.9, EC:4.1.2.2
74419	Transketolase, thiamine diphosphate binding domain. This family includes transketolase enzymes EC:2.2.1.1. and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit EC:1.2.4.4. Both these enzymes utilise thiamine pyrophosphate as a cofacto
74434	Helix-loop-helix DNA-binding domain
74437	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
74438	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
74438	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
74438	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
74441	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
74441	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
74448	ADP-ribosylation factor family
74448	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
74450	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
74478	PX domain. PX domains bind to phosphoinositides
74479	PX domain. PX domains bind to phosphoinositides
74480	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
74483	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
74483	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
74486	Oxysterol-binding protein
74488	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
74490	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
74493	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
74493	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
74493	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
74499	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
74513	CUB domain
74519	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
74533	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
74551	Phosphoenolpyruvate carboxykinase. Catalyses the formation of phosphoenolpyruvate by decarboxylation of oxaloacetate
74558	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
74558	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
74559	GNS1/SUR4 family
74561	Homeobox domain
74562	C2 domain
74563	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
74569	TPR Domain
74570	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
74571	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
74574	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
74577	Glycosyl hydrolases family 35
74581	PH domain. PH stands for pleckstrin homology
74585	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
74585	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
74589	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
74596	Phosphatidate cytidylyltransferase
74616	Peptidase family U34
74617	Serine carboxypeptidase
74649	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
74653	Protein kinase domain
74670	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
74672	Carboxylesterase
74675	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
74685	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
74686	Glycosyl hydrolases family 18
74686	Mitochondrial carrier protein
74694	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
74711	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
74718	PX domain. PX domains bind to phosphoinositides
74720	PMP-22/EMP/MP20/Claudin family
74729	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
74729	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
74732	Syntaxin
74734	ADP-ribosylation factor family
74734	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
74734	ADP-ribosylation factor family
74734	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
74735	B-box zinc finger
74753	Uncharacterised protein family UPF0066
74754	FAD binding domain. This family consists of various enzymes that use FAD as a co-factor, most of the enzymes are similar to oxygen oxidoreductase. One of the enzymes Vanillyl-alcohol oxidase (VAO) has a solved structure, the alignment includes the FAD bi
74762	MAM domain. An extracellular domain found in many receptors
74763	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
74769	PI3-kinase family, p85-binding domain
74769	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
74769	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
74769	Phosphatidylinositol 3- and 4-kinase
74769	PI3-kinase family, p85-binding domain
74769	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
74769	Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation
74769	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
74770	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
74772	E1-E2 ATPase
74772	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
74776	Inorganic pyrophosphatase
74782	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
74841	Ubiquitin carboxyl-terminal hydrolase family 2
74843	MYND finger
74851	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
74854	Glycosyltransferase family 6
74901	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
74901	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
74915	ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit
74919	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
74919	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
74943	CUB domain
74943	Sushi domain (SCR repeat)
74946	Cell differentiation family, Rcd1-like. Two of the members in this family have been characterised as being involved in regulation of Ste11 regulated sex genes
75002	Trypsin
75010	Uncharacterized protein family UPF0005
75019	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
75030	WD domain, G-beta repeat
75079	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
75079	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
75097	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
75141	ADP-ribosylation factor family
75141	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
75142	BED zinc finger
75156	Lipase/Acylhydrolase with GDSL-like motif
75196	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
75196	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
75199	Homeobox domain
75209	Sugar (and other) transporter
75210	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
75219	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
75221	Peptidase family M49
75266	MAS20 protein import receptor
75267	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
75291	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
75291	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
75292	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
75292	Protein kinase domain
75292	PH domain. PH stands for pleckstrin homology
75292	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
75305	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
75305	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
75307	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
75320	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
75320	Domain of unknown function (DUF227). This family includes a large number of drosophila proteins of unknown function. The family also includes several C. elegans proteins. The alignment contains many histidines and aspartates that are conserved, suggesting
75320	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
75352	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
75387	Sir2 family
75388	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
75396	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
75404	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
75410	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
75416	Nop14-like family. Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production
75420	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
75422	Conserved hypothetical protein 95
75422	C-5 cytosine-specific DNA methylase
75422	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
75424	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
75452	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
75456	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
75465	Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn
75471	ADP-ribosylation factor family
75471	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
75477	Profilin
75482	Hsp20/alpha crystallin family
75497	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
75512	Glutathione peroxidase
75516	TPR Domain
75526	WAP-type (Whey Acidic Protein) 'four-disulfide core'
75552	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
75555	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
75560	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
75572	Acylphosphatase
75578	FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain
75578	FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain
75580	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
75590	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
75590	Protein-tyrosine phosphatase
75590	Putative tyrosine phosphatase family. This family consists of putative tyrosine phosphatase proteins, this function is inferred from several sequences at the top of the noise, EC:3.1.3.48
75590	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
75593	Domain of unknown function DUF143. This domain has no known function nor do any of the proteins that possess it. The aligned region is approximately 100 amino acids long
75596	Somatotropin hormone family
75605	jmjC domain
75605	jmjN domain
75605	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
75605	pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function
75605	jmjN domain
75605	jmjC domain
75605	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
75605	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
75605	C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function
75605	pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function
75612	Sulfatase
75612	Sulfatase
75617	S25 ribosomal protein
75617	S25 ribosomal protein
75625	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
75668	ADP-ribosylation factor family
75668	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
75668	ADP-ribosylation factor family
75668	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
75669	Protein kinase domain
75669	WD domain, G-beta repeat
75704	Carboxylesterase
75706	Intermediate filament protein
75712	Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised
75717	Cullin family
75718	von Willebrand factor type A domain
75725	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
75731	Shikimate kinase
75735	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
75739	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
75739	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
75739	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
75746	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
75746	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
75750	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the f
75764	Endo/excinuclease amino terminal domain. This domain has no known function it is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE
75767	C2 domain
75769	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
75772	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of
75778	Uncharacterized protein PaaI, COG2050
75778	Fungal specific domain of unknown function (DUF391). A family of uncharacterised fungal proteins
75785	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
75785	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
75792	WD domain, G-beta repeat
75812	Asparaginase
75828	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
75835	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
75841	Zinc finger, C3HC4 type (RING finger)
75847	Uncharacterized protein family UPF0007
75859	Glycosyltransferase family 6
75863	Lectin C-type domain. This family includes both long and short form C-type
75869	ADP-ribosylation factor family
75870	Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes,
75894	Adenosine/AMP deaminase
75911	7 transmembrane receptor (rhodopsin family)
75986	Arginase family
76074	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
76080	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
76080	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
76080	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
76080	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
76089	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
76089	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
76089	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
76113	Animal haem peroxidase
76130	Las1-like. Las1 is an essential nuclear protein involved in cell morphogenesis and cell surface growth
76135	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
76183	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
76187	Iron-containing alcohol dehydrogenase
76206	7 transmembrane receptor (rhodopsin family)
76222	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
76242	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
76251	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
76251	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
76251	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
76251	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
76251	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u
76257	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
76263	2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways . This family also contains members with functions other than HC
76266	Occludin/ELL family
76266	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
76267	Fatty acid desaturase
76267	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
76279	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
76282	Aminotransferase class I and II
76299	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
76303	Oxysterol-binding protein
76303	PH domain. PH stands for pleckstrin homology
76306	LacY proton/sugar symporter. This family is closely related to the sugar transporter family
76308	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
76338	ADP-ribosylation factor family
76338	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
76366	Translation initiation factor IF-3
76373	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
76376	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
76400	Phosphatidylethanolamine-binding protein
76429	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
76441	Formin Homology 2 Domain
76459	Eukaryotic-type carbonic anhydrase
76459	Eukaryotic-type carbonic anhydrase
76469	Fibronectin type III domain
76469	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
76485	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl
76487	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
76498	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h
76499	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
76507	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
76507	Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
76507	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog
76522	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
76526	Glu-tRNAGln amidotransferase C subunit. This is a family of Glu-tRNAGln amidotransferase C subunits. The Glu-tRNA Gln amidotransferase enzyme itself is an important translational fidelity mechanism replacing incorrectly charged Glu-tRNAGln with the correc
76561	PX domain. PX domains bind to phosphoinositides
76563	Amidase
76571	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
76572	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
76577	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
76577	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
76608	Anaphase-promoting complex, subunit 10 (APC10)
76608	Anaphase-promoting complex, subunit 10 (APC10)
76608	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
76615	Aminotransferase class I and II
76630	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has
76645	Galactose binding lectin domain
76645	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
76645	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
76645	REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c
76646	WD domain, G-beta repeat
76652	Actin
76668	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also membe
76686	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
76688	ADP-ribosylation factor family
76703	Zinc carboxypeptidase
76703	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol
76709	Arp2/3 complex, 34kD subunit p34-Arc. Arp2/3 protein complex has been implicated in the control of actin polymerization in cells. The human complex consists of seven subunits which include the actin related Arp2 and Arp3, and five others referred to as p4
76722	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
76722	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
76737	Giardia variant-specific surface protein
76742	PX domain. PX domains bind to phosphoinositides
76742	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
76742	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
76748	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
76763	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
76763	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
76763	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
76768	Alkaline phosphatase
76770	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
76788	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
76793	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
76793	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
76795	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
76795	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
76804	jmjC domain
76804	jmjN domain
76808	Ribosomal L18ae protein family
76813	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
76834	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
76843	WD domain, G-beta repeat
76846	Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However
76846	S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea
76850	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
76850	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
76856	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
76856	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
76857	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
76863	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
76866	MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000
76867	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
76877	ADP-ribosylation factor family
76877	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
76890	Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown
76892	Branched-chain amino acid transport system / permease component. This is a large family mainly comprising high-affinity branched-chain amino acid transporter proteins such as E. coli LivH and LivM both of which are form the LIV-I transport system. Also f
76893	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
76898	Glycosyltransferase family 43
76898	Glycosyltransferase family 43
76899	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
76899	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
76901	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
76905	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
76915	Mnd1 family. This family of proteins includes MND1 from S. cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair
76916	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
76943	Saposin A-type domain
76947	Squalene/phytoene synthase
76948	Zinc finger, C3HC4 type (RING finger)
76954	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
76954	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
76954	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
76954	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
76954	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
76954	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
76954	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
76954	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
76954	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
76965	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
76969	Sulfotransferase protein
76971	Sulfotransferase protein
76974	Transthyretin precursor (formerly prealbumin). Transthyretin is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. Mutations in the human transthyretin are associated with several genetic disorders
76980	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
76983	Sec1 family
77015	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
77018	Collagen triple helix repeat (20 copies)
77031	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
77031	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
77032	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
77038	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
77042	Hyaluronidase
77047	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
77055	Intermediate filament protein
77055	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
77057	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
77087	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
77087	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
77090	Occludin/ELL family
77113	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
77113	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
77116	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
77254	Hrf1 family. This family includes a number of eukaryotic proteins. It is an integral membrane protein, conserved in at least 1 copy in all sequenced eukaryotes. The gene name in S. pombe is hrf1+ for Heavy metal Resistance Factor 1
77286	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
77286	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
77305	WD domain, G-beta repeat
77318	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
77318	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
77337	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
77371	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24
77407	ADP-ribosylation factor family
77407	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
77407	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
77407	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
77411	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
77432	Palmitoyl protein thioesterase
77480	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
77485	Protein kinase domain
77485	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
77485	Protein kinase domain
77485	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
77527	Cyclic nucleotide-binding domain
77531	Phosphotyrosine interaction domain (PTB/PID)
77532	CENP-B protein. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding domain, which binds to a corresponding 17bp CENP-B box sequence. In the C-terminal 59 residues,
77532	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
77559	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str
77569	LIM domain. This family represents two copies of the LIM structural domain
77573	Sec1 family
77577	Sugar (and other) transporter
77577	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
77579	Myosin head (motor domain)
77579	Myosin head (motor domain)
77579	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
77579	Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown
77579	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
77579	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
77579	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
77583	Pectinacetylesterase
77590	Sulfotransferase protein
77595	EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains
77595	Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins
77596	7 transmembrane receptor (Secretin family)
77605	Core histone H2A/H2B/H3/H4
77652	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
77652	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
77669	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
77683	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
77701	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
77704	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
77704	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
77705	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
77705	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family
77739	Thrombospondin type 1 domain
77754	CDC45-like protein. CDC45 is an essential gene required for initiation of DNA replication in S. cerevisiae, forming a complex with MCM5/CDC46. Homologs of CDC45 have been identified in human, mouse and smut fungus among others
77782	DNA polymerase family A
77782	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
77799	SH2 domain
77864	Yippee putative zinc-binding protein
77935	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
77951	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
77974	S-adenosyl-L-homocysteine hydrolase
77974	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
77976	Protein kinase domain
77987	Sec63 domain
77987	pfam02889, Sec63, Sec63 domain
77996	CutA1 divalent ion tolerance protein. Several gene loci with a possible involvement in cellular tolerance to copper have been identified. One such locus in eubacteria and archaebacteria, cutA, is thought to be involved in cellular tolerance to a wide vari
77998	Galactoside-binding lectin
78016	Protein of unknown function (DUF390). This family of long proteins are currently only found in the rice genome. They have no known function. However they may be some kind of transposable element
78016	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
78038	Acetyl co-enzyme A carboxylase carboxyltransferase alpha subunit. Acetyl co-enzyme A carboxylase carboxyltransferase is composed of an alpha and beta subunit
78038	Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction
78070	Choline/Carnitine o-acyltransferase
78076	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
78081	SCP-like extracellular protein
78081	SCP-like extracellular protein. This domain is also found in prokaryotes
78088	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
78088	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
78088	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
78134	7 transmembrane receptor (rhodopsin family)
78134	7 transmembrane receptor (rhodopsin family)
78239	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
78240	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
78246	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
78249	7 transmembrane receptor (Secretin family)
78249	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
78251	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
78255	PH domain. PH stands for pleckstrin homology
78267	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
78285	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
78285	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
78286	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
78286	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
78287	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
78294	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
78294	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis o
78304	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
78354	Phospholipase A2 inhibitor
78369	Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes,
78373	NUDIX domain
78388	Major Vault Protein repeat. The vault is a ubiquitous and highly conserved ribonucleoprotein particle of approximately 13 mDa of unknown function. This family corresponds to a repeat found in the amino terminal half of the major vault protein
78394	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
78394	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
78405	Nerve growth factor family
78416	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
78444	Pyroglutamyl peptidase
78455	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
78455	UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near t
78507	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
78509	Plus-3 domain. This domain is about 90 residues in length and is often found associated with the pfam02213 domain. The function of this domain is uncertain. It is possible that this domain is involved in DNA binding as it has three conserved positively ch
78514	PH domain. PH stands for pleckstrin homology
78514	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
78514	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
78523	Ribosomal protein L9, N-terminal domain
78560	7 transmembrane receptor (Secretin family)
78560	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
78609	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
78609	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family
78618	PH domain. PH stands for pleckstrin homology
78618	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
78618	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
78618	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
78619	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
78619	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
78625	AMP-binding enzyme
78651	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
78653	BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems
78658	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
78670	PH domain. PH stands for pleckstrin homology
78670	PH domain. PH stands for pleckstrin homology
78689	Mak10 subunit, NatC N(alpha)-terminal acetyltransferase. NatC N(alpha)-terminal acetyltransferases contains Mak10p, Mak31p and Mak3p subunits. All three subunits are associated with each other to form the active complex
78697	Uncharacterized protein family UPF0024
78709	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
78753	ab-hydrolase associated lipase region
78753	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
78754	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
78771	C2 domain
78783	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
78783	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
78797	Flavodoxin
78797	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
78797	FAD binding domain. This domain is found in sulfite reductase, NADPH cytochrome P450 reductase, Nitric oxide synthase and methionine synthase reductase
78798	WD domain, G-beta repeat
78798	HELP domain. The HELP (Hydrophobic ELP) domain is found in EMAP and EMAP-like proteins (ELPs). Although called a domain it contains a predicted transmembrane helix and may not form a globular domain. It is also not clear if these proteins localize to memb
78798	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes.
78803	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
78803	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
78808	Synaptobrevin
78808	Synaptobrevin
78816	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
78816	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
78826	7 transmembrane receptor (rhodopsin family)
78829	TSC-22/dip/bun family
78830	Mitochondrial carrier protein
78892	LCCL domain
78912	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
78912	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
78920	2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain
78920	Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins
78923	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
78923	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
78925	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
78928	Gpi16 subunit, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to ne
78928	Gpi16 subunit, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to new
78940	Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. H
78947	Mannosyl oligosaccharide glucosidase. This is a family of eukaryotic enzymes belonging to glycosyl hydrolase family 63. They catalyse the specific cleavage of the non-reducing terminal glucose residue from Glc(3)Man(9)GlcNAc(2). Mannosyl oligosaccharide
78951	Hsp20/alpha crystallin family
78956	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
78957	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
78959	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
78962	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
78963	NB-ARC domain
78963	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
78964	7 transmembrane receptor (rhodopsin family)
78969	Protein kinase domain
78971	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
78976	7 transmembrane receptor (rhodopsin family)
78980	FecCD transport family
78985	7 transmembrane receptor (rhodopsin family)
78986	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
78989	Lectin C-type domain. This family includes both long and short form C-type
78999	Fibronectin type III domain
78999	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
79000	Lipid-A-disaccharide synthetase. This is a family of lipid-A-disaccharide synthetases
79008	Endo/excinuclease amino terminal domain. This domain has no known function it is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE
79008	Endo/excinuclease amino terminal domain. This domain has no known function it is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE
79020	GMC oxidoreductase. This family of proteins bind FAD as a cofactor
79027	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
79031	Phosducin
79033	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
79036	Protein of unknown function DUF52
79043	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
79047	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
79048	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
79049	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euk
79050	CBF/Mak21 family
79053	ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum
79054	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
79058	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
79059	Nucleoside diphosphate kinase
79065	Autophagy protein Apg9. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg9 plays a direct role in the formation of the cytoplasm to vacuole targeti
79071	GNS1/SUR4 family
79083	Rabphilin-3A effector domain
79085	Mitochondrial carrier protein
79087	Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability
79089	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
79092	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
79092	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
79095	Uncharacterised protein family (UPF0184)
79107	Animal haem peroxidase
79107	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
79113	SH2 domain
79113	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
79115	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mG
79123	Amiloride-sensitive sodium channel
79124	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
79125	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
79127	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
79128	Phosphotyrosine interaction domain (PTB/PID)
79131	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
79148	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
79148	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
79148	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
79148	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
79148	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
79149	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
79152	Fatty acid hydroxylase
79152	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
79153	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
79156	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
79175	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
79177	Zinc finger
79178	Adenylate cyclase. One member was found to complement mutants of E. coli. cya. This protein has been shown to be an adenylate kinase
79183	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
79183	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
79191	Homeobox domain
79192	Homeobox domain
79196	Oxysterol-binding protein
79196	Oxysterol-binding protein
79196	PH domain. PH stands for pleckstrin homology
79201	TNFR/NGFR cysteine-rich region
79208	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
79211	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
79211	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
79212	Sodium:neurotransmitter symporter family
79213	Sodium:neurotransmitter symporter family
79214	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
79217	Connexin
79219	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
79220	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
79222	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
79224	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
79225	Helix-loop-helix DNA-binding domain
79237	Homeobox domain
79238	HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase
79239	Heme oxygenase
79240	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
79240	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
79242	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
79243	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
79244	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
79244	MaoC like domain. The MaoC protein is found to share similarity with a wide variety of enzymes
79244	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
79246	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
79246	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
79247	7 transmembrane receptor (rhodopsin family)
79249	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
79252	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
79252	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
79254	Ferrous iron transport protein B
79255	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
79257	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The
79339	7 transmembrane receptor (rhodopsin family)
79345	7 transmembrane receptor (rhodopsin family)
79361	Syntaxin
79362	Helix-loop-helix DNA-binding domain
79363	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
79365	Helix-loop-helix DNA-binding domain
79368	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
79369	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
79369	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
79370	Apoptosis regulator proteins, Bcl-2 family
79370	Apoptosis regulator proteins, Bcl-2 family
79400	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
79411	Glycosyl hydrolases family 35
79414	Fibronectin type III domain
79414	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
79423	Stathmin family
79425	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
79430	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
79430	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
79431	Helix-loop-helix DNA-binding domain
79433	Myosin head (motor domain)
79443	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
79449	Ribosomal protein L21e
79451	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
79455	Phosducin
79462	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
79463	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1
79465	Class I Histocompatibility antigen, domains alpha 1 and 2
79541	7 transmembrane receptor (rhodopsin family)
79557	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
79557	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
79558	Protein kinase domain
79558	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
79563	GrpE
79589	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
79602	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h
79603	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
79623	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
79629	Occludin/ELL family
79634	TT viral orf 1. TT virus (TTV)
79637	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
79644	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
79646	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
79648	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
79651	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
79660	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase,
79661	Formamidopyrimidine-DNA glycosylase
79666	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
79668	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
79679	Protein of unknown function (DUF431)
79695	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
79705	Protein kinase domain
79709	Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of va
79714	Ezrin/radixin/moesin family
79716	Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase
79717	DNA / pantothenate metabolism flavoprotein. The DNA/pantothenate metabolism flavoprotein (EC:4.1.1.36) affects synthesis of DNA, and pantothenate metabolism
79723	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
79726	WD domain, G-beta repeat
79727	'Cold-shock' DNA-binding domain
79734	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
79739	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
79748	Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-
79751	Mitochondrial carrier protein
79753	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
79754	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
79758	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
79763	Isochorismatase family. This family are hydrolase enzymes
79765	GTPase of unknown function
79770	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
79776	Homeobox domain
79777	Acyl CoA binding protein
79778	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
79781	ATPase family associated with various cellular activities (AAA)
79784	Myosin head (motor domain)
79784	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
79786	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
79786	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
79788	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
79796	Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability
79799	UDP-glucoronosyl and UDP-glucosyl transferase
79807	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
79813	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
79817	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
79829	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
79842	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
79856	PX domain. PX domains bind to phosphoinositides
79862	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
79871	Domain of Unknown Function (DUF408)
79874	Rabaptin
79876	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
79877	Dephospho-CoA kinase. This family catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction
79901	Cytochrome b561
79902	Chitinase
79903	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
79912	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
79915	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
79920	F-box domain
79923	Homeobox domain
79930	PTB domain (IRS-1 type)
79935	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
79937	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
79948	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
79957	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h
79958	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
79958	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
79958	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
79961	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
79961	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
79961	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
79973	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
79974	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
79984	Carboxylesterase
79990	PH domain. PH stands for pleckstrin homology
79993	GNS1/SUR4 family
80011	Bacterial surface antigen. This entry includes the following surface antigens; D15 antigen from H.influenzae
80012	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
80019	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
80024	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
80025	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
80025	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
80025	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
80025	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
80025	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
80025	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
80032	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
80036	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
80052	Ribosomal protein S5, C-terminal domain
80055	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
80068	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain
80068	pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
80070	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
80070	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
80086	Tubulin/FtsZ family. This family includes the tubulin alpha
80117	ADP-ribosylation factor family
80125	C2 domain
80128	B-box zinc finger
80142	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
80144	Calx-beta domain
80144	Giardia variant-specific surface protein
80144	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
80146	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
80148	PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How
80153	YjeF-related protein N-terminus
80156	Aldo/keto reductase family
80168	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
80201	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
80204	F-box domain
80204	F-box domain
80204	F-box domain
80205	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
80216	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a
80223	C2 domain
80230	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
80230	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
80235	Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability
80237	Occludin/ELL family
80262	Uncharacterised protein family (UPF0183). This family of proteins includes Lin-10 from C. elegans
80263	B-box zinc finger
80267	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
80267	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
80270	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
80271	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
80273	GrpE
80281	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
80283	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
80283	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
80290	7 transmembrane receptor (rhodopsin family)
80296	Chlorophyll A-B binding protein
80297	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
80298	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
80307	C2 domain
80307	C2 domain
80310	CUB domain
80310	CUB domain
80311	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
80311	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
80312	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
80315	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
80325	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
80325	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
80325	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
80326	wnt family
80328	Class I Histocompatibility antigen, domains alpha 1 and 2
80329	Class I Histocompatibility antigen, domains alpha 1 and 2
80331	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
80332	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
80332	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
80332	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
80332	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
80334	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
80338	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
80338	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
80339	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o
80346	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
80347	Dephospho-CoA kinase. This family catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction
80350	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
80350	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
80351	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
80704	Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog
80706	7 transmembrane receptor (rhodopsin family)
80712	Homeobox domain
80714	PBX domain. The PBX domain is a bipartite acidic domain
80718	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
80720	PBX domain. The PBX domain is a bipartite acidic domain
80721	Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog
80722	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
80728	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
80728	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
80732	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
80733	Eukaryotic-type carbonic anhydrase
80733	Eukaryotic-type carbonic anhydrase
80733	Eukaryotic-type carbonic anhydrase
80740	u-PAR/Ly-6 domain
80745	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
80749	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
80754	Rabaptin
80765	START domain
80765	START domain
80773	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
80774	LIM domain. This family represents two copies of the LIM structural domain
80777	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
80781	Thrombospondin N-terminal -like domain
80781	Thrombospondin N-terminal -like domain
80781	Thrombospondin N-terminal -like domain
80790	PH domain
80794	CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and
80794	CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily co
80794	CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conser
80796	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
80820	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
80821	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera
80824	Protein-tyrosine phosphatase
80824	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
80824	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
80834	7 transmembrane receptor (metabotropic glutamate family)
80834	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
80835	7 transmembrane receptor (metabotropic glutamate family)
80835	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
80835	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
80835	7 transmembrane receptor (metabotropic glutamate family)
80835	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
80835	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
80838	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
80840	7 transmembrane receptor (rhodopsin family)
80841	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
80843	eubacterial secY protein
80848	CUB domain
80848	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
80852	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
80853	jmjC domain
80854	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
80857	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
80859	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
80860	GH3 auxin-responsive promoter
80877	Beige/BEACH domain
80878	Monocarboxylate transporter
80878	Sugar (and other) transporter
80879	Monocarboxylate transporter
80883	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
80884	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
80885	7 transmembrane receptor (rhodopsin family)
80886	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
80888	Hsp20/alpha crystallin family
80890	B-box zinc finger
80890	Filamin/ABP280 repeat
80890	Zinc finger, C3HC4 type (RING finger)
80890	Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine
80890	NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in at least one member, proteolysis has shown that the Peptid
80890	NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in some members, proteolysis has shown that the Peptidyl-alph
80891	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
80892	Homeobox domain
80897	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
80898	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
80899	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
80899	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
80900	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
80900	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
80901	7 transmembrane receptor (rhodopsin family)
80903	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
80905	impB/mucB/samB family. These proteins are involved in UV protection
80908	Glycosyltransferase family 6
80910	7 transmembrane receptor (rhodopsin family)
80911	Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA
80978	7 transmembrane receptor (rhodopsin family)
80979	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
80981	ADP-ribosylation factor family
80985	B-box zinc finger
81003	B-box zinc finger
81003	ADP-ribosylation factor family
81003	B-box zinc finger
81003	ADP-ribosylation factor family
81003	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
81003	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
81006	7 transmembrane receptor (rhodopsin family)
81010	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
81011	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
81012	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
81013	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
81014	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
81015	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
81016	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
81017	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
81025	Connexin
81027	Tubulin/FtsZ family
81029	wnt family
81029	wnt family
81031	Sugar (and other) transporter
81033	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
81033	Cyclic nucleotide-binding domain
81034	Mitochondrial carrier protein
81285	7 transmembrane receptor (rhodopsin family)
81370	7 transmembrane receptor (rhodopsin family)
81472	7 transmembrane receptor (rhodopsin family)
81490	Phosphatidyl serine synthase. Phosphatidyl serine synthase is also known as serine exchange enzyme. This family represents eukaryotic PSS I and II which are membrane bound proteins which catalyses the replacement of the head group of a phospholipid (phos
81491	7 transmembrane receptor (rhodopsin family)
81494	Sushi domain (SCR repeat)
81500	Eukaryotic aspartyl protease
81502	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
81502	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
81502	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
81502	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
81503	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
81504	SH2 domain
81504	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
81505	PMP-22/EMP/MP20/Claudin family
81506	ADP-ribosyl cyclase. ADP-ribosyl cyclase EC:3.2.2.5 (also know as cyclic ADP-ribose hydrolase or CD38) synthesizes cyclic-ADP ribose, a second messenger for glucose-induced insulin secretion
81507	Protein kinase domain
81507	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
81508	Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13,
81509	7 transmembrane receptor (rhodopsin family)
81512	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
81514	Fibronectin type III domain
81514	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
81515	SH2 domain
81515	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
81516	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
81516	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
81517	PTN/MK heparin-binding protein family
81518	SRF-type transcription factor (DNA-binding and dimerisation domain)
81519	GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis o
81521	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
81521	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
81527	CUB domain
81527	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
81529	3'5'-cyclic nucleotide phosphodiesterase
81530	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
81535	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
81536	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
81537	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
81539	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
81542	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
81551	Stathmin family
81557	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
81557	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
81557	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
81559	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
81562	Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-
81567	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
81567	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
81574	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
81578	von Willebrand factor type A domain
81578	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
81600	Glycosyl hydrolases family 18
81600	Chitin binding Peritrophin-A domain. This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruse
81601	MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases
81606	Phosphorylase family
81609	PX domain. PX domains bind to phosphoinositides
81617	Mo25 protein family
81618	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
81619	Tetraspanin family
81628	TSC-22/dip/bun family
81630	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
81630	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
81630	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
81631	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
81632	Aminotransferase class-III
81635	Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this r
81636	Adenylate and Guanylate cyclase catalytic domain
81637	Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site
81637	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
81637	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
81638	7 transmembrane receptor (rhodopsin family)
81639	Lipoxygenase
81640	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermi
81641	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
81644	7 transmembrane receptor (rhodopsin family)
81645	7 transmembrane receptor (rhodopsin family)
81646	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
81646	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
81646	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
81646	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
81650	Casein kinase II regulatory subunit
81651	Laminin G domain
81653	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
81654	e3 binding domain. This family represents a small domain of the E2 subunit of 2-oxo-acid dehydrogenases responsible for the binding of the E3 subunit
81654	2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain
81654	Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins
81657	7 transmembrane receptor (metabotropic glutamate family)
81657	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
81658	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
81658	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
81659	Protein-tyrosine phosphatase
81659	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
81661	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
81662	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
81663	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
81664	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
81666	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
81667	WD domain, G-beta repeat
81668	7 transmembrane receptor (rhodopsin family)
81672	7 transmembrane receptor (metabotropic glutamate family)
81672	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
81674	Transcription initiation factor IIF, beta subunit. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIF (TFIIF) is a tetramer of two beta subu
81677	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
81678	MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function
81678	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
81679	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
81683	Macrophage migration inhibitory factor (MIF)
81684	Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but fun
81686	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
81686	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
81686	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
81687	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
81687	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
81696	7 transmembrane receptor (rhodopsin family)
81697	7 transmembrane receptor (rhodopsin family)
81707	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
81707	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
81707	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
81708	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
81710	Homeobox domain
81721	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
81721	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
81724	Methylenetetrahydrofolate reductase. This family includes the 5,10-methylenetetrahydrofolate reductase EC:1.7.99.5 from bacteria and methylenetetrahydrofolate reductase EC: 1.5.1.20 from eukaryotes. The structure for this domain is known to be a TIM barre
81725	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
81729	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
81733	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
81734	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
81735	Fibronectin type III domain
81735	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
81737	Nerve growth factor family
81738	Conserved nucleoporin domain
81739	ATP P2X receptor
81742	3'5'-cyclic nucleotide phosphodiesterase
81743	3'5'-cyclic nucleotide phosphodiesterase
81749	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
81750	Laminin G domain
81751	Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans
81752	7 transmembrane receptor (rhodopsin family)
81753	7 transmembrane receptor (Secretin family)
81754	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
81755	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
81756	ADP-ribosylation factor family
81756	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
81756	Periplasmic solute binding protein family. This family includes periplasmic solute binding proteins such as TroA that interacts with an ATP-binding cassette transport system in Treponema pallidum
81756	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
81758	Retinoblastoma-associated protein A domain. This domain has the cyclin fold as predicted
81758	Retinoblastoma-associated protein B domain. The crystal structure of the Rb pocket bound to a nine-residue E7 peptide containing the LxCxE motif, shared by other Rb-binding viral and cellular proteins, shows that the LxCxE peptide binds a highly conserve
81760	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
81761	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
81763	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
81764	Ribosomal L10
81765	Ribosomal protein L13e
81766	Eukaryotic ribosomal protein L18
81767	Ribosomal protein L19e
81768	Ribosomal L22e protein family
81769	Ribosomal protein L44
81770	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
81773	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
81774	Ribosomal protein S17
81775	Ribosomal protein S21e
81777	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
81777	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis o
81778	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
81779	7 transmembrane receptor (Secretin family)
81785	HMG (high mobility group) box
81785	Structure-specific recognition protein
81786	B-box zinc finger
81789	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d
81789	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
81792	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
81792	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
81793	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
81794	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
81794	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
81796	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
81796	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
81797	7 transmembrane receptor (rhodopsin family)
81799	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
81801	Stanniocalcin family
81802	Syntaxin
81804	Sec1 family
81805	Oxidoreductase molybdopterin binding domain. This domain is found in a variety of oxidoreductases. This domain binds to a molybdopterin cofactor. Xanthine dehydrogenases, that also bind molybdopterin, have essentially no similarity
81805	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
81806	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
81808	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
81808	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
81809	Transforming growth factor beta like domain
81809	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
81810	Protein kinase domain
81811	Erythropoietin/thrombopoietin
81812	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
81813	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
81815	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
81817	Helix-loop-helix DNA-binding domain
81818	Intermediate filament protein
81819	'Paired box' domain
81820	C2 domain
81822	Actin
81826	Phosphate transporter family. This family includes PHO-4 from Neurospora crassa which is a is a Na(+)-phosphate symporter. This family also contains the leukemia virus receptor
81828	Zona pellucida-like domain
81831	CUB domain
81832	CUB domain
81832	Low-density lipoprotein receptor domain class A
81832	CUB domain
81832	CUB domain
81832	Low-density lipoprotein receptor domain class A
81840	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
81844	Zinc finger, C3HC4 type (RING finger)
81846	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
81846	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
81846	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
81849	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
81853	Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised
81855	Tricarboxylate carrier
81869	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
81876	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
81877	Fibronectin type III domain
81879	Sterile alpha motif (SAM)/Pointed domain
81887	Las1-like. Las1 is an essential nuclear protein involved in cell morphogenesis and cell surface growth
81894	Mitochondrial carrier protein
81896	WD domain, G-beta repeat
81897	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
81904	PMP-22/EMP/MP20/Claudin family
81905	PMP-22/EMP/MP20/Claudin family
81905	PMP-22/EMP/MP20/Claudin family
81906	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
81906	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
81910	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
81919	Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes
81924	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
81925	Phosphatidate cytidylyltransferase
81930	Kinesin motor domain
83259	Cadherin domain
83259	Cadherin domain
83259	Cadherin domain
83379	Glycosyl hydrolase family 1
83383	Helix-loop-helix DNA-binding domain
83394	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester
83397	Protein kinase A anchor
83398	Sulfotransferase protein
83398	Sulfotransferase protein
83401	GNS1/SUR4 family
83409	Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn
83410	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
83422	pfam02891, zf-MIZ, MIZ zinc finger
83422	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
83425	Protein kinase A anchor
83429	PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How
83430	Interleukin-6/G-CSF/MGF family
83431	Ezrin/radixin/moesin family
83439	HMG (high mobility group) box
83447	Mitochondrial carrier protein
83453	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
83454	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl
83457	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
83462	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
83463	Helix-loop-helix DNA-binding domain
83465	Cadherin domain
83465	7 transmembrane receptor (Secretin family)
83465	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
83465	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
83465	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
83465	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
83466	Cadherin domain
83466	7 transmembrane receptor (Secretin family)
83466	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
83466	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
83468	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl
83469	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
83474	HMG (high mobility group) box
83476	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
83477	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
83481	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
83493	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
83497	Occludin/ELL family
83500	Sugar (and other) transporter
83501	Cadherin domain
83501	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
83502	Cadherin domain
83502	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
83503	RNA polymerase Rpb6. Rpb6 is an essential subunit in the eukaryotic polymerases Pol I, II and III. The bacterial equivalent to Rpb6 is the omega subunit. Rpb6 and omega are structurally conserved and both function in polymerase assembly
83504	Glycosyl hydrolase family 1
83505	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
83510	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
83511	Sodium:neurotransmitter symporter family
83512	Fatty acid desaturase
83512	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
83513	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
83513	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
83513	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
83514	TSC-22/dip/bun family
83515	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
83518	7 transmembrane receptor (rhodopsin family)
83522	Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA
83523	Inositol monophosphatase family
83529	Eukaryotic porin
83531	Eukaryotic porin
83532	Eukaryotic porin
83533	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
83535	Inward rectifier potassium channel
83536	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
83536	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
83537	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
83539	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
83540	Nuf2 family. Members of this family are components of the mitotic spindle. It has been shown that Nuf2 from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle. An arabidopsis protein h
83540	Nuf2 family. Members of this family are components of the mitotic spindle. It has been shown that Nuf2 from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle. An arabidopsis protein h
83546	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
83548	Sec34-like family. Sec34 and Sec35 form a sub-complex, in a seven protein complex that includes Dor1 (PFAM:PF04124). This complex is thought to be important for tether vesicles to the Golgi
83550	7 transmembrane receptor (rhodopsin family)
83551	7 transmembrane receptor (rhodopsin family)
83561	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
83563	Ubiquitin carboxyl-terminal hydrolase family 2
83564	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
83567	7 transmembrane receptor (rhodopsin family)
83568	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
83568	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
83571	Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase
83574	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
83574	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
83577	von Willebrand factor type A domain
83577	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
83577	von Willebrand factor type A domain
83577	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
83581	RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family inclu
83581	RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family
83582	RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase com
83583	Prenylated rab acceptor (PRA1)
83586	Helix-loop-helix DNA-binding domain
83588	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
83588	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
83589	Phosphotyrosine interaction domain (PTB/PID)
83589	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
83590	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
83592	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
83593	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
83593	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
83593	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
83595	HMG (high mobility group) box
83602	Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep
83602	Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep
83603	GNS1/SUR4 family
83610	Phosphotyrosine interaction domain (PTB/PID)
83610	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
83611	Phosphotyrosine interaction domain (PTB/PID)
83611	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
83612	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
83613	Intermediate filament protein
83614	pfam02891, zf-MIZ, MIZ zinc finger
83624	Cyclophilin type peptidyl-prolyl cis-trans isomerase
83627	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
83628	Tetraspanin family
83630	Homeobox domain
83632	MYND finger
83632	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
83633	7 transmembrane receptor (metabotropic glutamate family)
83633	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
83635	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
83642	Uncharacterized ACR, YdiU/UPF0061 family
83643	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
83657	Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn
83658	Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn
83658	Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn
83658	Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn
83659	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
83660	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
83661	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
83671	C2 domain
83672	C2 domain
83672	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
83672	C2 domain
83672	Hydantoinase B/oxoprolinase. This family includes N-methylhydaintoinase B which converts hydantoin to N-carbamyl-amino acids, and 5-oxoprolinase EC:3.5.2.9 which catalyses the formation of L-glutamate from 5-oxo-L-proline. These enzymes are part of the ox
83681	SH2 domain
83683	7 transmembrane receptor (rhodopsin family)
83685	C2 domain
83685	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
83686	Cyclic nucleotide-binding domain
83686	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
83688	Transaldolase
83690	LCCL domain
83691	LCCL domain
83694	MIT domain
83697	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
83702	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
83703	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte
83707	Domain of unknown function DUF60. The proteins in this family have no known function. Two proteins have been annotated as phosphotransferases, however this is not supported experimentally, and should be viewed with caution
83716	LCCL domain
83719	Yippee putative zinc-binding protein
83720	Cadherin domain
83720	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
83721	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
83721	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
83721	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The
83722	POLO box duplicated region
83727	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
83728	Cadherin domain
83728	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
83730	Synaptobrevin
83731	Calcium-activated BK potassium channel alpha subunit
83731	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
83733	Mitochondrial carrier protein
83734	Autophagocytosis associated protein, C-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole
83741	Transcription factor AP-2
83744	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
83756	7 transmembrane receptor (metabotropic glutamate family)
83756	Cell cycle protein. This entry includes the following members
83756	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
83758	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
83759	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members invol
83762	C2 domain
83764	Flotillin family. Flotillins are integral membrane proteins that have been shown to be present in several subcellular components, including caveolae (invaginated plasma membrane microdomains), lipid rafts (sphingolipid and cholesterol-rich, detergent-res
83765	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
83767	WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin
83768	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
83770	7 transmembrane receptor (metabotropic glutamate family)
83770	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
83771	7 transmembrane receptor (metabotropic glutamate family)
83771	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
83781	Galactoside-binding lectin
83782	Nucleoside diphosphate kinase
83783	Sulfotransferase protein
83788	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
83788	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
83790	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
83791	Polyprenyl synthetase
83796	BAF60b domain of the SWIB complex
83799	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
83800	Clathrin light chain
83803	pfam02922, isoamylase_N, Isoamylase N-terminal domain. This domain is found in a range of enzymes that act on branched substrates - isoamylase, pullulanase and branching enzyme. This family also contains the beta subunit of 5' AMP activated kinase
83805	SH2 domain
83805	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
83808	UDP-glucoronosyl and UDP-glucosyl transferase
83809	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
83810	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
83814	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
83817	Carboxylesterase
83819	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
83826	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
83826	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
83830	Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep
83833	BAF60b domain of the SWIB complex
83842	Choline/Carnitine o-acyltransferase
83844	Ubiquitin carboxyl-terminal hydrolase family 2
83849	C2 domain
83849	C2 domain
83850	C2 domain
83851	C2 domain
83851	C2 domain
83852	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
83858	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
83873	7 transmembrane receptor (rhodopsin family)
83875	Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc
83881	Homeobox domain
83882	Tetraspanin family
83884	Mitochondrial carrier protein
83885	Mitochondrial carrier protein
83887	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
83891	PX domain. PX domains bind to phosphoinositides
83892	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
83925	GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the d
83928	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
83931	Protein kinase domain
83931	Protein kinase domain
83937	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
83945	DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds
83945	DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh
83945	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
83954	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
83959	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
83965	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
83993	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
83995	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
83995	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
83995	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
83995	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases
83996	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
83996	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
83996	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
83999	WSC domain. This domain may be involved in carbohydrate binding
83999	WSC domain. This domain may be involved in carbohydrate binding
84002	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
84005	Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac
84006	Protein kinase domain
84006	Protein kinase C terminal domain
84007	7 transmembrane receptor (rhodopsin family)
84008	Fibrinogen beta and gamma chains, C-terminal globular domain
84008	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
84009	PH domain. PH stands for pleckstrin homology
84009	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
84010	Delta serrate ligand
84012	Sodium:dicarboxylate symporter family
84018	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
84019	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
84020	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
84020	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
84022	7 transmembrane receptor (rhodopsin family)
84023	7 transmembrane receptor (rhodopsin family)
84024	Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine
84024	Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis
84024	Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. H
84028	Lipase
84028	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
84029	FMN-dependent dehydrogenase
84030	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
84030	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
84031	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
84031	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
84032	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
84032	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
84035	WSC domain. This domain may be involved in carbohydrate binding
84036	Calcium-activated SK potassium channel
84036	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
84036	Calcium-activated SK potassium channel
84036	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
84036	Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-subunits and the Ca
84036	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-s
84046	HMG (high mobility group) box
84050	Somatomedin B domain
84050	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
84057	Mnd1 family. This family of proteins includes MND1 from S. cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair
84059	Calx-beta domain
84059	7 transmembrane receptor (Secretin family)
84059	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function,
84061	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
84072	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
84074	High molecular weight glutenin subunit. Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, gluteni
84078	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
84078	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
84079	Vacuolar sorting protein 9 (VPS9) domain
84083	HNH endonuclease
84085	F-box domain
84092	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
84100	ADP-ribosylation factor family
84100	ADP-ribosylation factor family
84101	Ubiquitin carboxyl-terminal hydrolases family 2
84101	Zn-finger in ubiquitin-hydrolases and other protein
84102	Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples ha
84105	Pterin 4 alpha carbinolamine dehydratase. Pterin 4 alpha carbinolamine dehydratase is also known as DCoH (dimerisation cofactor of hepatocyte nuclear factor 1-alpha)
84106	Vinculin family
84108	Zinc finger, C3HC4 type (RING finger)
84111	7 transmembrane receptor (rhodopsin family)
84112	7 transmembrane receptor (rhodopsin family)
84127	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
84132	Ubiquitin carboxyl-terminal hydrolase family 2
84132	Ubiquitin carboxyl-terminal hydrolases family 2
84134	Eukaryotic porin
84154	Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids
84159	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
84163	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
84164	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
84166	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
84171	Lysyl oxidase
84172	RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase com
84174	SH2 domain
84174	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
84174	SH2 domain
84174	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
84181	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
84188	Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included
84189	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
84193	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl
84197	Protein kinase domain
84203	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
84217	MYND finger
84220	RanBP1 domain
84220	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
84225	MYND finger
84236	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
84239	E1-E2 ATPase
84239	Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
84243	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
84249	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
84251	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
84253	Rap/ran-GAP
84253	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
84258	C2 domain
84259	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
84263	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
84263	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
84264	Metallo-beta-lactamase superfamily
84274	ubiE/COQ5 methyltransferase family
84275	Mitochondrial carrier protein
84277	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
84279	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
84280	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
84301	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
84306	Programmed cell death protein 2, C-terminal domain
84315	SAND family protein. Members of this family are uncharacterised proteins that have been called SAND proteins. These protein do not contain a SAND domain. The members of this family are 500-600 amino acids long and contain several conserved motifs
84320	Acyl CoA binding protein
84326	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
84342	Dor1-like family. Dor1 is involved in vesicle targeting to the yeast Golgi apparatus and complexes with a number of other trafficking proteins, which include Sec34 and Sec35
84347	PMP-22/EMP/MP20/Claudin family
84348	7 transmembrane receptor (rhodopsin family)
84349	TNF(Tumor Necrosis Factor) family
84350	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
84350	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
84352	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
84353	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
84355	Heavy-metal-associated domain
84357	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
84359	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
84360	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
84360	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh
84364	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
84377	Somatotropin hormone family
84383	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
84384	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
84385	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
84385	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
84386	WAP-type (Whey Acidic Protein) 'four-disulfide core'
84390	Cyclic nucleotide-binding domain
84390	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
84391	Kinesin motor domain
84395	Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor
84396	Sodium / potassium ATPase beta chain
84399	Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module
84399	Hedgehog amino-terminal signaling domain. For the carboxyl Hint module, see pfam01079. Hedgehog is a family of secreted signal molecules required for embryonic cell differentiation
84401	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
84403	Connexin
84405	Interleukin-12 alpha subunit. Interleukin 12 (IL-12) is a disulphide-bonded heterodimer consisting of a 35kDa alpha subunit and a 40kDa beta subunit. It is involved in the stimulation and maintenance of Th1 cellular immune responses, including the normal
84406	Sulfotransferase protein
84407	Cadherin domain
84407	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
84410	Zinc finger
84422	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuron
84423	Transforming growth factor beta like domain
84424	Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation
84426	wnt family
84429	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
84431	Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in
84435	7 transmembrane receptor (Secretin family)
84436	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
84439	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
84448	Villin headpiece domain
84448	LIM domain. This family represents two copies of the LIM structural domain
84451	Protein kinase domain
84456	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
84456	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
84457	Fibronectin type III domain
84457	Fibronectin type III domain
84464	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
84467	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
84469	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
84471	PX domain. PX domains bind to phosphoinositides
84471	Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2
84475	7 transmembrane receptor (rhodopsin family)
84476	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
84478	Ubiquitin fusion degradation protein UFD1. Post-translational ubiquitin-protein conjugates are recognized for degradation by the ubiquitin fusion degradation (UFD) pathway. Several proteins involved in this pathway have been identified. This family inclu
84481	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
84484	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
84485	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
84491	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
84492	SH2 domain
84492	Protein kinase domain
84492	PH domain. PH stands for pleckstrin homology
84492	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
84492	BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region
84493	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
84494	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
84495	XRCC1 N terminal domain
84495	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
84496	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
84503	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
84504	Homeobox domain
84505	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
84510	Stathmin family
84511	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
84511	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
84514	GH3 auxin-responsive promoter
84517	Actin
84527	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
84528	Homeobox domain
84539	7 transmembrane receptor (rhodopsin family)
84541	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
84541	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
84550	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
84556	C2 domain
84556	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
84557	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
84557	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
84574	LIM domain. This family represents two copies of the LIM structural domain
84575	Fatty acid desaturase
84575	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
84576	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
84578	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
84583	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
84585	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
84586	Fibrinogen beta and gamma chains, C-terminal globular domain
84586	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
84588	PMP-22/EMP/MP20/Claudin family
84589	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
84590	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
84592	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
84596	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
84597	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
84599	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
84608	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
84608	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
84612	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
84614	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
84620	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
84623	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
84628	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
84630	Protein kinase domain
84631	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
84632	PH domain. PH stands for pleckstrin homology
84634	7 transmembrane receptor (rhodopsin family)
84636	7 transmembrane receptor (rhodopsin family)
84643	Kinesin motor domain
84649	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
84653	Helix-loop-helix DNA-binding domain
84656	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
84658	7 transmembrane receptor (Secretin family)
84658	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
84665	Immunoglobulin domain
84667	Helix-loop-helix DNA-binding domain
84669	Ubiquitin carboxyl-terminal hydrolase family 2
84675	B-box zinc finger
84676	B-box zinc finger
84678	jmjC domain
84678	F-box domain
84678	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
84678	F-box domain
84678	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
84679	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
84681	HIT family
84682	pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium
84683	pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium
84684	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
84689	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
84693	Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily
84694	Connexin
84695	Lysyl oxidase
84701	pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium
84717	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
84733	'chromo' (CHRromatin Organization MOdifier) domain
84749	Ubiquitin carboxyl-terminal hydrolase family 2
84750	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
84752	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
84759	Zinc finger, C3HC4 type (RING finger)
84767	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
84769	Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is
84812	PH domain. PH stands for pleckstrin homology
84821	Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown
84823	Intermediate filament protein
84823	Intermediate filament tail domain
84838	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
84838	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
84839	Homeobox domain
84844	Uncharacterised protein family (UPF0123). This family of proteins are uncharacterised, they contain five CXXC motifs
84851	B-box zinc finger
84851	Zinc finger, C3HC4 type (RING finger)
84861	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
84868	Immunoglobulin domain
84869	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
84873	7 transmembrane receptor (Secretin family)
84873	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
84878	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
84878	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
84885	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
84888	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
84888	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
84893	F-box domain
84893	F-box domain
84909	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
84913	Helix-loop-helix DNA-binding domain
84914	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
84936	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
84938	Peptidase family C54
84938	Peptidase family C54
84946	Low temperature viability protein
84948	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d
84948	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
84950	PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is abou
84950	PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is abou
84952	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
84954	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has
84955	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
84958	C2 domain
84967	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
84971	Peptidase family C54
84975	Sugar (and other) transporter
84976	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
84992	Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest
84993	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
85030	Death domain
85031	Lipase
85240	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
85240	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
85242	DNA polymerase alpha subunit B. The B subunit of the DNA polymerase alpha plays an essential role at the initial stage of DNA replication in S. cerevisiae and is phosphorylated in a cell cycle-dependent manner
85243	PI3-kinase family, p85-binding domain
85243	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
85243	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
85244	IQ calmodulin-binding motif. Calmodulin-binding motif
85246	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
85247	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
85248	Kinesin motor domain
85250	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
85253	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
85255	Thiamine pyrophosphate enzyme, C-terminal TPP binding domain
85255	Thiamine pyrophosphate enzyme, N-terminal TPP binding domain
85255	Thiamine pyrophosphate enzyme, central domain. The central domain of TPP enzymes contains a 2-fold Rossman fold
85256	Sugar (and other) transporter
85257	Cyclic nucleotide-binding domain
85257	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
85258	Cyclic nucleotide-binding domain
85258	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
85259	Cyclic nucleotide-binding domain
85259	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
85260	GDA1/CD39 (nucleoside phosphatase) family
85261	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
85262	Mitochondrial carrier protein
85263	Mitochondrial carrier protein
85265	LIM domain. This family represents two copies of the LIM structural domain
85267	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells
85268	7 transmembrane receptor (rhodopsin family)
85269	Nucleoside diphosphate kinase
85270	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
85271	Zona pellucida-like domain
85272	Transforming growth factor beta like domain
85273	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate
85275	Guanylate kinase
85275	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
85275	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
85275	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
85301	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
85302	Intermediate filament protein
85302	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
85308	Uncharacterised protein family (UPF0172)
85309	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
85309	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
85311	Lipase
85315	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h
85327	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
85333	Mitochondrial carrier protein
85358	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
85363	B-box zinc finger
85363	Zinc finger, C3HC4 type (RING finger)
85363	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
85363	Zinc finger, C3HC4 type (RING finger)
85363	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
85363	Zinc finger, C3HC4 type (RING finger)
85363	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
85365	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari
85376	Fibronectin type III domain
85376	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
85377	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
85377	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
85378	Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin
85382	Globin
85406	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
85413	Sugar (and other) transporter
85415	Hr1 repeat
85415	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
85419	Beige/BEACH domain
85425	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
85430	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
85431	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
85433	Zinc carboxypeptidase
85435	MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins
85435	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain s
85439	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
85443	Protein kinase domain
85445	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
85445	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
85446	Homeobox domain
85453	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
85458	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The D
85461	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
85464	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
85464	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
85465	CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond
85472	DNA polymerase family A
85482	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
85482	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
85489	Helix-loop-helix DNA-binding domain
85491	Synaptobrevin
85493	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
85496	Somatomedin B domain
85496	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
85569	Galanin
86614	HSF-type DNA-binding
88066	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
88455	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
89122	B-box zinc finger
89122	B-box zinc finger
89776	Sugar (and other) transporter
89777	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
89778	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
89780	wnt family
89782	Leishmanolysin
89783	Golgi 4-transmembrane spanning transporter
89788	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
89789	Ergosterol biosynthesis ERG4/ERG24 family
89795	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
89797	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
89801	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
89803	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
89804	Protein kinase domain
89804	Fibronectin type III domain
89805	Uncharacterized ACR, COG1579
89805	Intermediate filament protein
89805	Fibrinogen beta and gamma chains, C-terminal globular domain
89807	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretin
89810	7 transmembrane receptor (rhodopsin family)
89813	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
89815	Synaptogyrin. This family of proteins is distantly related to pfam01284
89816	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
89818	Synaptobrevin
89819	G10 protein
89821	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
89822	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
89823	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
89824	Glycosyl hydrolases family 18
89825	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
89826	Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc
89832	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
89832	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
89832	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
89832	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
89842	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
89843	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
89846	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
89846	PH domain. PH stands for pleckstrin homology
89846	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
89846	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
89848	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
89849	WD domain, G-beta repeat
89852	Tetraspanin family
89857	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
89870	B-box zinc finger
89874	Mitochondrial carrier protein
89882	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction
89883	7 transmembrane receptor (rhodopsin family)
89884	Homeobox domain
89884	LIM domain. This family represents two copies of the LIM structural domain
89890	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
89890	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
89941	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
89944	Glycosyl hydrolases family 35
89970	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
89987	Helicase. This family consists of Helicases from the Herpes viruses. Helicases are responsible for the unwinding of DNA and are essential for replication and completion of the viral life cycle
90007	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
90133	Intermediate filament protein
90134	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
90134	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
90135	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
90139	Tetraspanin family
90167	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
90193	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
90199	WAP-type (Whey Acidic Protein) 'four-disulfide core'
90199	WAP-type (Whey Acidic Protein) 'four-disulfide core'
90199	WAP-type (Whey Acidic Protein) 'four-disulfide core'
90199	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
90203	PX domain. PX domains bind to phosphoinositides
90203	PX domain. PX domains bind to phosphoinositides
90249	Death domain
90249	Thrombospondin type 1 domain
90249	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
90293	BTB/POZ domain. The BTB (for BR-C
90293	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Kelch is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Sev
90317	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
90321	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
90342	C2 domain
90353	Uncharacterized protein family UPF0021
90378	Sterile alpha motif (SAM)/Pointed domain
90378	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
90408	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
90423	ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit
90462	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
90470	Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate
90522	Hrf1 family. This family includes a number of eukaryotic proteins. It is an integral membrane protein, conserved in at least 1 copy in all sequenced eukaryotes. The gene name in S. pombe is hrf1+ for Heavy metal Resistance Factor 1
90525	SH2 domain
90576	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
90592	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
90594	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
90627	START domain
90627	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
90673	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
90678	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
90701	Signal peptidase I
90705	Ribosomal protein L21e
90780	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
90787	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
90827	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
90864	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
90906	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
90906	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
90933	B-box zinc finger
90987	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
91012	Homeobox domain
91012	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
91107	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
91107	B-box zinc finger
91107	Zinc finger, C3HC4 type (RING finger)
91107	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
91115	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
91133	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
91137	Mitochondrial carrier protein
91151	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d
91151	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
91156	Fibronectin type III domain
91181	Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins
91227	Gamma-glutamyltranspeptidase
91351	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
91355	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
91369	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
91373	UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate urid
91408	NAC domain
91452	Acyl CoA binding protein
91461	Protein kinase domain
91464	Homeobox domain
91543	Radical SAM superfamily
91561	Ribosomal protein S5, C-terminal domain
91561	Ribosomal protein S5, N-terminal domain
91584	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
91646	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
91646	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
91653	Fibronectin type III domain
91661	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
91662	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
91746	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment
91807	Protein kinase domain
91851	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
91893	Ferredoxin-fold anticodon binding domain. This is the anticodon binding domain found in some phenylalanyl tRNA synthetases. The domain has a ferredoxin fold
91917	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
92014	Mitochondrial carrier protein
92017	PX domain. PX domains bind to phosphoinositides
92086	Gamma-glutamyltranspeptidase
92086	Gamma-glutamyltranspeptidase
92086	Gamma-glutamyltranspeptidase
92105	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
92181	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
92211	Cadherin domain
92267	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
92370	Histidine acid phosphatase
92497	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
92552	Josephin
92597	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
92606	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
92609	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
92610	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
92610	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
92714	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp
92736	Protein of unknown function, DUF270
92736	Protein of unknown function, DUF270
92745	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
92755	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
92755	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
92799	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
92806	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d
92840	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
92856	IMP4 family. This family includes IMP4, which is involved ribosomal RNA processing
92912	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
93010	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
93035	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
93035	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
93107	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
93107	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
93107	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
93166	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
93210	Per1-like. A member of this family has been implemented in protein processing in the endoplasmic reticulum
93225	Ribosomal protein L15
93273	LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1
93333	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
93349	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
93349	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
93395	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
93432	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
93436	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
93474	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
93517	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
93550	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
93550	AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets i
93587	Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue
93589	Cache domain
93594	Caldesmon
93611	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate
93624	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
93649	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
93649	RPEL repeat. The RPEL repeat is named after four conserved amino acids it contains. The function of the RPEL repeat is unknown however it might be a DNA binding repeat based on the observation that one member contains a pfam02037 domain that is also impl
93650	Histidine acid phosphatase
93650	Histidine acid phosphatase
93661	F-actin capping protein alpha subunit
93662	Cadherin domain
93672	Interleukin 10
93673	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
93674	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
93677	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
93685	GDA1/CD39 (nucleoside phosphatase) family
93687	Protein kinase domain
93688	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
93688	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
93689	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
93690	7 transmembrane receptor (rhodopsin family)
93692	Glutaredoxin
93710	Cadherin domain
93714	Cadherin domain
93715	Cadherin domain
93721	Zinc carboxypeptidase
93723	Cadherin domain
93728	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
93728	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
93732	Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA
93734	Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is
93735	wnt family
93739	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
93746	7 transmembrane receptor (metabotropic glutamate family)
93747	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
93747	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
93749	Chaperonin 10 Kd subunit
93757	Signal peptidase I
93760	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
93761	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
93762	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
93762	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
93762	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
93762	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
93837	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
93841	Ubiquitin carboxyl-terminal hydrolase, family 1
93842	Fibronectin type III domain
93873	Cadherin domain
93874	Cadherin domain
93875	Cadherin domain
93876	Cadherin domain
93877	Cadherin domain
93878	Cadherin domain
93879	Cadherin domain
93880	Cadherin domain
93881	Cadherin domain
93882	Cadherin domain
93883	Cadherin domain
93885	Cadherin domain
93888	Cadherin domain
93889	Cadherin domain
93892	Cadherin domain
93896	7 transmembrane receptor (Secretin family)
93896	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
93897	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
93898	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
93953	Putative metallopeptidase (SprT family). This family of uncharacterised proteins may be zinc metallopeptidases
93961	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
93979	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
94030	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
94033	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
94039	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
94045	ATP P2X receptor
94045	ATP P2X receptor
94056	BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function
94061	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
94062	Ribosomal protein L3
94081	Tricarboxylate carrier
94086	Hsp20/alpha crystallin family
94088	B-box zinc finger
94088	Zinc finger, C3HC4 type (RING finger)
94088	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
94089	B-box zinc finger
94090	B-box zinc finger
94090	Fibronectin type III domain
94090	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
94091	B-box zinc finger
94091	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
94092	B-box zinc finger
94094	B-box zinc finger
94094	Zinc finger, C3HC4 type (RING finger)
94097	Tricarboxylate carrier
94103	ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER
94106	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
94109	CUB domain
94109	Sushi domain (SCR repeat)
94120	C2 domain
94120	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
94121	C2 domain
94121	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
94122	C2 domain
94122	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
94134	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
94160	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
94160	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
94160	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
94164	Globin
94168	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
94173	Isocitrate/isopropylmalate dehydrogenase
94174	Papain family cysteine protease
94175	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
94178	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
94179	Intermediate filament protein
94185	Death domain
94191	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
94192	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
94193	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
94193	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
94193	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
94194	Cytochrome c oxidase subunit Vb
94195	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
94198	Proteasome A-type and B-type
94199	Renal dipeptidase
94200	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
94214	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
94215	UDP-glucoronosyl and UDP-glucosyl transferase
94216	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
94217	Low-density lipoprotein receptor domain class A
94217	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
94221	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
94222	Helix-loop-helix DNA-binding domain
94224	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
94226	7 transmembrane receptor (rhodopsin family)
94229	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
94230	CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p
94232	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
94233	7 transmembrane receptor (rhodopsin family)
94234	Fork head domain
94242	Papain family cysteine protease
94249	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
94253	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
94266	Ribosomal protein S27
94267	NUDIX domain
94268	Ephrin
94275	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
94279	Tricarboxylate carrier
94280	Tricarboxylate carrier
94281	Tricarboxylate carrier
94282	Tricarboxylate carrier
94282	Tricarboxylate carrier
94284	UDP-glucoronosyl and UDP-glucosyl transferase
94284	UDP-glucoronosyl and UDP-glucosyl transferase
94338	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DN
94339	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
94341	Inward rectifier potassium channel
94342	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
94352	Lysyl oxidase
94352	Lysyl oxidase
94352	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
94468	3-hydroxyacyl-CoA dehydrogenase, NAD binding domain. This family also includes lambda crystallin
95803	Fumarylacetoacetate (FAA) hydrolase family. This family consists of fumarylacetoacetate (FAA) hydrolase, or fumarylacetoacetate hydrolase (FAH) and it also includes HHDD isomerase/OPET decarboxylase from E. coli strain W. FAA is the last enzyme in the tyr
96764	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
96935	Sushi domain (SCR repeat)
96957	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
97086	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
97086	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
97114	Core histone H2A/H2B/H3/H4
97165	HMG (high mobility group) box
97165	HMG (high mobility group) box
97212	3-hydroxyacyl-CoA dehydrogenase, NAD binding domain. This family also includes lambda crystallin
97212	3-hydroxyacyl-CoA dehydrogenase, C-terminal domain. This family also includes lambda crystallin. Some proteins include two copies of this domain
97212	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
97440	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
97484	Dor1-like family. Dor1 is involved in vesicle targeting to the yeast Golgi apparatus and complexes with a number of other trafficking proteins, which include Sec34 and Sec35
97487	Synaptogyrin. This family of proteins is distantly related to pfam01284
97541	tRNA synthetases class I (E and Q), anti-codon binding domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase a
97541	tRNA synthetases class I (E and Q), catalytic domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase aminoacyla
97795	Laminin N-terminal (Domain VI)
97827	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termi
97884	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
98027	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
98256	Monooxygenase. This family includes diverse enzymes that utilise FAD
98258	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
98366	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
98386	Ergosterol biosynthesis ERG4/ERG24 family
98388	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
98396	Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples ha
98396	Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples ha
98396	Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples hav
98402	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
98417	Cornichon protein
98432	PH domain. PH stands for pleckstrin homology
98432	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
98660	E1-E2 ATPase
98660	Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
98660	Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
98660	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
98682	LacY proton/sugar symporter. This family is closely related to the sugar transporter family
98711	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
98733	Fibronectin type III domain
98733	Fibronectin type III domain
98733	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
98741	Kv2 voltage-gated K+ channel
98741	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
98835	Integral membrane protein DUF106. This archaebacterial protein family has no known function. Members are predicted to be integral membrane proteins
98848	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
98848	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fo
98870	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
98870	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-f
98910	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
98932	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
99010	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
99031	Oxysterol-binding protein
99151	Anp1. The members of this family (Anp1, Van1 and Mnn9) are membrane proteins required for proper Golgi function. These proteins colocalize within the cis Golgi, and that they are physically associated in two distinct complexes
99151	Anp1. The members of this family (Anp1, Van1 and Mnn9) are membrane proteins required for proper Golgi function. These proteins co-localize within the cis Golgi, and that they are physically associated in two distinct complexes
99151	Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of var
99152	Cullin family
99237	Endomembrane protein 70
99296	7 transmembrane receptor (rhodopsin family)
99326	Rap/ran-GAP
99326	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
99334	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
99375	Cullin family
99377	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
99377	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
99382	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
99382	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
99382	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
99439	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
99439	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
99439	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous
99512	WD domain, G-beta repeat
99543	Olfactomedin-like domain
99571	Fibrinogen beta and gamma chains, C-terminal globular domain
99633	7 transmembrane receptor (Secretin family)
99633	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
99633	Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein coup
99650	6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each
99683	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24
99689	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
99689	TT viral orf 1. TT virus (TTV), isolated initially from a Japanese patient with hepatitis of unknown aetiology, has since been found to infect both healthy and diseased individuals and numerous prevalence studies have raised questions about its role in un
99689	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
99696	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
99696	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
99730	Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histon
99738	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
99738	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
99929	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
99929	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
99929	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
99929	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
100061	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
100066	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
100090	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
100102	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
100121	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
100129	7 transmembrane receptor (rhodopsin family)
100163	Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl
100198	Glucose-6-phosphate dehydrogenase, C-terminal domain
100198	Glucose-6-phosphate dehydrogenase, NAD binding domain
100198	Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase
100206	ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisi
100206	ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisin
100210	Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity
100213	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
100273	Oxysterol-binding protein
100273	Oxysterol-binding protein
100336	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
100383	BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function
100383	BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function
100465	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
100465	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc
100559	UDP-glucoronosyl and UDP-glucosyl transferase
100561	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
100608	CBF/Mak21 family
100727	UDP-glucoronosyl and UDP-glucosyl transferase
100732	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
100732	EB1-like C-terminal motif. This motif is found at the C-terminus of proteins that are related to the EB1 protein. The EB1 proteins contain an N-terminal CH domain pfam00307. The human EB1 protein was originally discovered as a protein interacting with the
100737	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
100756	Ubiquitin carboxyl-terminal hydrolase family 2
100756	Ubiquitin carboxyl-terminal hydrolases family 2
100763	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
100929	Flavodoxin
100929	Radical SAM superfamily
100951	Helix-loop-helix DNA-binding domain
100972	ADP-ribosylation factor family
100972	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
101023	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
101185	Telomere-binding protein alpha subunit, N-terminal domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. A
101187	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
101202	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
101240	WD domain, G-beta repeat
101320	Protein kinase domain
101358	F-box domain
101488	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
101488	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
101488	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
101488	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
101490	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
101502	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
101513	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
101533	Trypsin
101540	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
101568	Protein kinase domain
101592	Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold
101592	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
101613	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
101700	B-box zinc finger
101700	Zinc finger, C3HC4 type (RING finger)
101706	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
101739	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
101744	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacte
101744	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
101860	Sulfotransferase protein
101861	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
101861	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
101943	CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in p
101964	Sterile alpha motif (SAM)/Pointed domain
101964	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
101994	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
102022	Carboxylesterase
102060	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
102075	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
102141	PX domain. PX domains bind to phosphoinositides
102141	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
102278	C2 domain
102294	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
102448	FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain
102502	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
102570	Sugar (and other) transporter
102580	Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability
102607	PX domain. PX domains bind to phosphoinositides
102626	Protein kinase domain
102680	Sodium:neurotransmitter symporter family
102774	TPR Domain
102857	Sodium:neurotransmitter symporter family
102926	Peptidase family C54
102926	Peptidase family C54
103098	Sodium:neurotransmitter symporter family
103135	Ubiquitin carboxyl-terminal hydrolase family 2
103135	Ubiquitin carboxyl-terminal hydrolase family 2
103135	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
103142	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
103149	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
103199	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
103236	Protein kinase domain
103296	Protein kinase domain
103468	Nuclear pore protein 84 / 107. Nup84p forms a complex with five proteins, of which Nup120p, Nup85p, Sec13p, and a Sec13p homolog. This Nup84p complex in conjunction with Sec13-type proteins is required for correct nuclear pore biogenesis
103468	Nuclear pore protein 84 / 107. Nup84p forms a complex with five proteins, of which Nup120p, Nup85p, Sec13p, and a Sec13p homolog. This Nup84p complex in conjunction with Sec13-type proteins is required for correct nuclear pore biogenesis
103537	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino te
103655	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
103694	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
103775	Mitochondrial carrier protein
103784	WD domain, G-beta repeat
103841	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
103968	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
103968	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
103978	Glypican
103978	Glypican
103988	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
104001	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
104009	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
104010	Cadherin domain
104010	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
104069	Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid
104082	WD domain, G-beta repeat
104086	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
104099	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
104110	Adenylate and Guanylate cyclase catalytic domain
104111	Adenylate and Guanylate cyclase catalytic domain
104112	Citrate synthase
104112	CoA binding domain. This domain has a Rossmann fold and is found in a number of proteins including succinyl CoA synthetases, malate and ATP-citrate ligases
104112	CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP
104156	Ets-domain
104158	Carboxylesterase
104174	Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated
104183	Glycosyl hydrolases family 18
104184	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
104184	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
104245	Sodium:neurotransmitter symporter family
104263	jmjC domain
104303	ADP-ribosylation factor family
104318	Protein kinase domain
104318	Protein kinase domain
104348	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
104348	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
104349	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
104360	LIM domain. This family represents two copies of the LIM structural domain
104383	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
104384	Homeobox domain
104394	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
104416	Ubiquitin carboxyl-terminal hydrolase, family 1
104418	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
104418	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
104418	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
104443	7 transmembrane receptor (rhodopsin family)
104625	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
104652	Myosin head (motor domain)
104718	Coatomer WD associated domain
104759	Phospholipase D. Active site motif. Phosphatidylcholine-hydrolyzing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certa
104776	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
104816	Asparaginase
104910	Mitochondrial carrier protein
104923	ARD/ARD' family. The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+. ARD and ARD' can be
105014	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
105033	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
105072	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
105245	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
105246	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
105349	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
105372	WD domain, G-beta repeat
105387	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
105418	RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family includ
105446	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
105446	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
105540	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
105540	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
105594	Homeobox domain
105594	Homeobox domain
105663	Adenylate cyclase. One member was found to complement mutants of E. coli. cya. This protein has been shown to be an adenylate kinase
105675	Cyclophilin type peptidyl-prolyl cis-trans isomerase
105727	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
105734	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d
105734	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
105734	CENP-B protein. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding domain, which binds to a corresponding 17bp CENP-B box sequence. In the C-terminal 59 residues,
105734	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di
105734	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
105785	ER lumen protein retaining receptor
105787	Protein kinase domain
105835	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
105866	Intermediate filament protein
105887	UDP-glucoronosyl and UDP-glucosyl transferase
105892	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
106039	VHS domain. Domain present in VPS-27, Hrs and STAM
106039	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
106039	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
106052	F-box domain
106112	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
106112	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
106151	Oxysterol-binding protein
106200	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
106200	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
106248	Queuine tRNA-ribosyltransferase. This is a family of queuine tRNA-ribosyltransferases EC:2.4.2.29, also known as tRNA-guanine transglycosylase and guanine insertion enzyme. Queuine tRNA-ribosyltransferase modifies tRNAs for asparagine, aspartic acid, hist
106326	Oxysterol-binding protein
106369	Yippee putative zinc-binding protein
106407	Domain of unknown function
106522	Protein kinase domain
106529	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
106564	DNA / pantothenate metabolism flavoprotein. The DNA/pantothenate metabolism flavoprotein (EC:4.1.1.36) affects synthesis of DNA, and pantothenate metabolism
106572	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
106583	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
106597	S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure
106618	WD domain, G-beta repeat
106628	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
106648	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
106794	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
106794	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
106794	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
106795	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
106877	PH domain. PH stands for pleckstrin homology
106877	PH domain. PH stands for pleckstrin homology
106947	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
106952	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
106957	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
107141	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
107182	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
107182	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
107182	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u
107221	7 transmembrane receptor (rhodopsin family)
107221	7 transmembrane receptor (rhodopsin family)
107250	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
107250	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
107257	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
107271	tRNA synthetases class I (W and Y)
107271	Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte act
107271	tRNA synthetases class I (W and Y)
107271	Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti
107272	Aminotransferase class-V
107305	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
107321	LIM domain. This family represents two copies of the LIM structural domain
107328	Domain of unknown function DUF60. The proteins in this family have no known function. Two proteins have been annotated as phosphotransferases, however this is not supported experimentally, and should be viewed with caution
107338	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
107351	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
107351	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
107358	Endomembrane protein 70
107358	Endomembrane protein 70
107368	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
107371	Exocyst complex subunit Sec15-like
107375	Mitochondrial carrier protein
107393	Lectin C-type domain. This family includes both long and short form C-type
107393	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
107476	Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction
107508	WHEP-TRS domain
107508	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
107508	tRNA synthetases class I (C). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only cysteinyl tRNA synthetases
107508	tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt
107508	tRNA synthetases class I (E and Q), anti-codon binding domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase a
107508	tRNA synthetases class I (E and Q), catalytic domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase aminoacyla
107513	Translocon-associated protein (TRAP), alpha subunit. The alpha-subunit of the TRAP complex (TRAP alpha) is a single-spanning membrane protein of the endoplasmic reticulum (ER) which is found in proximity of nascent polypeptide chains translocating across
107527	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
107528	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
107568	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
107568	C2 domain
107568	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
107568	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
107581	Thrombospondin N-terminal -like domain
107585	Iodothyronine deiodinase
107589	Protein kinase domain
107589	Fibronectin type III domain
107589	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
107605	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
107607	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
107607	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
107607	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
107607	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
107650	Phosphatidylinositol 3- and 4-kinase
107652	UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate uri
107684	WD domain, G-beta repeat
107686	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
107702	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
107723	K-Cl Co-transporter type 1 (KCC1)
107747	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
107751	OAR domain
107751	Homeobox domain
107753	Galactoside-binding lectin
107767	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cy
107771	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo
107797	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
107798	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
107817	jmjC domain
107823	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
107823	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
107823	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
107831	7 transmembrane receptor (Secretin family)
107831	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
107831	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
107869	Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OA
107885	5-formyltetrahydrofolate cyclo-ligase family. 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyses the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF
107889	GCM motif protein
107932	'chromo' (CHRromatin Organization MOdifier) domain
107932	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
107932	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
107932	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u
107934	7 transmembrane receptor (Secretin family)
107934	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
107934	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
107970	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
107971	PTB domain (IRS-1 type)
108000	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
108000	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
108000	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
108011	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
108011	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
108012	Clathrin adaptor complex small chain
108013	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
108013	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
108015	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
108015	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
108015	Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
108015	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
108015	Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
108015	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
108017	ATP1G1/PLM/MAT8 family
108030	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
108030	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
108037	Serine hydroxymethyltransferase
108043	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
108043	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
108043	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
108043	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
108052	Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue
108062	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
108062	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
108067	Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars
108067	Cytidylyltransferase. This family consists of two main Cytidylyltransferase activities: 1) 3-deoxy-manno-octulosonate cytidylyltransferase, EC:2.7.7.38 catalysing the reaction:- CTP + 3-deoxy-D-manno-octulosonate <=> diphosphate + CMP-3-deoxy-D-manno-octu
108069	7 transmembrane receptor (metabotropic glutamate family)
108069	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
108069	7 transmembrane receptor (metabotropic glutamate family)
108071	7 transmembrane receptor (metabotropic glutamate family)
108071	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
108072	7 transmembrane receptor (metabotropic glutamate family)
108072	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
108073	7 transmembrane receptor (metabotropic glutamate family)
108073	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
108075	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
108075	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
108077	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
108079	Protein kinase domain
108083	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal
108089	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
108096	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
108096	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
108105	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
108114	Sugar (and other) transporter
108115	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
108115	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
108116	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
108116	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
108124	NSF attachment protein
108143	Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tra
108148	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
108150	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
108151	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
108151	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
108154	Thrombospondin type 1 domain
108154	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
108154	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
108156	Formate--tetrahydrofolate ligase
108159	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
108159	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
108657	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
108660	Zinc finger
108664	V-ATPase subunit H
108672	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
108686	Intermediate filament protein
108686	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
108687	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
108699	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
108699	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
108699	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
108723	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
108760	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
108800	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
108800	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
108802	Calreticulin family
108837	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
108837	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
108841	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
108841	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
108853	PCRF domain. This domain is found in peptide chain release factors
108853	Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons
108888	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
108888	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
108898	NAD-dependent glycerol-3-phosphate dehydrogenase
108898	NAD-dependent glycerol-3-phosphate dehydrogenase
108943	Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue
108946	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
108960	Death domain
108960	Protein kinase domain
108960	Death domain
108960	Protein kinase domain
108995	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
109032	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
109032	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
109052	Intermediate filament protein
109052	Intermediate filament protein
109052	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
109052	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
109075	3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
109093	tRNA synthetases class I (R). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only arginyl tRNA synthetase
109108	XPA protein
109108	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
109108	XPA protein
109108	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
109113	YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named
109113	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
109113	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
109115	Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histon
109115	Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone
109137	ADP-ribosylation factor family
109137	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
109161	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
109161	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
109168	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
109225	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
109225	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
109232	Saccharopine dehydrogenase. This family comprised of three structural domains that can not be separated in the linear sequence. In some organisms this enzyme is found as a bifunctional polypeptide with lysine ketoglutarate reductase (PF). The saccharopine
109246	Tetraspanin family
109246	Tetraspanin family
109246	Tetraspanin family
109264	Malic enzyme, NAD binding domain
109264	Malic enzyme, N-terminal domain
109264	Malic enzyme, NAD binding domain
109267	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
109272	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
109280	Sugar (and other) transporter
109310	Ubiquitin carboxyl-terminal hydrolase family 2
109323	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
109331	Zinc finger, C3HC4 type (RING finger)
109333	Hr1 repeat
109333	Protein kinase domain
109333	Protein kinase C terminal domain
109342	Sodium:solute symporter family
109346	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
109346	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
109594	LIM domain. This family represents two copies of the LIM structural domain
109596	IQ calmodulin-binding motif. Calmodulin-binding motif
109624	Caldesmon
109637	Tetraspanin family
109648	Pancreatic hormone peptide
109652	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
109672	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
109674	Adenosine/AMP deaminase
109676	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
109676	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
109676	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
109685	Hyaluronidase
109685	Hyaluronidase
109685	RolB/RolC glucosidase family. This family of proteins includes RolB and RolC. RolC releases cytokinins from glucoside conjugates. Whereas RolB hydrolyses indole glucosides
109689	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
109689	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
109689	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
109689	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
109697	Zinc carboxypeptidase
109697	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol
109711	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
109711	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
109731	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
109754	Oxidoreductase FAD-binding domain
109778	Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilise NADP or NAD
109801	Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily
109820	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
109857	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
109880	B-box zinc finger
109880	Protein kinase domain
109901	Trypsin
109904	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
109929	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
109929	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
109978	NAD:arginine ADP-ribosyltransferase
109979	NAD:arginine ADP-ribosyltransferase
110006	Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
110006	Glycosyl hydrolases family 2, sugar binding domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities and has a jelly-roll fold
110052	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
110058	C2 domain
110074	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
110075	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
110078	Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses
110082	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
110095	Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses
110109	NOL1/NOP2/sun family
110119	Phosphomannose isomerase type I
110135	Fibrinogen beta and gamma chains, C-terminal globular domain
110147	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
110147	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
110147	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
110147	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
110157	Protein kinase domain
110157	Raf-like Ras-binding domain
110157	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
110168	7 transmembrane receptor (rhodopsin family)
110196	Polyprenyl synthetase
110197	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
110197	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
110213	Uncharacterized protein family UPF0005
110253	PH domain. PH stands for pleckstrin homology
110279	C2 domain
110279	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
110279	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
110304	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
110304	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
110308	Intermediate filament protein
110310	Intermediate filament protein
110323	Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase
110326	7 transmembrane receptor (metabotropic glutamate family)
110326	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
110351	Rap/ran-GAP
110351	LGN motif, putative GEF specific for G-alpha GTPase
110355	Protein kinase domain
110355	PH domain. PH stands for pleckstrin homology
110355	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
110382	Low-density lipoprotein receptor domain class A
110382	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
110382	Thrombospondin type 1 domain
110382	Low-density lipoprotein receptor domain class A
110382	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl
110446	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
110521	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
110524	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
110524	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
110524	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
110532	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
110542	Protein kinase domain
110542	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
110595	Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species
110596	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
110616	Josephin
110637	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
110637	Bacterial extracellular solute-binding proteins, family 3
110637	Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors
110637	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
110648	Homeobox domain
110648	LIM domain. This family represents two copies of the LIM structural domain
110695	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
110750	CAS/CSE protein, C-terminus. Mammalian cellular apoptosis susceptibility (CAS) proteins are homologous to the yeast chromosome-segregation protein, CSE1. This family aligns the C-terminal halves (approximately). CAS is involved in both cellular apoptosis
110751	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
110751	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
110789	Calx-beta domain
110805	Fork head domain
110805	Ribosomal L29e protein family
110809	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
110816	Periodic tryptophan protein 2 WD repeat associated domain
110816	WD domain, G-beta repeat
110816	Periodic tryptophan protein 2 WD repeat associated domain
110829	LIM domain. This family represents two copies of the LIM structural domain
110834	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
110834	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
110835	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
110835	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
110835	Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
110835	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
110854	Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is
110855	3'5'-cyclic nucleotide phosphodiesterase
110862	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
110862	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
110880	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
110886	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
110886	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
110886	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
110886	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
110891	Calx-beta domain
110891	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
110893	Calx-beta domain
110893	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
110902	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
110902	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
110911	Phosphatidate cytidylyltransferase
110920	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
110920	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
110948	Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to for
110954	Ribosomal L10
110956	ENV polyprotein (coat polyprotein)
110956	ENV polyprotein (coat polyprotein)
111174	7 transmembrane receptor (rhodopsin family)
111175	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
112400	Neuregulin family
112417	UDP-glucoronosyl and UDP-glucosyl transferase
112483	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
112574	PX domain. PX domains bind to phosphoinositides
112714	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
112714	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
112755	Syntaxin
112802	Intermediate filament protein
112885	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
112936	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp
112937	Glycosyl hydrolases family 35
112939	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
113026	PH domain. PH stands for pleckstrin homology
113178	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
113189	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
113220	Kinesin motor domain
113251	GTPase of unknown function
113419	Protein of unknown function (DUF396). A family of conserved eukaryotic transmembrane proteins
113451	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
113457	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
113457	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
113612	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
113622	ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisi
113730	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
113829	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
113845	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113846	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113847	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113848	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113849	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113849	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113849	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113850	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113850	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113850	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113851	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113851	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113852	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113852	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113852	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113853	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113853	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113854	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113855	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113856	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113857	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113858	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113859	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113860	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113861	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113861	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113862	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113863	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113863	7 transmembrane receptor (rhodopsin family)
113863	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113864	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113864	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113865	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113867	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
113886	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
113892	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
113893	Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid
113894	Octicosapeptide repeat. Short motif that may bind Ca2+
113894	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
113901	Glycosyl hydrolases family 18
113901	Chitin binding Peritrophin-A domain. This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruse
113902	Carboxylesterase
113906	Hsp20/alpha crystallin family
113910	Prion/Doppel alpha-helical domain. The prion protein is thought to be the infectious agent that causes transmissible spongiform encephalopathies, such as scrapie and BSE. It is thought that the prion protein can exist in two different forms: one is the no
113911	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
113914	7 transmembrane receptor (rhodopsin family)
113918	Sodium:neurotransmitter symporter family
113927	Protein kinase domain
113936	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fo
113940	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
113940	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
113947	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
113956	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
113958	Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes
113959	7 transmembrane receptor (rhodopsin family)
113960	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
113960	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
113970	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
113983	Homeobox domain
113984	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
113984	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
113992	UDP-glucoronosyl and UDP-glucosyl transferase
113995	ATP P2X receptor
114002	Sulfotransferase protein
114020	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
114022	Glutaredoxin
114023	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
114027	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
114032	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
114032	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
114087	Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration
114088	Fibronectin type III domain
114088	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
114088	Fibronectin type III domain
114089	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
114091	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
114094	Ubiquitin carboxyl-terminal hydrolase, family 1
114095	Sec1 family
114096	Isocitrate/isopropylmalate dehydrogenase
114097	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
114098	7 transmembrane receptor (rhodopsin family)
114099	7 transmembrane receptor (rhodopsin family)
114100	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues.
114101	Phosphatidate cytidylyltransferase
114102	Cadherin domain
114105	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
114106	Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc
114107	WSC domain. This domain may be involved in carbohydrate binding
114108	LIM domain. This family represents two copies of the LIM structural domain
114109	Homeobox domain
114112	Pyridine nucleotide-disulphide oxidoreductase, dimerisation domain. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases
114112	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
114113	Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellu
114114	Dynamin GTPase effector domain
114114	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
114115	ATP P2X receptor
114116	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
114116	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
114119	GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins
114124	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
114128	Golgi 4-transmembrane spanning transporter
114134	Sugar (and other) transporter
114141	PMP-22/EMP/MP20/Claudin family
114142	Fork head domain
114213	Homeobox domain
114214	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
114216	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
114228	Trypsin
114229	7 transmembrane receptor (rhodopsin family)
114243	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
114244	Cyclic nucleotide-binding domain
114244	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
114245	Cyclic nucleotide-binding domain
114245	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
114246	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
114299	Paralemmin
114301	Paralemmin
114304	Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin
114332	Extracellular link domain
114335	Cystine-knot domain
114336	Cystine-knot domain
114481	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
114482	Protein kinase domain
114482	Protein of unknown function (DUF390). This family of long proteins are currently only found in the rice genome. They have no known function. However they may be some kind of transposable element
114493	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
114494	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
114494	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
114497	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a
114499	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
114500	Homeobox domain
114502	Homeobox domain
114502	'Paired box' domain
114503	Homeobox domain
114504	Homeobox domain
114507	Sodium:solute symporter family
114508	Fructose-1-6-bisphosphatase
114513	PH domain. PH stands for pleckstrin homology
114513	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
114516	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
114516	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
114516	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
114521	Uracil DNA glycosylase superfamily
114522	Stromal antigen (SA/STAG) protein
114523	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
114524	Laminin N-terminal (Domain VI)
114524	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
114548	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
114553	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
114555	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
114558	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either asso
114561	Phosphatidylinositol transfer protein
114562	Cdc37 family. In the budding yeast Saccharomyces cerevisiae, Cdc37 is required for the productive formation of Cdc28-cyclin complexes. Cdc37 may be a kinase targeting subunit of Hsp90
114564	7 transmembrane receptor (rhodopsin family)
114564	7 transmembrane receptor (rhodopsin family)
114565	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
114566	Intermediate filament protein
114570	LIM domain. This family represents two copies of the LIM structural domain
114570	LIM domain. This family represents two copies of the LIM structural domain
114571	Sugar (and other) transporter
114589	Fibronectin type III domain
114590	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
114591	Peptidase family M49
114593	Atrial natriuretic peptide
114598	Lectin C-type domain. This family includes both long and short form C-type
114601	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
114601	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
114602	MYND finger
114629	Sulfate transporter family. Mutations may lead to several human diseases
114629	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
114629	Sulfate transporter family. Mutations may lead to several human diseases
114629	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
114630	ATP synthase subunit C
114632	Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti
114633	C2 domain
114633	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
114634	Homeobox domain
114639	Zona pellucida-like domain
114641	Ribosomal protein L31e
114644	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
114663	Inositol monophosphatase family
114664	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
114664	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
114671	Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
114671	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
114674	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
114675	Caldesmon
114675	WD domain, G-beta repeat
114675	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
114700	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
114705	Raf-like Ras-binding domain
114712	Helix-loop-helix DNA-binding domain
114757	Globin
114767	Protein-tyrosine phosphatase
114767	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
114769	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
114780	Galactose binding lectin domain
114780	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
114780	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
114780	REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c
114780	Galactose binding lectin domain
114780	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
114780	REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c
114780	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
114780	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
114781	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
114782	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
114783	Protein kinase domain
114784	CUB domain
114784	Sushi domain (SCR repeat)
114784	Sushi domain (SCR repeat)
114788	CUB domain
114788	Sushi domain (SCR repeat)
114788	Sushi domain (SCR repeat)
114789	Mitochondrial carrier protein
114791	Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin
114792	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
114794	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
114798	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
114803	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
114805	QXW lectin repeat
114805	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
114814	7 transmembrane receptor (rhodopsin family)
114815	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
114819	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
114821	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
114821	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
114822	Hr1 repeat
114822	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
114823	SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1
114825	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
114826	MYND finger
114845	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
114846	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
114851	Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase
114856	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
114859	Protein kinase domain
114861	Serine carboxypeptidase
114874	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoester
114874	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera
114876	Oxysterol-binding protein
114876	Oxysterol-binding protein
114876	Oxysterol-binding protein
114879	Oxysterol-binding protein
114879	Oxysterol-binding protein
114880	Oxysterol-binding protein
114880	Oxysterol-binding protein
114881	Oxysterol-binding protein
114881	Oxysterol-binding protein
114882	Oxysterol-binding protein
114883	Oxysterol-binding protein
114883	Oxysterol-binding protein
114883	Oxysterol-binding protein
114883	Oxysterol-binding protein
114883	Oxysterol-binding protein
114883	Oxysterol-binding protein
114883	Oxysterol-binding protein
114884	Oxysterol-binding protein
114885	Oxysterol-binding protein
114886	Globin
114889	Homeobox domain
114893	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
114893	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
114895	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
114897	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
114899	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
114901	Sorbin homologous domain
114903	PMP-22/EMP/MP20/Claudin family
114906	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
114971	Protein-tyrosine phosphatase
114971	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
115004	Mab-21 protein
115004	Mab-21 protein
115019	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
115019	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
115131	7 transmembrane receptor (rhodopsin family)
115154	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
115196	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
115201	Peptidase family C54
115201	Peptidase family C54
115201	Peptidase family C54
115207	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
115273	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
115281	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
115286	Mitochondrial carrier protein
115290	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate
115290	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate
115294	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
115330	7 transmembrane receptor (rhodopsin family)
115361	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
115361	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
115362	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
115362	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
115426	YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named
115426	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
115426	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
115426	YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named
115426	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
115426	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
115442	Protein-tyrosine phosphatase
115509	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
115548	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
115584	Sodium:solute symporter family
115669	TPR Domain
115701	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a
115727	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
115727	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
115727	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
115761	ADP-ribosylation factor family
115770	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
115817	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
115827	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
115908	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
115939	Domain of unknown function (DUF367)
115939	Possible metal-binding domain in RNase L inhibitor, RLI. Possible metal-binding domain in endoribonuclease RNase L inhibitor. Found at the N-terminal end of RNase L inhibitor proteins, adjacent to the 4Fe-4S binding domain, fer4, pfam00037. Also often fou
116068	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
116085	Sugar (and other) transporter
116085	Sugar (and other) transporter
116123	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
116166	Zinc-binding dehydrogenase
116179	Transglutaminase family
116179	Transglutaminase family, C-terminal ig like domain
116179	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
116225	MYND finger
116255	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
116328	Mechanosensitive ion channel. Two members of this protein family of M. jannaschii have been functionally characterized. Both proteins form mechanosensitive (MS) ion channels upon reconstitution into liposomes and functional examination by the patch-clamp
116362	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
116369	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
116369	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
116379	Tissue factor
116379	Tissue factor
116379	Tissue factor
116412	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
116444	Bacterial extracellular solute-binding proteins, family 3
116444	Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors
116448	Helix-loop-helix DNA-binding domain
116456	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
116463	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
116466	wnt family
116469	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
116470	Syntaxin
116474	Somatotropin hormone family
116481	HAT (Half-A-TPR) repeat. The HAT (Half A TPR) repeat is found in several RNA processing proteins
116482	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
116484	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
116486	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
116486	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
116487	Fasciclin domain. This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria
116491	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
116498	7 transmembrane receptor (rhodopsin family)
116499	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
116499	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling.
116502	Apoptosis regulator proteins, Bcl-2 family
116506	7 transmembrane receptor (Secretin family)
116509	Sodium:neurotransmitter symporter family
116510	Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species
116511	7 transmembrane receptor (rhodopsin family)
116519	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
116534	7 transmembrane receptor (rhodopsin family)
116543	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
116544	Pou domain - N-terminal to homeobox domain
116547	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
116548	Furin-like cysteine rich region
116548	Giardia variant-specific surface protein
116548	Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic d
116548	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold cont
116549	Protein kinase domain
116551	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
116555	Nuclear pore protein 84 / 107. Nup84p forms a complex with five proteins, of which Nup120p, Nup85p, Sec13p, and a Sec13p homolog. This Nup84p complex in conjunction with Sec13-type proteins is required for correct nuclear pore biogenesis
116556	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
116557	Fork head domain
116558	Interleukin 7/9 family. IL-7 is a cytokine that acts as a growth factor for early lymphoid cells of both B- and T-cell lineages. IL-9 is a multi-functional cytokine that, although originally described as a T-cell growth factor, its function in T-cell resp
116560	Inward rectifier potassium channel
116561	Clathrin light chain
116562	Interleukin 2
116563	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
116564	Protein kinase domain
116564	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
116568	Gamma-glutamyltranspeptidase
116569	Glycosyl hydrolase family 1
116589	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
116592	Trefoil (P-type) domain
116593	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
116598	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
116599	Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilise NADP or NAD
116600	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
116631	N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation
116632	N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation
116637	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
116639	Homeobox domain
116639	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
116642	Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin
116643	Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist o
116644	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
116646	Translationally controlled tumor protein
116647	Carboxylesterase
116648	Homeobox domain
116649	Inward rectifier potassium channel
116649	Inward rectifier potassium channel
116651	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
116653	CAP protein
116654	Sec8 exocyst complex component specific domain
116655	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
116658	Fibronectin type III domain
116663	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
116664	ATP synthase (F/14-kDa) subunit. This family includes 14-kDa subunit from vATPases, which is in the peripheral catalytic part of the complex. The family also includes archaebacterial ATP synthase subunit F
116665	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
116666	Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-
116668	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
116671	Cyclin-dependent kinase 5 activator protein
116672	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o
116673	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
116676	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
116677	7 transmembrane receptor (rhodopsin family)
116681	Guanylate-kinase-associated protein (GKAP) protein
116683	Ephrin
116684	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
116685	Intermediate filament protein
116685	Intermediate filament tail domain
116689	SH2 domain
116690	Fibronectin type III domain
116691	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
116692	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
116693	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
116694	Calpain family cysteine protease
116694	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by
116696	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
116700	Glycoprotein hormone
116717	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
116719	Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction
116720	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
116723	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K cata
116724	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
116727	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
116727	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
116740	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
116742	Cadherin domain
116743	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
116744	7 transmembrane receptor (rhodopsin family)
116745	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
116748	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spli
116777	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
116780	Cadherin domain
116782	Cadherin domain
116808	Cadherin domain
116831	ATP1G1/PLM/MAT8 family
116838	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
116839	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
116841	SNAP-25 family. SNAP-25 (synaptosome-associated protein 25 kDa) proteins are components of SNARE complexes. Members of this family contain a cluster of cysteine residues that can be palmitoylated for membrane attachment
116844	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
116848	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
116848	DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors)
116849	Interleukin 10
116852	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
116870	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
116870	GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do
116871	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
116872	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
116873	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
116904	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a
116905	Putative diphthamide synthesis protein. One member is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein s
116914	Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analog
116939	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o
116940	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
116967	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions
116969	NAD:arginine ADP-ribosyltransferase
116983	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
116984	PH domain. PH stands for pleckstrin homology
116985	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
116985	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
116986	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
116986	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
116987	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
116987	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
116987	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
116988	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
116996	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
116998	Pentaxin family. Pentaxins are also known as pentraxins
116999	Ribosomal protein L36e
116999	Zinc finger, C3HC4 type (RING finger)
116999	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
117001	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
117001	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
117004	7 transmembrane receptor (rhodopsin family)
117005	7 transmembrane receptor (rhodopsin family)
117016	Kinase associated domain 1
117018	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
117018	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
117019	Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac
117021	Non-repetitive/WGA-negative nucleoporin. This is a family of nucleoporin proteins. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells, and mediate bidirectional nucleocytoplasmic transport, especially of mRNA and protein
117023	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
117023	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
117024	Ornithine cyclodeaminase/mu-crystallin family. This family contains the bacterial Ornithine cyclodeaminase enzyme EC:4.3.1.12, which catalyses the deamination of ornithine to proline. This family also contains mu-Crystallin the major component of the eye
117025	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa
117026	Phosphotyrosine interaction domain (PTB/PID)
117027	7 transmembrane receptor (rhodopsin family)
117028	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
117029	7 transmembrane receptor (rhodopsin family)
117033	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
117033	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
117033	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
117034	Helix-loop-helix DNA-binding domain
117035	Mitochondrial carrier protein
117037	Hyaluronidase
117042	Ribosomal protein L6e
117042	Ribosomal protein L6, N-terminal domain
117043	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
117045	Eukaryotic initiation factor 4E
117046	Intermediate filament protein
117048	Cadherin domain
117050	ADP-ribosylation factor family
117051	ADP-ribosylation factor family
117052	Inward rectifier potassium channel
117053	Ribosomal protein S8
117055	Connexin
117061	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
117061	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
117061	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
117065	Homeobox domain
117088	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the e
117094	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
117095	Helix-loop-helix DNA-binding domain
117096	Carboxylesterase
117097	7 transmembrane receptor (rhodopsin family)
117098	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
117099	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
117100	Sodium:neurotransmitter symporter family
117101	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
117101	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
117103	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
117105	Kv2 voltage-gated K+ channel
117105	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
117105	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
117106	CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion
117107	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
117108	Glycosyltransferase family 43
117109	Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40
117140	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
117144	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
117148	Uncharacterized ACR, YneC family COG1359
117148	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typ
117148	Uncharacterized ACR, YneC family COG1359
117152	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
117154	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
117155	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
117155	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
117155	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
117177	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail regio
117179	CUB domain
117180	'Cold-shock' DNA-binding domain
117182	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
117194	7 transmembrane receptor (rhodopsin family)
117195	7 transmembrane receptor (rhodopsin family)
117196	7 transmembrane receptor (rhodopsin family)
117198	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
117198	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
117229	Protein kinase domain
117229	Protein kinase domain
117232	Homeobox domain
117240	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
117241	Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40
117241	Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40
117242	Plant PEC family metallothionein
117248	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
117252	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
117257	7 transmembrane receptor (rhodopsin family)
117258	Fibronectin type III domain
117260	NUDIX domain
117261	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
117264	7 transmembrane receptor (rhodopsin family)
117267	Sulfate transporter family. Mutations may lead to several human diseases
117270	Trefoil (P-type) domain
117274	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
117276	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta
117279	Death effector domain
117280	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
117281	HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats
117283	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
117287	DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117
117505	LIM domain. This family represents two copies of the LIM structural domain
117512	Low-density lipoprotein receptor domain class A
117512	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
117513	Synaptobrevin
117515	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
117519	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
117523	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
117526	Transcription factor TFIID (or TATA-binding protein, TBP)
117533	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
117537	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
117537	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
117540	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve
117548	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
117548	Kinesin motor domain
117548	PH domain. PH stands for pleckstrin homology
117548	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
117549	7 transmembrane receptor (rhodopsin family)
117550	Kinesin motor domain
117550	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
117553	Repeat in ubiquitin-activating (UBA) protein
117553	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
117554	Lipase
117556	Sugar (and other) transporter
117557	Tropomyosin
117559	Sugar (and other) transporter
117581	Helix-loop-helix DNA-binding domain
117591	Sugar (and other) transporter
117592	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
117600	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
117600	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
117606	Fibronectin type III domain
117854	B-box zinc finger
117854	Zinc finger, C3HC4 type (RING finger)
117854	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
117955	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
118424	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
118427	Olfactomedin-like domain
118446	Connexin
118453	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
118453	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
118453	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
118453	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
118453	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
118453	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
118454	Connexin
118454	Connexin
118454	Connexin
118704	PH domain. PH stands for pleckstrin homology
118704	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
118704	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
118706	PH domain. PH stands for pleckstrin homology
118706	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
118706	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
118709	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
118856	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
118881	O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production
118933	ab-hydrolase associated lipase region
118933	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
118934	Intermediate filament protein
118934	ab-hydrolase associated lipase region
118934	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
118945	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
118980	Tricarboxylate carrier
118980	Tricarboxylate carrier
118987	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
118992	Homeobox domain
119016	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
119016	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
119180	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes
119358	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
119391	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
119391	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
119548	Lipase
119548	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
119558	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
119558	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
119559	Tricarboxylate carrier
119559	Tricarboxylate carrier
119563	Ribosomal protein S8e
119587	Zinc carboxypeptidase
119587	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
119676	7 transmembrane receptor (rhodopsin family)
119678	7 transmembrane receptor (rhodopsin family)
119679	7 transmembrane receptor (rhodopsin family)
119682	7 transmembrane receptor (rhodopsin family)
119683	7 transmembrane receptor (rhodopsin family)
119684	7 transmembrane receptor (rhodopsin family)
119685	7 transmembrane receptor (rhodopsin family)
119686	7 transmembrane receptor (rhodopsin family)
119687	7 transmembrane receptor (rhodopsin family)
119689	7 transmembrane receptor (rhodopsin family)
119690	7 transmembrane receptor (rhodopsin family)
119691	7 transmembrane receptor (rhodopsin family)
119692	7 transmembrane receptor (rhodopsin family)
119694	7 transmembrane receptor (rhodopsin family)
119695	7 transmembrane receptor (rhodopsin family)
119696	7 transmembrane receptor (rhodopsin family)
119722	Intermediate filament protein
119749	7 transmembrane receptor (rhodopsin family)
119750	7 transmembrane receptor (rhodopsin family)
119764	7 transmembrane receptor (rhodopsin family)
119765	7 transmembrane receptor (rhodopsin family)
119766	Ribosomal protein L21e
119770	7 transmembrane receptor (rhodopsin family)
119772	7 transmembrane receptor (rhodopsin family)
119774	7 transmembrane receptor (rhodopsin family)
119775	7 transmembrane receptor (rhodopsin family)
120061	7 transmembrane receptor (rhodopsin family)
120062	7 transmembrane receptor (rhodopsin family)
120065	7 transmembrane receptor (rhodopsin family)
120066	7 transmembrane receptor (rhodopsin family)
120071	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl
120082	jmjN domain
120082	jmjC domain
120084	jmjN domain
120084	jmjC domain
120088	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
120103	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
120105	Cadherin domain
120123	Zinc finger, C3HC4 type (RING finger)
120123	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
120126	HMG (high mobility group) box
120127	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
120128	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
120144	HMG (high mobility group) box
120145	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
120146	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
120227	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
120237	Homeobox domain
120318	Ribosomal protein S6e
120323	Ribosomal protein L21e
120332	ICE-like protease (caspase) p20 domain
120332	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
120356	Ribosomal protein S2
120364	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
120449	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
120526	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
120528	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
120586	7 transmembrane receptor (rhodopsin family)
120643	7 transmembrane receptor (rhodopsin family)
120718	14-3-3 protein
120775	7 transmembrane receptor (rhodopsin family)
120776	7 transmembrane receptor (rhodopsin family)
120787	7 transmembrane receptor (rhodopsin family)
120793	Intermediate filament protein
120793	7 transmembrane receptor (rhodopsin family)
120796	7 transmembrane receptor (rhodopsin family)
120824	B-box zinc finger
120824	Zinc finger, C3HC4 type (RING finger)
120829	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
120864	Dehydratase family
120864	Ribosomal family S4e
120872	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
120892	Protein kinase domain
120892	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
120914	Ribosomal protein L31e
120940	Lectin C-type domain. This family includes both long and short form C-type
120943	Aminotransferase class I and II
120950	Ribosomal protein S26e
121021	Laminin G domain
121054	Intermediate filament protein
121071	Zinc-binding dehydrogenase
121071	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
121078	Ribosomal protein L13
121129	7 transmembrane receptor (rhodopsin family)
121130	7 transmembrane receptor (rhodopsin family)
121155	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
121214	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
121216	Trefoil (P-type) domain
121216	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
121227	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
121260	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
121270	7 transmembrane receptor (rhodopsin family)
121275	7 transmembrane receptor (rhodopsin family)
121278	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
121340	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
121360	7 transmembrane receptor (rhodopsin family)
121364	7 transmembrane receptor (rhodopsin family)
121391	Intermediate filament protein
121456	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
121498	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
121498	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
121512	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
121519	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
121520	Aminotransferase class I and II
121549	Helix-loop-helix DNA-binding domain
121550	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
121551	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
121665	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
121734	Cyclophilin type peptidyl-prolyl cis-trans isomerase
121790	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
121790	pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinit
121906	Proteasome A-type and B-type
121916	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
121940	ADP-ribosylation factor family
121940	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
121981	Cyclophilin type peptidyl-prolyl cis-trans isomerase
122011	Protein kinase domain
122013	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
122013	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
122042	7 transmembrane receptor (rhodopsin family)
122055	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
122107	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
122128	Ribosomal L18ae protein family
122134	Protein kinase domain
122201	START domain
122217	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
122335	Cyclophilin type peptidyl-prolyl cis-trans isomerase
122402	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
122402	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
122500	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
122501	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
122524	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
122529	Intermediate filament protein
122553	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
122570	Ribosomal protein L6
122585	Ribosomal protein L23
122589	Ribosomal protein L10
122592	Intermediate filament protein
122622	Adenylosuccinate synthetase
122649	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
122651	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
122703	7 transmembrane receptor (rhodopsin family)
122706	Proteasome A-type and B-type
122718	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
122739	7 transmembrane receptor (rhodopsin family)
122740	7 transmembrane receptor (rhodopsin family)
122741	7 transmembrane receptor (rhodopsin family)
122742	7 transmembrane receptor (rhodopsin family)
122744	7 transmembrane receptor (rhodopsin family)
122745	7 transmembrane receptor (rhodopsin family)
122747	7 transmembrane receptor (rhodopsin family)
122748	7 transmembrane receptor (rhodopsin family)
122749	7 transmembrane receptor (rhodopsin family)
122750	7 transmembrane receptor (rhodopsin family)
122772	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
122783	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
122788	Cytochrome c oxidase subunit Va. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit Va
122799	S25 ribosomal protein
122830	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
122867	pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium
122873	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
122927	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
122950	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
122961	HesB-like domain. This family includes HesB which may be involved in nitrogen fixation
123031	Ribosomal protein L21e
123041	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
123041	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
123041	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
123096	Mitochondrial carrier protein
123099	Fatty acid desaturase
123103	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
123103	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
123146	Intermediate filament protein
123169	Leo1-like protein. Members of this family are part of the Paf1/RNA polymerase II complex. The Paf1 complex probably functions during the elongation phase of transcription
123228	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
123257	HMG14 and HMG17
123278	7 transmembrane receptor (rhodopsin family)
123279	7 transmembrane receptor (rhodopsin family)
123374	Core histone H2A/H2B/H3/H4
123396	Intermediate filament protein
123396	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
123397	Ribosomal L29e protein family
123442	Intermediate filament protein
123722	Fibronectin type III domain
123722	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
123745	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop
123788	Trypsin
123876	AMP-binding enzyme
123879	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
123887	Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein
124132	Ribosomal protein S8
124149	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
124199	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
124199	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
124199	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
124221	Trypsin
124222	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h
124274	7 transmembrane receptor (rhodopsin family)
124321	Eukaryotic ribosomal protein L18
124401	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
124401	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
124404	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
124411	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
124451	Homeobox domain
124451	LIM domain. This family represents two copies of the LIM structural domain
124454	tRNA synthetases class I (E and Q), catalytic domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase aminoacyla
124460	PX domain. PX domains bind to phosphoinositides
124461	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
124497	Homeobox domain
124535	HSF-type DNA-binding
124538	7 transmembrane receptor (rhodopsin family)
124555	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
124563	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
124565	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
124602	Kinesin motor domain
124637	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
124641	Domain of unknown function (DUF341)
124663	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
124664	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
124667	HesB-like domain. This family includes HesB which may be involved in nitrogen fixation
124751	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
124760	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
124817	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
124827	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
124858	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
124872	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
124912	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzyme
124925	Sushi domain (SCR repeat)
124925	CUB domain
124925	Sushi domain (SCR repeat)
124936	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
124975	Gamma-glutamyltranspeptidase
124976	Sugar (and other) transporter
125058	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
125110	Cyclophilin type peptidyl-prolyl cis-trans isomerase
125111	Connexin
125113	Intermediate filament protein
125115	Intermediate filament protein
125115	Intermediate filament protein
125115	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
125182	7 transmembrane receptor (rhodopsin family)
125206	Sodium:solute symporter family
125228	Late embryogenesis abundant protein. Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress. The function of these proteins is unknown
125242	Intermediate filament protein
125306	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
125336	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
125338	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
125491	Cyclophilin type peptidyl-prolyl cis-trans isomerase
125491	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
125836	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
125850	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
125884	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
125910	Ribosomal protein S8
125950	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
125950	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
125958	7 transmembrane receptor (rhodopsin family)
125962	7 transmembrane receptor (rhodopsin family)
125963	7 transmembrane receptor (rhodopsin family)
125966	Tropomyosin
125972	Calreticulin family
125980	Adenylate kinase
126003	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
126005	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
126005	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
126006	LGN motif, putative GEF specific for G-alpha GTPase
126015	Homeobox domain
126037	EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains
126060	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
126070	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
126119	Josephin
126129	Choline/Carnitine o-acyltransferase
126147	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
126170	Cyclophilin type peptidyl-prolyl cis-trans isomerase
126204	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
126205	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
126206	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
126208	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
126235	Ribosomal family S4e
126242	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
126252	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
126295	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
126308	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
126341	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
126370	7 transmembrane receptor (rhodopsin family)
126374	LIM domain. This family represents two copies of the LIM structural domain
126393	Hsp20/alpha crystallin family
126410	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
126432	Vacuolar sorting protein 9 (VPS9) domain
126433	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate
126433	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated
126438	7 transmembrane receptor (rhodopsin family)
126439	7 transmembrane receptor (rhodopsin family)
126441	7 transmembrane receptor (rhodopsin family)
126476	Intermediate filament protein
126494	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
126504	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
126520	Protein kinase domain
126537	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
126538	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
126541	7 transmembrane receptor (rhodopsin family)
126549	LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1
126567	C2 domain
126576	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whic
126598	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
126611	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
126636	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
126637	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
126668	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
126669	SH2 domain
126755	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
126792	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
126811	Intermediate filament protein
126823	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
126849	Adenine deaminase. Adenine deaminase EC:3.5.4.2 hydrolyses adenine to form hypoxanthine and ammonia. The enzyme is part of a large metal dependent hydrolase superfamily. Adenine deaminases reaction is important for adenine utilization as a purine and also
126860	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
126860	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
126860	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
126862	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
126868	Mab-21 protein
126905	Sushi domain (SCR repeat)
126917	Intermediate filament protein
126963	Ribosomal protein L6e
126963	Ribosomal protein L6, N-terminal domain
126971	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
126971	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
127018	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
127059	7 transmembrane receptor (rhodopsin family)
127062	7 transmembrane receptor (rhodopsin family)
127063	7 transmembrane receptor (rhodopsin family)
127064	7 transmembrane receptor (rhodopsin family)
127065	7 transmembrane receptor (rhodopsin family)
127066	7 transmembrane receptor (rhodopsin family)
127068	7 transmembrane receptor (rhodopsin family)
127069	7 transmembrane receptor (rhodopsin family)
127071	7 transmembrane receptor (rhodopsin family)
127074	7 transmembrane receptor (rhodopsin family)
127077	7 transmembrane receptor (rhodopsin family)
127086	Ribosomal protein L10
127086	Ku70/Ku80 C-terminal arm. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the C terminal arm. Thi
127096	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
127099	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
127118	Ribosomal protein L23
127118	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
127118	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
127118	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
127124	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
127133	Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase
127148	7 transmembrane receptor (rhodopsin family)
127149	7 transmembrane receptor (rhodopsin family)
127150	Pentaxin family. Pentaxins are also known as pentraxins
127184	Polyprenyl synthetase
127223	Eukaryotic porin
127253	Uncharacterized ACR, COG1590
127257	Cyclophilin type peptidyl-prolyl cis-trans isomerase
127295	Ribosomal protein L36e
127300	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
127300	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
127324	Sugar (and other) transporter
127343	Homeobox domain
127343	Homeobox domain
127372	Ribosomal protein L15
127380	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
127385	7 transmembrane receptor (rhodopsin family)
127406	Ribosomal protein S2
127410	Cytochrome C oxidase subunit II, periplasmic domain
127534	Connexin
127540	HMG (high mobility group) box
127540	HMG (high mobility group) box
127545	Ribosomal L18ae protein family
127550	Glycosyltransferase family 6
127557	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
127572	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
127572	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
127572	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
127590	Ribosomal S3Ae family
127602	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
127608	7 transmembrane receptor (rhodopsin family)
127614	7 transmembrane receptor (rhodopsin family)
127617	7 transmembrane receptor (rhodopsin family)
127619	7 transmembrane receptor (rhodopsin family)
127621	7 transmembrane receptor (rhodopsin family)
127623	7 transmembrane receptor (rhodopsin family)
127707	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
127739	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
127739	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
127779	Poly-adenylate binding protein, unique domain
127784	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
127829	ADP-ribosylation factor family
127845	Got1-like family. Traffic through the yeast Golgi complex depends on a member of the syntaxin family of SNARE proteins, Sed5, present in early Golgi cisternae. Got1 is thought to facilitate Sed5-dependent fusion events
127850	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
127875	Ribosomal L29e protein family
127881	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
127884	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
127884	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
127887	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
127887	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
127933	Protein kinase domain
127933	Protein kinase domain
128066	Ribosomal protein S8
128100	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
128102	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
128103	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
128103	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
128136	Hydroxymethylglutaryl-coenzyme A synthase
128192	Cyclophilin type peptidyl-prolyl cis-trans isomerase
128209	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
128239	RasGAP C-terminus
128239	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
128240	YjeF-related protein N-terminus
128322	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
128337	7 transmembrane receptor (rhodopsin family)
128340	Glycosyl hydrolases family 18
128341	Glycosyl hydrolases family 18
128345	Glycosyl hydrolases family 18
128360	7 transmembrane receptor (rhodopsin family)
128361	7 transmembrane receptor (rhodopsin family)
128363	7 transmembrane receptor (rhodopsin family)
128365	7 transmembrane receptor (rhodopsin family)
128367	7 transmembrane receptor (rhodopsin family)
128368	7 transmembrane receptor (rhodopsin family)
128369	7 transmembrane receptor (rhodopsin family)
128370	7 transmembrane receptor (rhodopsin family)
128371	7 transmembrane receptor (rhodopsin family)
128372	7 transmembrane receptor (rhodopsin family)
128373	7 transmembrane receptor (rhodopsin family)
128387	TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme
128441	Domain of unknown function (DUF392). A eukaryotic specific domain of undetermined function.`
128488	WAP-type (Whey Acidic Protein) 'four-disulfide core'
128646	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
128665	7 transmembrane receptor (rhodopsin family)
128674	7 transmembrane receptor (rhodopsin family)
128762	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
128854	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
128872	HMG (high mobility group) box
129032	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
129049	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
129081	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
129097	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
129116	Aconitase family (aconitate hydratase)
129116	Aconitase C-terminal domain. Members of this family usually also match to pfam00330. This domain undergoes conformational change in the enzyme mechanism
129186	Intermediate filament protein
129316	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
129374	Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analogu
129402	'Cold-shock' DNA-binding domain
129426	Ribosomal protein L19e
129439	Actin
129481	Intermediate filament protein
129486	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
129499	HMG (high mobility group) box
129522	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
129522	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
129531	MIT domain
129560	S-adenosyl-L-homocysteine hydrolase
129560	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
129602	Phosphatidylinositol 3- and 4-kinase
129607	Thymidylate kinase
129684	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
129684	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
129804	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
129841	Ribosomal protein S2
129844	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
129868	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
129870	HMG (high mobility group) box
130023	S-adenosyl-L-homocysteine hydrolase
130023	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
130075	7 transmembrane receptor (rhodopsin family)
130120	Lectin C-type domain. This family includes both long and short form C-type
130195	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
130271	PH domain. PH stands for pleckstrin homology
130271	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
130271	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
130340	Clathrin adaptor complex small chain
130340	Clathrin adaptor complex small chain
130340	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
130367	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
130399	Protein kinase domain
130399	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
130429	Ribosomal protein S2
130430	Translationally controlled tumor protein
130450	Vinculin family
130502	TPR Domain
130505	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
130517	Sulfotransferase protein
130535	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
130589	Aldose 1-epimerase
130600	Ribosomal protein L14p/L23e
130633	Zinc finger, C3HC4 type (RING finger)
130633	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
130672	Ribosomal protein S19
130678	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
130728	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
130752	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
130773	Ribosomal protein L23
130888	F-box domain
130916	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
130916	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
130985	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
131034	C2 domain
131055	Cyclophilin type peptidyl-prolyl cis-trans isomerase
131096	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
131096	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
131117	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
131118	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
131149	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
131149	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
131185	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
131185	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
131284	7 transmembrane receptor (rhodopsin family)
131364	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
131375	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzyme
131377	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
131377	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
131405	B-box zinc finger
131405	Filamin/ABP280 repeat
131405	Ribosomal protein L23
131405	NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in some members, proteolysis has shown that the Peptidyl-alph
131523	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
131566	LCCL domain
131566	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
131572	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
131691	Cyclophilin type peptidyl-prolyl cis-trans isomerase
131692	Cyclophilin type peptidyl-prolyl cis-trans isomerase
131692	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
131873	von Willebrand factor type A domain
131880	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
131889	Intermediate filament protein
131961	FMN-dependent dehydrogenase
131961	2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor
131961	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
131973	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa
132048	3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
132049	Polyribonucleotide nucleotidyltransferase, RNA binding domain. This family contains the RNA binding domain of Polyribonucleotide nucleotidyltransferase (PNPase) PNPase is involved in mRNA degradation in a 3'-5' direction
132049	3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
132049	3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
132170	S25 ribosomal protein
132187	Protein kinase domain
132204	Synaptophysin / synaptoporin
132205	Intermediate filament protein
132243	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
132391	Intermediate filament protein
132430	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
132524	Peptidase family M49
132535	Ribosomal protein S8
132556	NAC domain
132562	Zinc-binding dehydrogenase
132602	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
132612	Adenosine-deaminase (editase) domain
132689	B-box zinc finger
132689	Zinc finger, C3HC4 type (RING finger)
132689	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
132704	Intermediate filament protein
132719	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
132719	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
132722	Trypsin
132724	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
132724	Trypsin
132724	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
132789	Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase
132798	UDP-glucoronosyl and UDP-glucosyl transferase
132864	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
132869	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
132869	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
132929	Intermediate filament protein
132946	ADP-ribosylation factor family
132949	AMP-binding enzyme
132949	Phosphopantetheine attachment site. A 4'-phosphopantetheine prosthetic group is attached through a serine. This prosthetic group acts as a a 'swinging arm' for the attachment of activated fatty acid and amino-acid groups. This domain forms a four helix b
132954	Phosducin
133052	Intermediate filament protein
133060	Protein of unknown function, DUF270
133083	Insulinase (Peptidase family M16)
133121	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
133205	TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway
133214	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
133256	NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane
133277	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
133277	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
133299	Exo70 exocyst complex subunit. The Exo70 protein forms one subunit of the exocyst complex. First discovered in S. cerevisiae , Exo70 and other exocyst proteins have been observed in several other eukaryotes, including humans. In S. cerevisiae, the exocyst
133308	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
133320	Ribosomal protein S26e
133332	Ribosomal protein S5, C-terminal domain
133384	Transforming growth factor beta like domain
133418	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
133482	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
133482	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
133486	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
133486	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
133539	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
133569	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
133584	Fibronectin type III domain
133584	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
133584	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
133584	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
133609	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
133628	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
133629	Ribosomal protein L5
133629	ribosomal L5P family C-terminus. This region is found associated with pfam00281
133635	Ribosomal protein L44
133666	Ribosomal protein L19e
133686	ATP-NAD kinase. Members of this family are ATP-NAD kinases EC:2.7.1.23. Catalyses the phosphorylation of NAD to NADP utilizing ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus
133688	UDP-glucoronosyl and UDP-glucosyl transferase
133730	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
133744	Ribosomal S3Ae family
133789	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
133790	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
134082	7 transmembrane receptor (rhodopsin family)
134083	7 transmembrane receptor (rhodopsin family)
134111	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
134187	Homeobox domain
134265	PH domain. PH stands for pleckstrin homology
134337	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
134356	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
134357	Aconitase family (aconitate hydratase)
134357	Aconitase C-terminal domain. Members of this family usually also match to pfam00330. This domain undergoes conformational change in the enzyme mechanism
134391	7 transmembrane receptor (rhodopsin family)
134412	Intermediate filament protein
134429	START domain
134430	Utp21 specific WD40 associated domain. Utp21 is a subunit of U3 snoRNP, which is essential for synthesis of 18S rRNA
134469	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
134478	Actin
134479	Intermediate filament protein
134492	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
134494	Tropomyosin
134502	PH domain. PH stands for pleckstrin homology
134502	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
134530	Cyclophilin type peptidyl-prolyl cis-trans isomerase
134579	Ribosomal protein S7e
134581	Intermediate filament protein
134611	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
134637	Cytidine and deoxycytidylate deaminase zinc-binding region
134725	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
134800	Stathmin family
134829	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
134829	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
134850	Ribosomal protein L21e
134860	7 transmembrane receptor (rhodopsin family)
134864	7 transmembrane receptor (rhodopsin family)
134870	Ribosomal protein L23
134897	SH2 domain
134897	Protein-tyrosine phosphatase
135112	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
135112	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
135114	HIT family
135152	Glycosyltransferase family 43
135228	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
135250	Class I Histocompatibility antigen, domains alpha 1 and 2
135257	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
135270	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
135293	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
135458	Hus1-like protein. Hus1, Rad1, and Rad9 are three evolutionarily conserved proteins required for checkpoint control in fission yeast. These proteins are known to form a stable complex in vivo. Hus1-Rad1-Rad9 complex may form a PCNA-like ring structure, a
135521	FKBP-type peptidyl-prolyl cis-trans isomerase
135522	Intermediate filament protein
135892	B-box zinc finger
135892	B-box zinc finger
135892	Zinc finger, C3HC4 type (RING finger)
135892	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
135896	7 transmembrane receptor (rhodopsin family)
135905	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
135924	7 transmembrane receptor (rhodopsin family)
135935	Homeobox domain
135935	Cyclophilin type peptidyl-prolyl cis-trans isomerase
135935	Herpesvirus transcription activation factor (transactivator). This family includes EBV BRLF1 and similar ORF 50 proteins from other herpesviruses
135941	7 transmembrane receptor (rhodopsin family)
135942	7 transmembrane receptor (rhodopsin family)
135943	7 transmembrane receptor (rhodopsin family)
135944	7 transmembrane receptor (rhodopsin family)
135946	7 transmembrane receptor (rhodopsin family)
135948	7 transmembrane receptor (rhodopsin family)
136046	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
136132	Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
136143	Eukaryotic ribosomal protein L18
136178	GDP dissociation inhibitor
136242	Trypsin
136248	EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains
136256	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
136306	Sugar (and other) transporter
136306	Sugar (and other) transporter
136307	FMN-dependent dehydrogenase
136307	2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor
136307	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
136321	Ribosomal L15
136337	Elongation factor 1 gamma, conserved domain
136371	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
136505	Ribosomal protein L21e
136992	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
137075	PMP-22/EMP/MP20/Claudin family
137107	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
137133	Aminotransferase class-V
137165	Ribosomal protein L3
137202	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t
137207	CUB domain
137520	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
137814	Homeobox domain
137839	Protein kinase domain
137885	Cyclophilin type peptidyl-prolyl cis-trans isomerase
137886	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
137902	Animal haem peroxidase
137902	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
137964	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
138031	N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation o
138040	Intermediate filament protein
138115	S-adenosyl-L-homocysteine hydrolase
138151	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
138160	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
138198	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
138267	Putative snoRNA binding domain. This family consists of various Pre RNA processing ribonucleoproteins. The function of the aligned region is unknown however it may be a common RNA or snoRNA or Nop1p binding domain. Nop5p (Nop58p) from yeast is the protein
138412	Mitochondrial carrier protein
138414	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
138416	Intermediate filament protein
138428	Peptidyl-tRNA hydrolase
138453	Ribosomal protein S5, C-terminal domain
138453	Ribosomal protein S5, N-terminal domain
138512	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
138512	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
138562	14-3-3 protein
138604	Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes
138649	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
138652	Trypsin
138715	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
138721	Glutamine synthetase, beta-Grasp domain
138721	Glutamine synthetase, catalytic domain
138741	7 transmembrane receptor (rhodopsin family)
138742	7 transmembrane receptor (rhodopsin family)
138748	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
138799	7 transmembrane receptor (rhodopsin family)
138802	7 transmembrane receptor (rhodopsin family)
138803	7 transmembrane receptor (rhodopsin family)
138804	7 transmembrane receptor (rhodopsin family)
138805	7 transmembrane receptor (rhodopsin family)
138850	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
138855	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
138879	Intermediate filament protein
138881	7 transmembrane receptor (rhodopsin family)
138882	7 transmembrane receptor (rhodopsin family)
138883	7 transmembrane receptor (rhodopsin family)
138888	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
138924	Cyclophilin type peptidyl-prolyl cis-trans isomerase
138925	Giardia variant-specific surface protein
138926	Giardia variant-specific surface protein
138932	Microtubule associated protein (MAP65/ASE1 family)
138961	Nucleoside diphosphate kinase
138971	Peptidase family M49
139060	Intermediate filament protein
139065	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
139075	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
139076	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
139081	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
139081	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
139084	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
139115	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
139125	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
139170	WD domain, G-beta repeat
139171	Cytochrome C and Quinol oxidase polypeptide I
139171	NADH-ubiquinone/plastoquinone oxidoreductase chain 4L
139189	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
139189	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
139189	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
139193	HMG (high mobility group) box
139211	Intermediate filament protein
139216	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
139216	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
139226	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
139242	Ribosomal protein S17
139270	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
139270	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
139271	7 transmembrane receptor (rhodopsin family)
139272	HSF-type DNA-binding
139272	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
139275	eRF1 domain 1. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
139275	eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
139279	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
139281	Intermediate filament protein
139282	14-3-3 protein
139284	SH2 domain
139284	Phosphotyrosine interaction domain (PTB/PID)
139302	ENV polyprotein (coat polyprotein)
139304	Ribosomal protein L6e
139306	HMG (high mobility group) box
139309	Ku70/Ku80 C-terminal arm. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the C terminal arm. Thi
139309	Ku70/Ku80 N-terminal alpha/beta domain. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the amino
139309	Ku70/Ku80 beta-barrel domain. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the central DNA-bin
139318	Cyclophilin type peptidyl-prolyl cis-trans isomerase
139324	Homeobox domain
139334	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh
139334	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
139340	Polyprenyl synthetase
139362	Intermediate filament protein
139363	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
139378	7 transmembrane receptor (Secretin family)
139385	Intermediate filament protein
139390	Cyclic nucleotide-binding domain
139390	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
139397	Ribosomal protein L3
139401	Translationally controlled tumor protein
139411	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
139422	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
139422	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
139423	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
139425	WD domain, G-beta repeat
139431	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
139433	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
139451	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
139452	Ribosomal protein L6e
139457	Translationally controlled tumor protein
139473	7 transmembrane receptor (rhodopsin family)
139497	ATP synthase subunit C
139542	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
139543	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
139543	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
139562	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
139576	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
139599	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
139604	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
139628	Fork head domain
139647	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
139649	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
139707	Fork head domain
139728	Protein kinase domain
139735	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
139735	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
139741	Actin
139748	Intermediate filament protein
139760	7 transmembrane receptor (rhodopsin family)
139760	7 transmembrane receptor (rhodopsin family)
139805	Ribosomal protein L36e
139808	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
139808	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
139835	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
139839	Intermediate filament protein
139839	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
139871	7 transmembrane receptor (rhodopsin family)
139880	Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes
139886	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
139927	Ribosomal protein S7e
139932	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
139952	Protein kinase domain
139952	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
139957	Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellul
140004	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
140028	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
140036	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
140038	Trypsin
140065	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
140100	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
140101	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
140123	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
140129	'chromo' (CHRromatin Organization MOdifier) domain
140129	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
140149	'chromo' (CHRromatin Organization MOdifier) domain
140149	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
140164	PXA domain. This domain is associated with PX domains pfam00787
140338	Ribosomal protein L23
140338	Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a
140446	Intermediate filament protein
140447	Calx-beta domain
140447	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
140448	Calx-beta domain
140448	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
140453	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
140456	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
140458	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
140459	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
140459	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
140473	7 transmembrane receptor (rhodopsin family)
140474	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
140474	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
140477	Homeobox domain
140479	7 transmembrane receptor (rhodopsin family)
140481	Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism
140482	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
140489	Helix-loop-helix DNA-binding domain
140491	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase,
140491	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
140492	Calcium-activated SK potassium channel
140492	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
140493	Calcium-activated SK potassium channel
140493	pfam02888, CaMBD, Calmodulin binding domain. Small-conductance Ca2+-activated K+ channels (SK channels) are independent of voltage and gated solely by intracellular Ca2+. These membrane channels are heteromeric complexes that comprise pore-forming alpha-
140494	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
140497	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
140498	7 transmembrane receptor (rhodopsin family)
140500	PH domain. PH stands for pleckstrin homology
140500	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
140500	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
140500	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
140547	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
140568	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
140571	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
140576	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
140577	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
140578	Lectin C-type domain. This family includes both long and short form C-type
140584	wnt family
140585	Protein kinase domain
140586	HMG (high mobility group) box
140592	Uncharacterized ACR, COG1579
140592	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
140592	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
140592	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
140592	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
140593	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result
140594	C2 domain
140594	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
140595	7 transmembrane receptor (rhodopsin family)
140625	Actin
140638	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
140654	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
140654	Ribosomal protein S3, C-terminal domain. This family contains a central domain pfam00013, hence the amino and carboxyl terminal domains are stored separately. This is a minimal carboxyl-terminal domain. Some are much longer
140655	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
140661	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
140667	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
140670	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
140670	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
140674	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
140674	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
140675	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
140675	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
140679	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
140683	Coronavirus nucleocapsid protein
140693	PH domain. PH stands for pleckstrin homology
140694	Dynamin GTPase effector domain
140694	PH domain. PH stands for pleckstrin homology
140694	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
140700	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
140701	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
140721	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
140723	PMP-22/EMP/MP20/Claudin family
140724	PMP-22/EMP/MP20/Claudin family
140725	PMP-22/EMP/MP20/Claudin family
140726	PMP-22/EMP/MP20/Claudin family
140728	PMP-22/EMP/MP20/Claudin family
140729	PMP-22/EMP/MP20/Claudin family
140732	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
140734	Dynein light chain type 1
140735	Dynein light chain type 1
140766	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
140766	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
140766	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
140766	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
140781	Myosin head (motor domain)
140795	7 transmembrane receptor (rhodopsin family)
140801	Ribosomal L10
140803	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a
140803	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
140807	Intermediate filament protein
140810	Protein kinase domain
140826	Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids
140836	Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration
140856	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
140860	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
140860	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
140866	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
140866	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
140868	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
140870	WAP-type (Whey Acidic Protein) 'four-disulfide core'
140894	Cyclic nucleotide-binding domain
140898	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
140902	SCP-like extracellular protein. This domain is also found in prokaryotes
140910	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues.
140915	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
140915	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
140922	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
140926	Daxx Family. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis, but precise role played by Daxx remians to be determined
140928	3'5'-cyclic nucleotide phosphodiesterase
140929	3'5'-cyclic nucleotide phosphodiesterase
140931	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
140932	Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway
140933	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
140935	PH domain. PH stands for pleckstrin homology
140939	B-box zinc finger
140941	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result
140943	HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats
140944	Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin
140945	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
140945	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
140946	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
140946	AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this
142679	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
142680	Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor
142681	Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor
142685	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
142688	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
142688	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
142689	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
142827	AMP-binding enzyme
142850	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
142891	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
142901	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal
142910	ab-hydrolase associated lipase region
142980	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
143038	Enolase, C-terminal TIM barrel domain
143098	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
143125	Ribosomal protein L21e
143158	PH domain. PH stands for pleckstrin homology
143158	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
143158	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
143243	Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold
143244	Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold
143339	Homeobox domain
143383	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
143425	C2 domain
143496	7 transmembrane receptor (rhodopsin family)
143502	7 transmembrane receptor (rhodopsin family)
143503	7 transmembrane receptor (rhodopsin family)
143543	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
143548	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL
143689	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
143689	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
143712	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
143712	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
143828	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
143879	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
143879	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
143903	Lectin C-type domain. This family includes both long and short form C-type
143903	Lectin C-type domain. This family includes both long and short form C-type
144124	7 transmembrane receptor (rhodopsin family)
144124	7 transmembrane receptor (rhodopsin family)
144125	7 transmembrane receptor (rhodopsin family)
144132	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
144195	Sugar (and other) transporter
144247	Ribosomal protein S2
144253	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
144402	C2 domain
144404	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
144423	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari
144448	Tetraspanin family
144453	Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown
144455	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
144483	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
144501	Intermediate filament protein
144501	Intermediate filament protein
144501	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
144568	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
144581	Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14
144715	Rad9. Rad9 is required for transient cell-cycle arrests and transcriptional induction of DNA repair in response to DNA damage
144717	PH domain. PH stands for pleckstrin homology
144767	Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species
144983	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
145009	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
145047	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
145173	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
145226	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
145258	Homeobox domain
145264	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
145288	Ribosomal protein L11, RNA binding domain
145288	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
145371	ADP-ribosylation factor family
145371	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
145371	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
145389	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
145414	Ribosomal protein L3
145497	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
145501	AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges
145501	Thrombospondin type 1 domain
145501	AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges
145567	Coatomer WD associated domain
145581	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
145767	Ribosomal S3Ae family
145790	C2 domain
145799	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
145800	Dynamin GTPase effector domain
145814	Pyroglutamyl peptidase
145990	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
146053	Ribosomal S3Ae family
146057	Protein kinase domain
146057	Intermediate filament protein
146110	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
146223	Synaptogyrin. This family of proteins is distantly related to pfam01284
146223	Synaptogyrin. This family of proteins is distantly related to pfam01284
146223	Synaptogyrin. This family of proteins is distantly related to pfam01284
146279	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
146310	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina result
146347	Ribosomal protein S2
146395	PMP-22/EMP/MP20/Claudin family
146434	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
146456	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
146485	Vacuolar protein sorting-associated protein 35. Vacuolar protein sorting-associated protein (Vps) 35 is one of around 50 proteins involved in protein trafficking. In particular, Vps35 assembles into a retromer complex with at least four other proteins Vps
146488	Sodium:neurotransmitter symporter family
146603	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
146656	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
146691	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
146754	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
146818	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
147015	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
147040	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
147072	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
147079	Ribosomal protein S26e
147111	Pectinacetylesterase
147111	Pectinacetylesterase
147166	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
147183	Intermediate filament protein
147183	Intermediate filament protein
147199	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
147228	Intermediate filament protein
147323	START domain
147346	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
147358	Ribosomal S3Ae family
147374	Glutamate/Leucine/Phenylalanine/Valine dehydrogenase
147381	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
147398	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
147400	Helper component proteinase. This protein is found in genome polyproteins of potyviruses
147400	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
147407	Mitochondrial carrier protein
147409	Cadherin domain
147409	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
147655	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
147658	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
147664	ENV polyprotein (coat polyprotein)
147686	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
147687	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
147694	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
147699	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
147740	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
147740	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
147742	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
147804	Tropomyosin
147807	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
147808	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
147837	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
147923	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
147980	Ribosomal protein L21e
148003	Galactoside-binding lectin
148064	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
148066	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
148066	Zinc finger, C3HC4 type (RING finger)
148066	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
148103	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
148203	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
148206	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
148254	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
148276	7 transmembrane receptor (rhodopsin family)
148281	C2 domain
148330	Ribosomal protein S2
148356	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
148362	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
148370	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
148430	Ribosomal protein S5, C-terminal domain
148430	Ribosomal protein S5, N-terminal domain
148446	Reeler domain
148479	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
148549	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
148640	Ribosomal protein S19
148709	Actin
148713	Protein-tyrosine phosphatase
148789	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
148854	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
148864	Homeobox domain
148864	LIM domain. This family represents two copies of the LIM structural domain
148932	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
148947	Ribosomal protein S2
148953	Ribosomal protein L21e
148955	Transcription initiation factor IIF, beta subunit. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIF (TFIIF) is a tetramer of two beta subun
148960	Ribosomal protein S5, C-terminal domain
148960	Ribosomal protein S5, N-terminal domain
148995	Ribosomal protein S7e
149041	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
149050	Aminotransferase class-V
149086	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
149111	Cornichon protein
149224	Ribosomal protein S7e
149318	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
149329	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
149360	Ribosomal protein L24e
149461	PMP-22/EMP/MP20/Claudin family
149476	Ribosomal protein S8
149501	Intermediate filament protein
149501	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
149620	Glycosyl hydrolases family 18
149620	Chitin binding Peritrophin-A domain. This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruse
149628	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
149767	HMG (high mobility group) box
150000	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
150000	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
150160	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
150200	Ubiquitin carboxyl-terminal hydrolase family 2
150207	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
150209	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
150212	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
150221	Fibronectin type III domain
150221	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
150280	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
150287	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
150356	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
150379	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o
150400	Ribosomal L29 protein
150445	Phosphatidylethanolamine-binding protein
150465	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
150472	Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans
150483	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
150498	14-3-3 protein
150519	Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included
150572	MYND finger
150572	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
150580	HMG14 and HMG17
150610	Phosphorylase family 2
150681	7 transmembrane receptor (rhodopsin family)
150739	Intermediate filament protein
150739	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
150771	Mab-21 protein
150771	Mab-21 protein
150821	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
150868	Ribosomal protein L11, RNA binding domain
150868	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
150940	WD domain, G-beta repeat
150948	PMP-22/EMP/MP20/Claudin family
150969	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
150978	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
150984	Eukaryotic porin
151011	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
151011	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
151056	Lipase/Acylhydrolase with GDSL-like motif
151094	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
151155	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
151195	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
151234	Sulfotransferase protein
151258	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
151295	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predic
151313	Fumarylacetoacetate (FAA) hydrolase family. This family consists of fumarylacetoacetate (FAA) hydrolase, or fumarylacetoacetate hydrolase (FAH) and it also includes HHDD isomerase/OPET decarboxylase from E. coli strain W. FAA is the last enzyme in the tyr
151320	HMG (high mobility group) box
151437	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
151449	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
151449	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
151451	Transposase family tnp2
151457	TGS domain. The TGS domain is named after ThrRS, GTPase, and SpoT. Interestingly, TGS domain was detected also at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochae
151470	Tropomyosin
151516	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
151561	Core histone H2A/H2B/H3/H4
151579	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
151723	Homeobox domain
151741	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
151790	WD domain, G-beta repeat
151825	Intermediate filament protein
151835	C2 domain
151956	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
151963	Mab-21 protein
151963	Mab-21 protein
151973	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
152002	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl
152015	Regulatory subunit of type II PKA R-subunit
152059	CAP protein
152098	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
152101	Homeobox domain
152160	Ribosomal protein L21e
152330	Fibronectin type III domain
152330	Fibronectin type III domain
152330	Fibronectin type III domain
152503	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
152504	Ribosomal S3Ae family
152559	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h
152579	Sec1 family
152682	NAC domain
152829	WD domain, G-beta repeat
152831	Glycosyl hydrolase family 1
152890	Glycosyl hydrolases family 2, sugar binding domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities and has a jelly-roll fold
152905	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
152922	Nucleotide-sensitive chloride conductance regulator (ICln)
152972	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
153027	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
153129	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
153201	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
153217	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
153218	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
153328	Mitochondrial carrier protein
153364	Metallo-beta-lactamase superfamily
153393	EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains
153561	Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
153565	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
153642	Sulfatase
153817	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
153886	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
154062	Nitroreductase family. Members of this family utilise FMN as a cofactor
154064	Class I Histocompatibility antigen, domains alpha 1 and 2
154091	Sugar (and other) transporter
154094	Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown
154150	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
154165	Ribosomal protein L21e
154337	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
154337	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
154516	UDP-glucoronosyl and UDP-glucosyl transferase
154545	Ribosomal S17
154562	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
154661	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
154849	14-3-3 protein
154865	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
154865	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
154866	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
154881	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
154982	Protein kinase domain
154985	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
154995	EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains
155012	Sushi domain (SCR repeat)
155051	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
155054	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
155100	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
155205	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
155268	PMP-22/EMP/MP20/Claudin family
155423	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
155696	LCCL domain
155696	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
155812	Fibronectin type III domain
155812	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
156726	Runt domain
156873	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
157285	Protein kinase domain
157295	Mammalian defensin
157295	Defensin propeptide
157310	Phosphatidylethanolamine-binding protein
157317	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
157506	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
157543	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
157552	Ribosomal protein S2
157567	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
157574	F-box domain
157589	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
157589	Pentapeptide repeats (8 copies). These repeats are found in many cyanobacterial proteins. The repeats were first identified in hglK. The function of these repeats is unknown. The structure of this repeat has been predicted to be a beta-helix. The repeat c
157589	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
157667	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
157708	Signal peptidase I
157777	MCM2/3/5 family
157784	Tropomyosin
157793	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
157807	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
157807	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
157848	Homeobox domain
158056	MAM domain. An extracellular domain found in many receptors
158062	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
158067	Adenylate kinase
158104	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
158125	WD domain, G-beta repeat
158131	7 transmembrane receptor (rhodopsin family)
158135	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
158160	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
158160	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
158200	Ribosomal protein S26e
158204	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
158234	Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue
158237	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
158237	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
158284	Protein kinase domain
158345	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
158352	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
158381	A20-like zinc finger. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation
158399	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
158399	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
158452	Cyclin-dependent kinase regulatory subunit
158452	Arginosuccinate synthase. This family contains a PP-loop motif
158473	Ribosomal protein S2
158624	FMN-dependent dehydrogenase
158624	2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor
158624	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh
158624	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
158656	Ribosomal protein S2
158668	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
158678	Ribosomal protein S2
158747	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
158773	Ribosomal protein S5, C-terminal domain
158773	Ribosomal protein S5, N-terminal domain
158777	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
158800	Homeobox domain
158809	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
158833	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
158835	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
158880	Ubiquitin carboxyl-terminal hydrolase family 2
158880	Ubiquitin carboxyl-terminal hydrolases family 2
158943	RanBP1 domain
158983	Core histone H2A/H2B/H3/H4
158997	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
159119	HSF-type DNA-binding
159163	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
159184	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
159296	Homeobox domain
159371	Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region
159686	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
159815	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
159963	Sodium:solute symporter family
160382	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
160418	TPR Domain
160428	pfam02911, formyl_trans_C, Formyl transferase, C-terminal domain
160428	Formyl transferase. This family includes the following members. Glycinamide ribonucleotide transformylase catalyses the third step in de novo purine biosynthesis, the transfer of a formyl group to 5'-phosphoribosylglycinamide. Formyltetrahydrofolate defor
160428	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
160492	Intermediate filament tail domain
160518	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
160518	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
160518	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
160518	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
160518	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
160622	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
160851	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
160851	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
161003	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
161253	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
161291	LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo
161356	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
161357	MAM domain. An extracellular domain found in many receptors
161452	HMG (high mobility group) box
161476	Hsp20/alpha crystallin family
161477	Ribosomal protein S15
161717	Dynamin GTPase effector domain
161823	Adenosine/AMP deaminase
162073	Mab-21 protein
162282	Fibronectin type III domain
162466	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
162494	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
162514	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
162540	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
162540	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
162605	Intermediate filament protein
162605	Intermediate filament protein
162655	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
162835	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
162952	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
162962	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
162968	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
162993	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
162998	7 transmembrane receptor (rhodopsin family)
163049	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
163050	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
163051	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
163059	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
163071	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
163175	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function,
163201	7 transmembrane receptor (rhodopsin family)
163233	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
163351	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
163404	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
163486	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
163486	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
163486	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
163589	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
163742	Mitochondrial carrier protein
163742	Mitochondrial carrier protein
163782	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
163786	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
164022	Cyclophilin type peptidyl-prolyl cis-trans isomerase
164022	Cyclophilin type peptidyl-prolyl cis-trans isomerase
164036	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
164036	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
164045	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
164091	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It h
164091	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
164153	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
164395	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
164633	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
164668	Cytidine and deoxycytidylate deaminase zinc-binding region
164668	Cytidine and deoxycytidylate deaminase zinc-binding region
164781	WD domain, G-beta repeat
164832	Zinc finger, C3HC4 type (RING finger)
164832	ATP-dependent protease La (LON) domain
165082	7 transmembrane receptor (Secretin family)
165140	7 transmembrane receptor (rhodopsin family)
165257	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
165324	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
165324	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
165829	7 transmembrane receptor (metabotropic glutamate family)
166012	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
166336	LIM domain. This family represents two copies of the LIM structural domain
166348	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
166348	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
166379	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
166496	TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway
166522	SH2 domain
166647	7 transmembrane receptor (Secretin family)
166647	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
166647	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
166657	B-box zinc finger
166657	Zinc finger, C3HC4 type (RING finger)
166657	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
166752	Calx-beta domain
166752	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
166793	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
166815	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d
166815	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
166968	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
166974	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
166975	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
167127	UDP-glucoronosyl and UDP-glucosyl transferase
167227	NUDIX domain
167681	Trypsin
167826	Helix-loop-helix DNA-binding domain
168283	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
168391	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
168433	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
168474	AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the d
168507	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
168507	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
168507	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
168507	REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six c
168620	Helix-loop-helix DNA-binding domain
168667	von Willebrand factor type D domain
168667	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
168975	Cyclic nucleotide-binding domain
169044	von Willebrand factor type A domain
169044	Thrombospondin N-terminal -like domain
169166	PX domain. PX domains bind to phosphoinositides
169355	Indoleamine 2,3-dioxygenase
169436	Protein kinase domain
169505	Shikimate kinase
169522	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
169522	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
169611	Olfactomedin-like domain
169611	Olfactomedin-like domain
169714	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
169831	PX domain. PX domains bind to phosphoinositides
169841	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
169981	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
170106	Actin
170302	Homeobox domain
170384	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
170392	Zona pellucida-like domain
170439	GNS1/SUR4 family
170441	Sugar (and other) transporter
170442	Gamma-butyrobetaine hydroxylase. Members of this family are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1
170460	START domain
170461	START domain
170465	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
170483	Bacterial extracellular solute-binding proteins, family 3
170483	Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors
170484	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
170496	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
170501	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
170520	Globin
170538	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
170544	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
170567	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
170568	Zona pellucida-like domain
170571	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
170571	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
170572	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
170572	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
170573	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
170574	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
170575	GTPase of unknown function
170577	Kinase associated domain 1
170587	Citrate synthase
170591	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
170625	PX domain. PX domains bind to phosphoinositides
170633	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
170634	7 transmembrane receptor (metabotropic glutamate family)
170634	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
170635	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
170639	7 transmembrane receptor (rhodopsin family)
170641	Galactose binding lectin domain
170641	7 transmembrane receptor (Secretin family)
170641	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
170641	Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein cou
170643	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
170667	ATP synthase subunit C
170668	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
170669	Utp11 protein. This protein is found to be part of a large ribonucleoprotein complex containing the U3 snoRNA. Depletion of the Utp proteins impedes production of the 18S rRNA, indicating that they are part of the active pre-rRNA processing complex. This
170670	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
170672	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
170672	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
170673	Paralemmin
170676	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
170677	Cadherin domain
170685	NUDIX domain
170689	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
170689	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
170690	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
170690	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
170691	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
170691	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
170692	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
170692	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
170698	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
170698	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
170700	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling
170704	7 transmembrane receptor (rhodopsin family)
170714	Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyn
170715	Poly-adenylate binding protein, unique domain
170716	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
170719	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
170720	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
170721	Thrombospondin type 1 domain
170721	Thrombospondin type 1 domain
170721	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
170721	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
170722	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucl
170722	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucle
170725	Calpain family cysteine protease
170725	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
170729	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
170732	7 transmembrane receptor (rhodopsin family)
170733	Lectin C-type domain. This family includes both long and short form C-type
170735	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
170735	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
170736	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
170738	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
170738	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
170738	Cyclic nucleotide-binding domain
170738	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
170739	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
170743	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
170744	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
170752	Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequenc
170755	PX domain. PX domains bind to phosphoinositides
170755	PX domain. PX domains bind to phosphoinositides
170756	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
170757	7 transmembrane receptor (Secretin family)
170757	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
170760	Acyl CoA binding protein
170760	Acyl CoA binding protein
170762	Non-repetitive/WGA-negative nucleoporin. This is a family of nucleoporin proteins. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells, and mediate bidirectional nucleocytoplasmic transport, especially of mRNA and protein
170768	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta
170768	6-phosphofructo-2-kinase. This enzyme occurs as a bifunctional enzyme with fructose-2,6-bisphosphatase. The bifunctional enzyme catalyses both the synthesis and degradation of fructose-2,6-bisphosphate, a potent regulator of glycolysis. This enzyme conta
170774	Caveolin
170776	Lectin C-type domain. This family includes both long and short form C-type
170779	Lectin C-type domain. This family includes both long and short form C-type
170780	Lectin C-type domain. This family includes both long and short form C-type
170787	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
170808	Calpain family cysteine protease
170808	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
170813	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
170816	7 transmembrane receptor (rhodopsin family)
170825	Homeobox domain
170829	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
170842	BTG1 family. A novel family of anti-proliferative proteins
170847	Inward rectifier potassium channel
170848	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
170848	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
170849	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
170850	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
170850	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
170850	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
170850	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
170895	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
170898	Gamma-butyrobetaine hydroxylase. Members of this family are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1
170899	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
170899	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
170903	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
170905	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
170907	TMS membrane protein/tumour differentially expressed protein (TDE)
170908	Protein kinase domain
170909	Ets-domain
170909	Sterile alpha motif (SAM)/Pointed domain
170911	Phosphatidylinositol 3- and 4-kinase
170911	PI3-kinase family, p85-binding domain
170911	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
170911	Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation
170911	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
170914	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
170917	FAD binding domain of DNA photolyase
170920	Protein kinase domain
170920	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
170921	Integrin alpha cytoplasmic region. This family contains the short intracellular region of integrin alpha chains
170922	Protein kinase domain
170924	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
170924	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
170925	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
170925	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
170926	7 transmembrane receptor (rhodopsin family)
170928	Uncharacterized ACR, YneC family COG1359
170929	Apoptosis regulator proteins, Bcl-2 family
170936	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
170936	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
170936	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
170938	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
170943	Mitochondrial carrier protein
170945	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
170945	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
170956	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment
170959	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
170961	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
171009	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
171011	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
171011	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
171015	Oxidoreductase FAD-binding domain
171015	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
171016	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
171019	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
171019	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
171024	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
171026	Protein kinase A anchor
171039	C2 domain
171039	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
171044	7 transmembrane receptor (rhodopsin family)
171045	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
171045	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
171045	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidase
171046	Helix-loop-helix DNA-binding domain
171048	Tetraspanin family
171049	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
171050	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
171052	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
171052	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
171052	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases
171053	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
171054	7 transmembrane receptor (rhodopsin family)
171056	7 transmembrane receptor (rhodopsin family)
171057	Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans
171058	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
171062	PMP-22/EMP/MP20/Claudin family
171065	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
171068	Homeobox domain
171068	Pou domain - N-terminal to homeobox domain
171072	Sulfotransferase protein
171073	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
171074	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
171078	Helix-loop-helix DNA-binding domain
171079	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
171082	ATP synthase subunit C
171084	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
171085	Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic
171085	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
171087	Uncharacterised protein family (UPF0041)
171090	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
171091	Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this re
171092	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
171097	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
171104	7 transmembrane receptor (rhodopsin family)
171105	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
171106	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
171108	7 transmembrane receptor (rhodopsin family)
171109	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
171114	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
171115	3'5'-cyclic nucleotide phosphodiesterase
171118	Carboxylesterase
171119	Fibronectin type III domain
171123	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
171126	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
171131	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
171131	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
171133	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl grou
171134	Galactoside-binding lectin
171135	Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hi
171136	CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and
171136	CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily co
171136	CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conser
171136	CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and
171136	CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily con
171136	CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserv
171138	Slow voltage-gated potassium channel
171142	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
171143	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
171144	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
171144	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
171146	Cyclic nucleotide-binding domain
171146	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
171146	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
171147	Sugar (and other) transporter
171151	Mitochondrial carrier protein
171152	Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tra
171154	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
171155	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
171162	Lectin C-type domain. This family includes both long and short form C-type
171163	Sodium:neurotransmitter symporter family
171164	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
171164	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
171166	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
171167	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
171171	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
171180	C2 domain
171183	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171184	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171185	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171186	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171187	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171188	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171189	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171190	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171191	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171192	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171193	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171194	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171195	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171195	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171196	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171197	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171198	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171199	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171200	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171201	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171202	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171203	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171205	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171206	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171207	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
171224	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171225	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171226	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171227	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171228	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171229	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171230	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171231	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171232	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171233	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171234	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171235	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171236	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171237	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171238	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171239	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171240	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171241	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171242	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171243	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171244	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171245	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171246	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171247	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171248	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171250	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171251	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171252	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171253	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171254	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171255	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171256	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171257	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171258	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171259	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171260	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171261	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171262	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171263	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171264	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171265	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171266	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171267	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171268	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171269	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171270	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171271	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171272	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171273	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171274	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171275	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171277	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171279	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
171287	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
171291	Kinesin motor domain
171293	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whi
171296	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
171297	Carboxylesterase
171299	Fork head domain
171333	Fibronectin type III domain
171341	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
171350	Poly-adenylate binding protein, unique domain
171352	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
171355	Homeobox domain
171355	Pou domain - N-terminal to homeobox domain
171357	Eukaryotic-type carbonic anhydrase
171357	Protein-tyrosine phosphatase
171360	Homeobox domain
171362	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
171362	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
171365	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
171366	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
171372	Sugar (and other) transporter
171374	ATP synthase alpha/beta chain, C terminal domain
171374	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
171374	pfam02874, ATP-synt_ab_N, ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
171380	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
171381	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
171382	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
171384	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
171387	Zinc finger, C3HC4 type (RING finger)
171396	Sulfatase
171397	Sugar (and other) transporter
171398	Sugar (and other) transporter
171399	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectr
171400	GNS1/SUR4 family
171402	GNS1/SUR4 family
171406	Somatotropin hormone family
171408	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
171415	Autophagocytosis associated protein, C-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole
171415	Autophagocytosis associated protein, N-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole
171429	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
171429	Sulfate transporter family. Mutations may lead to several human diseases
171432	Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised
171440	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
171441	Hrf1 family. This family includes a number of eukaryotic proteins. It is an integral membrane protein, conserved in at least 1 copy in all sequenced eukaryotes. The gene name in S. pombe is hrf1+ for Heavy metal Resistance Factor 1
171442	Sugar (and other) transporter
171447	Galactose binding lectin domain
171447	7 transmembrane receptor (Secretin family)
171447	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
171447	Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein cou
171449	Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein
171454	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
171455	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
171458	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
171459	Zona pellucida-like domain
171469	7 transmembrane receptor (rhodopsin family)
171469	7 transmembrane receptor (rhodopsin family)
171493	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
171494	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
171494	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
171495	C2 domain
171495	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated b
171497	Protein kinase domain
171497	Protein kinase C terminal domain
171506	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
171516	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
171517	Glypican
171518	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
171521	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
171522	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
171528	Trypsin
171529	Kinesin motor domain
171531	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
171551	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
171552	Prp18 domain. The splicing factor Prp18 is required for the second step of pre-mRNA splicing. The structure of a large fragment of the Saccharomyces cerevisiae Prp18 is known. This fragment is fully active in yeast splicing in vitro and includes the sequ
171553	Glycosyltransferase family 6
171554	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12
171556	Somatotropin hormone family
171562	Endoplasmic Reticulum Oxidoreductin 1 (ERO1). Members of this family are required for the formation of disulphide bonds in the ER
171564	Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family appears to be distantly related to pfam01678, including containing a weak internal
171565	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has
171571	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
171572	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
171574	Dynamin family
171574	Dynamin GTPase effector domain
171574	PH domain. PH stands for pleckstrin homology
171575	Deoxyribonuclease II
171577	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 a
171580	LIM domain. This family represents two copies of the LIM structural domain
171580	FAD binding domain. This domain is involved in FAD binding in a number of enzymes
171580	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
191569	3'5'-cyclic nucleotide phosphodiesterase
191570	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
191571	Cadherin domain
191571	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
191574	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
191575	Nucleoside diphosphate kinase
191576	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
191578	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
191578	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
192117	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
192119	Domain of unknown function. This putative domain is found in members of the Dicer protein family. This protein is a dsRNA nuclease that is involved in RNAi and related processes. This domain of about 100 amino acids has no known function, but does contai
192119	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
192119	RNase3 domain
192119	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
192119	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
192119	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
192119	Domain of unknown function. This putative domain is found in members of the Dicer protein family. This protein is a dsRNA nuclease that is involved in RNAi and related processes. This domain of about 100 amino acids has no known function, but does contain
192134	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
192153	7 transmembrane receptor (rhodopsin family)
192154	I/LWEQ domain. I/LWEQ domains bind to actin. It has been shown that the I/LWEQ domains from mouse talin and yeast Sla2p interact with F-actin. I/LWEQ domains can be placed into four major groups based on sequence similarity: (1) Metazoan talin
192155	Surfactant protein B
192155	Saposin A-type domain
192167	Carboxylesterase
192176	Filamin/ABP280 repeat
192176	Filamin/ABP280 repeat
192177	C2 domain
192179	Myelin proteolipid protein (PLP or lipophilin)
192185	ATP-NAD kinase. Members of this family are ATP-NAD kinases EC:2.7.1.23. Catalyses the phosphorylation of NAD to NADP utilizing ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus
192188	Extracellular link domain
192188	Fasciclin domain. This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria
192189	C2 domain
192190	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
192192	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
192193	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
192195	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
192195	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
192195	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
192196	Protein of unknown function, DUF259
192198	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
192201	WAP-type (Whey Acidic Protein) 'four-disulfide core'
192203	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
192207	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
192208	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
192215	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
192218	MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases
192223	7 transmembrane receptor (rhodopsin family)
192224	Somatotropin hormone family
192226	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated p
192234	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
192235	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
192241	ATP synthase delta (OSCP) subunit. The ATP D subunit from E. coli is the same as the OSCP subunit which is this family. The ATP D subunit from metazoa are found in family pfam00401
192242	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
192245	Hsp20/alpha crystallin family
192246	Uncharacterised protein family (UPF0123). This family of proteins are uncharacterised, they contain five CXXC motifs
192248	Cadherin domain
192255	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
192257	Carboxylesterase
192258	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
192258	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
192258	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
192259	Lipase/Acylhydrolase with GDSL-like motif
192260	CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond
192262	CUB domain
192262	Sushi domain (SCR repeat)
192267	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
192269	Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15
192271	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
192271	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
192273	Sugar (and other) transporter
192275	DAD family. Members of this family are thought to be integral membrane proteins. Some members of this family have been shown to cause apoptosis if mutated, these proteins are known as DAD for defender against death. The family also includes the epsilon s
192277	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
192280	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase,
192285	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
192287	Mitochondrial carrier protein
192289	Gamma-butyrobetaine hydroxylase. Members of this family are gamma-Butyrobetaine hydroxylase enzymes EC:1.14.11.1
192348	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
192351	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate
192360	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes
192363	Phosphotyrosine interaction domain (PTB/PID)
192645	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
192646	7 transmembrane receptor (rhodopsin family)
192648	7 transmembrane receptor (rhodopsin family)
192649	7 transmembrane receptor (rhodopsin family)
192654	Lecithin:cholesterol acyltransferase. Lecithin:cholesterol acyltransferase (LACT) is involved in extracellular metabolism of plasma lipoproteins, including cholesterol
192656	Protein kinase domain
192656	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
192657	Occludin/ELL family
192662	RHO protein GDP dissociation inhibitor
192666	Intermediate filament protein
192669	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
192669	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
192669	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
192669	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
192670	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
192670	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
192678	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
192683	SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a c
192690	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
192786	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
192797	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
192837	Ribosomal protein L21e
192882	Guanylate kinase
192882	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
192882	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
192882	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
192885	ENV polyprotein (coat polyprotein)
192895	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
192897	Calx-beta domain
192897	Fibronectin type III domain
192897	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
192897	Integrins, beta chain. Sequences cut off at repeats due to overlap with EGF
192912	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
192940	Ribosomal protein L13e
192950	NAC domain
192970	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
192981	Eukaryotic ribosomal protein L18
193025	Ribosomal L22e protein family
193028	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
193053	7 transmembrane receptor (rhodopsin family)
193055	7 transmembrane receptor (rhodopsin family)
193058	7 transmembrane receptor (rhodopsin family)
193063	7 transmembrane receptor (rhodopsin family)
193073	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
193077	7 transmembrane receptor (rhodopsin family)
193079	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
193079	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
193098	ENV polyprotein (coat polyprotein)
193113	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
193113	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
193128	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
193137	7 transmembrane receptor (rhodopsin family)
193138	7 transmembrane receptor (rhodopsin family)
193139	7 transmembrane receptor (rhodopsin family)
193140	7 transmembrane receptor (rhodopsin family)
193147	7 transmembrane receptor (rhodopsin family)
193182	7 transmembrane receptor (rhodopsin family)
193190	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
193223	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
193237	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
193255	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
193296	Protein kinase domain
193301	Translation initiation factor SUI1
193312	Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain i
193325	L1 transposable element
193328	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
193330	Eukaryotic initiation factor 1A
193360	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193361	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193383	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
193403	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
193408	Core histone H2A/H2B/H3/H4
193418	Core histone H2A/H2B/H3/H4
193431	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
193433	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
193438	Triosephosphate isomerase
193445	Core histone H2A/H2B/H3/H4
193449	Core histone H2A/H2B/H3/H4
193449	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
193450	Core histone H2A/H2B/H3/H4
193450	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
193451	Ribosomal protein L21e
193454	Core histone H2A/H2B/H3/H4
193455	Core histone H2A/H2B/H3/H4
193455	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
193456	7 transmembrane receptor (rhodopsin family)
193456	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
193459	7 transmembrane receptor (rhodopsin family)
193459	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
193461	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
193474	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
193475	7 transmembrane receptor (rhodopsin family)
193475	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
193475	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
193476	7 transmembrane receptor (rhodopsin family)
193476	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
193476	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
193476	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
193476	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
193477	ENV polyprotein (coat polyprotein)
193477	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
193478	Competence-damaged protein. CinA is the first gene in the competence-inducible (cin) operon, and is thought to be specifically required at some stage in the process of transformation. This Pfam family consists of putative competence-damaged proteins from
193478	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
193481	7 transmembrane receptor (rhodopsin family)
193481	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
193482	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
193488	Somatotropin hormone family
193510	Core histone H2A/H2B/H3/H4
193510	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
193522	ENV polyprotein (coat polyprotein)
193522	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
193522	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
193525	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
193533	Proteasome A-type and B-type
193534	7 transmembrane receptor (rhodopsin family)
193543	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193546	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193556	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193564	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193565	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193574	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193576	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193582	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193587	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193588	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193589	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193590	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193678	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
193690	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
193690	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
193695	Protein kinase domain
193707	Ribosomal protein S5, N-terminal domain
193707	Ribosomal protein S5, C-terminal domain
193736	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
193736	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
193740	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
193742	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
193742	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
193742	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
193747	L1 transposable element
193747	7 transmembrane receptor (rhodopsin family)
193753	7 transmembrane receptor (rhodopsin family)
193762	Class I Histocompatibility antigen, domains alpha 1 and 2
193764	Class I Histocompatibility antigen, domains alpha 1 and 2
193765	Class I Histocompatibility antigen, domains alpha 1 and 2
193765	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
193766	Class I Histocompatibility antigen, domains alpha 1 and 2
193768	Zinc finger, C3HC4 type (RING finger)
193772	7 transmembrane receptor (rhodopsin family)
193773	7 transmembrane receptor (rhodopsin family)
193774	7 transmembrane receptor (rhodopsin family)
193782	7 transmembrane receptor (rhodopsin family)
193784	7 transmembrane receptor (rhodopsin family)
193785	7 transmembrane receptor (rhodopsin family)
193787	7 transmembrane receptor (rhodopsin family)
193788	7 transmembrane receptor (rhodopsin family)
193796	jmjC domain
193796	jmjN domain
193798	Ribosomal L29e protein family
193810	Protein kinase domain
193811	7 transmembrane receptor (rhodopsin family)
193821	7 transmembrane receptor (rhodopsin family)
193822	7 transmembrane receptor (rhodopsin family)
193824	7 transmembrane receptor (rhodopsin family)
193825	7 transmembrane receptor (rhodopsin family)
193827	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
193840	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
193840	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
193840	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
193840	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
193845	L1 transposable element
193845	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
193853	L1 transposable element
193887	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whic
193921	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
193993	L1 transposable element
194003	C2 domain
194003	Type IV secretion system CagX conjugation protein
194003	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
194003	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
194003	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
194022	Ribosomal protein L6
194056	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
194103	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
194103	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
194119	Ribosomal protein S16
194157	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
194175	Protein kinase domain
194188	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
194189	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
194189	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
194189	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u
194197	Ribosomal protein L5
194197	ribosomal L5P family C-terminus. This region is found associated with pfam00281
194198	Ribosomal protein S7e
194229	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
194229	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
194237	Carbamoyl-phosphate synthase L chain, ATP binding domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosy
194240	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
194245	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
194245	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u
194253	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
194253	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
194253	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
194261	7 transmembrane receptor (rhodopsin family)
194269	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
194269	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
194269	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u
194299	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
194299	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
194299	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
194350	L1 transposable element
194352	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
194352	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
194357	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
194357	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
194360	Trypsin
194401	FAD binding domain. This domain is involved in FAD binding in a number of enzymes
194401	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
194419	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
194419	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
194431	7 transmembrane receptor (rhodopsin family)
194432	7 transmembrane receptor (rhodopsin family)
194433	7 transmembrane receptor (rhodopsin family)
194436	L1 transposable element
194440	L1 transposable element
194446	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
194456	Sulfotransferase protein
194456	Sulfotransferase protein
194481	Ribosomal protein S5, C-terminal domain
194481	Ribosomal protein S5, N-terminal domain
194507	GH3 auxin-responsive promoter
194507	Repeat in ubiquitin-activating (UBA) protein
194507	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
194508	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
194508	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
194508	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
194538	7 transmembrane receptor (rhodopsin family)
194539	7 transmembrane receptor (rhodopsin family)
194540	7 transmembrane receptor (rhodopsin family)
194578	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
194580	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
194586	Sulfotransferase protein
194587	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
194588	Homeobox domain
194597	Trypsin
194597	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
194604	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
194612	7 transmembrane receptor (metabotropic glutamate family)
194612	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
194633	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
194633	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
194633	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
194634	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
194634	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
194634	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
194642	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
194655	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
194655	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
194664	7 transmembrane receptor (rhodopsin family)
194673	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
194673	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
194694	L1 transposable element
194694	Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length
194700	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
194735	BTG1 family. A novel family of anti-proliferative proteins
194736	BTG1 family. A novel family of anti-proliferative proteins
194737	Homeobox domain
194738	Homeobox domain
194739	Homeobox domain
194740	Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length
194744	Mitochondrial carrier protein
194765	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
194776	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
194788	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
194788	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
194810	ENV polyprotein (coat polyprotein)
194812	Actin
194825	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
194826	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
194837	Actin
194840	Hsp90 protein
194843	Proteasome A-type and B-type
194856	Homeobox domain
194861	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
194861	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
194861	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
194861	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
194861	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
194862	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
194862	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
194862	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
194862	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
194862	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
194862	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
194862	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
194862	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
194868	L1 transposable element
194868	Repeat in ubiquitin-activating (UBA) protein
194868	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
194884	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
194884	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
194884	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
194884	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
194884	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
194887	Ribosomal protein S21e
194895	Protein-tyrosine phosphatase
194895	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
194905	Ribosomal protein L6
194910	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
194913	Ribosomal protein S5, C-terminal domain
194913	Ribosomal protein S5, N-terminal domain
194952	jmjC domain
194952	jmjC domain
194957	ENV polyprotein (coat polyprotein)
194960	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
194985	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
194985	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
195014	Ribosomal protein L36e
195018	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
195042	Ribosomal protein L35Ae
195046	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
195071	Core histone H2A/H2B/H3/H4
195084	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
195149	L1 transposable element
195149	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
195150	Eukaryotic initiation factor 1A
195151	Eukaryotic initiation factor 1A
195152	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
195154	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
195155	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
195155	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
195155	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
195155	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
195156	L1 transposable element
195156	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
195156	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
195172	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195176	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195177	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195180	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195183	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195190	Cyclophilin type peptidyl-prolyl cis-trans isomerase
195208	Doublecortin
195211	Core histone H2A/H2B/H3/H4
195212	Core histone H2A/H2B/H3/H4
195212	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
195213	Core histone H2A/H2B/H3/H4
195221	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
195221	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
195221	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
195228	Core histone H2A/H2B/H3/H4
195229	Core histone H2A/H2B/H3/H4
195230	Core histone H2A/H2B/H3/H4
195230	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
195237	Core histone H2A/H2B/H3/H4
195240	ENV polyprotein (coat polyprotein)
195240	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
195240	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
195240	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
195252	HMG (high mobility group) box
195262	Core histone H2A/H2B/H3/H4
195266	Core histone H2A/H2B/H3/H4
195267	Core histone H2A/H2B/H3/H4
195267	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
195281	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195284	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195285	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195286	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195289	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195291	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195292	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195295	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195296	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195297	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195298	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195300	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195322	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
195322	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
195324	ENV polyprotein (coat polyprotein)
195324	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
195324	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
195324	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
195324	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
195333	Homeobox domain
195333	Homeobox domain
195349	L1 transposable element
195349	Uncharacterized ACR, COG1579
195349	Poly-adenylate binding protein, unique domain
195349	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
195349	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
195355	7 transmembrane receptor (rhodopsin family)
195356	Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
195356	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
195359	Zinc finger, C3HC4 type (RING finger)
195363	7 transmembrane receptor (rhodopsin family)
195370	Ribosomal L29e protein family
195372	7 transmembrane receptor (rhodopsin family)
195401	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
195487	Ribosomal L10
195509	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
195509	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
195514	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
195522	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
195531	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
195533	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
195534	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
195559	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
195562	Peptidase family C54
195569	7 transmembrane receptor (rhodopsin family)
195576	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
195601	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195608	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195623	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
195623	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
195624	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
195646	Sulfotransferase protein
195671	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
195678	Translationally controlled tumor protein
195685	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
195686	ENV polyprotein (coat polyprotein)
195686	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
195686	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
195686	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
195686	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
195686	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
195686	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
195691	Sugar (and other) transporter
195691	HMG (high mobility group) box
195691	7 transmembrane receptor (metabotropic glutamate family)
195691	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
195701	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
195701	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
195707	ENV polyprotein (coat polyprotein)
195707	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195709	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195710	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
195712	RNA polymerase Rpb4
195726	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
195745	L1 transposable element
195746	L1 transposable element
195746	ENV polyprotein (coat polyprotein)
195751	L1 transposable element
195760	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
195760	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
195772	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
195772	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
195775	Renal dipeptidase
195783	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
195805	Homeobox domain
195805	CLN3 protein. This is a family of proteins from the CLN3 gene. A missense mutation of glutamic acid (E) to lysine (K) at position 295 in the human protein has been implicated in Juvenile neuronal ceroid lipofuscinosis (Batten disease)
195809	L1 transposable element
195809	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
195814	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
195977	von Willebrand factor type A domain
195979	PH domain. PH stands for pleckstrin homology
195979	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
195979	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
196051	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
196114	7 transmembrane receptor (rhodopsin family)
196116	7 transmembrane receptor (rhodopsin family)
196117	7 transmembrane receptor (rhodopsin family)
196188	Ribosomal protein S5, C-terminal domain
196188	Ribosomal protein S5, N-terminal domain
196200	7 transmembrane receptor (rhodopsin family)
196226	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
196266	Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic d
196294	Signal peptidase I
196335	7 transmembrane receptor (rhodopsin family)
196338	7 transmembrane receptor (rhodopsin family)
196346	HMG (high mobility group) box
196374	Intermediate filament protein
196376	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
196385	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
196400	C2 domain
196403	Zinc finger, C3HC4 type (RING finger)
196743	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
196883	Adenylate and Guanylate cyclase catalytic domain
196988	Dynamin GTPase effector domain
196996	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
197131	Putative zinc finger in N-recognin
197209	Eukaryotic-type carbonic anhydrase
197279	Ribosomal L10
197320	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
197350	ICE-like protease (caspase) p10 domain
197350	ICE-like protease (caspase) p20 domain
197358	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
199692	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
199699	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerb
199709	Ribosomal S3Ae family
199718	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
199744	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
199746	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
199783	Trypsin
199799	S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea
199813	Sodium:neurotransmitter symporter family
199858	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
199873	Ribosomal protein L23
199974	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
199974	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
200008	Oxidoreductase FAD-binding domain
200008	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
200010	Sodium:solute symporter family
200014	C2 domain
200104	EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains
200132	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
200225	C2 domain
200226	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
200312	Zinc finger, C3HC4 type (RING finger)
200316	Cytidine and deoxycytidylate deaminase zinc-binding region
200316	Cytidine and deoxycytidylate deaminase zinc-binding region
200350	Fork head domain
200388	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
200391	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
200466	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
200504	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
200504	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
200539	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
200575	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
200580	Ribosomal protein L6e
200583	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
200722	Cytochrome c oxidase subunit III
200722	NADH-ubiquinone/plastoquinone oxidoreductase chain 4L
200734	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl
200765	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d
200765	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
200810	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
200845	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
200879	Lipase
200894	ADP-ribosylation factor family
200894	ADP-ribosylation factor family
200895	Dihydrofolate reductase
200909	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
200909	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
200909	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
200916	Ribosomal L22e protein family
200931	Domain of unknown function
200933	F-box domain
200933	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
200942	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
200959	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
200959	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
200972	ADP-ribosylation factor family
200972	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
201097	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
201140	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
201175	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
201176	PH domain. PH stands for pleckstrin homology
201176	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
201259	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
201294	C2 domain
201305	Sugar (and other) transporter
201382	Laminin N-terminal (Domain VI)
201382	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
201412	Cyclophilin type peptidyl-prolyl cis-trans isomerase
201475	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
201510	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
201516	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
201540	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
201595	Oligosaccharyl transferase STT3 subunit. This family consists of the oligsacharyl transferase STT3 subunit and related proteins. The STT3 subunit is part of the oligosccharyl transferase (OTase) complex of proteins and is required for its activity. OTase
201595	Oligosaccharyl transferase STT3 subunit. This family consists of the oligsacharyl transferase STT3 subunit and related proteins. The STT3 subunit is part of the oligosccharyl transferase (OTase) complex of proteins and is required for its activity. OTase
201625	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
201625	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
201654	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
201779	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
201784	Disintegrin
201784	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
201784	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
201798	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d
201798	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
201829	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
201845	UDP-glucoronosyl and UDP-glucosyl transferase
201846	UDP-glucoronosyl and UDP-glucosyl transferase
201859	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
201878	Ribosomal protein L35Ae
201929	B-box zinc finger
201929	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
201989	FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain
201989	FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain
202083	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
202108	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
202108	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
202108	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
202122	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
202146	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
202158	Cytochrome C and Quinol oxidase polypeptide I
202227	Cyclophilin type peptidyl-prolyl cis-trans isomerase
202306	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
202319	Cyclophilin type peptidyl-prolyl cis-trans isomerase
202333	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
202366	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
202366	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
202500	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
202500	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
202538	Cytochrome C and Quinol oxidase polypeptide I
202757	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
202758	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
202758	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
202759	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
202789	pfam02921, UCR_TM, Ubiquinol cytochrome reductase transmembrane region. Each subunit of the cytochrome bc1 complex provides a single helix (this family) to make up the transmembrane region of the complex
202861	7 transmembrane receptor (rhodopsin family)
202915	Domain of unknown function
203010	NUDIX domain
203064	Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran
203068	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
203068	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
203068	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
203068	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
203068	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
203074	Trypsin
203082	Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15
203102	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
203102	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
203112	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
203190	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function,
203190	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
203202	Translationally controlled tumor protein
203249	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
203261	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
203326	7 transmembrane receptor (rhodopsin family)
203427	Mitochondrial carrier protein
203429	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
203429	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
203447	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
203510	HMG (high mobility group) box
203611	'chromo' (CHRromatin Organization MOdifier) domain
203611	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
203806	Fibronectin type III domain
203806	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
203818	Intermediate filament protein
204010	Ribosomal protein S2
204219	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
204219	Homeobox domain
204219	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
204275	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
204474	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
204801	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
205501	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
206358	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
207063	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
207119	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
207120	NUDIX domain
207120	NUDIX domain
207121	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
207121	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
207123	Uncharacterised protein family (UPF0183). This family of proteins includes Lin-10 from C. elegans
207126	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
207150	NifU-like N terminal domain. This domain is found in NifU in combination with pfam01106. This domain is found on isolated in several bacterial species. The nif genes are responsible for nitrogen fixation. However this domain is found in bacteria that do n
207151	Sugar (and other) transporter
207152	7 transmembrane receptor (rhodopsin family)
207154	Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown
207155	7 transmembrane receptor (rhodopsin family)
207162	7 transmembrane receptor (rhodopsin family)
207165	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
207165	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
207177	Ribosomal protein L35Ae
207181	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
207187	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
207188	Actin
207194	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
207194	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
207194	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea
207212	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
207213	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
207213	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
207217	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
207217	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
207217	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
207235	Translation initiation factor SUI1
207252	Eukaryotic initiation factor 4E
207259	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
207278	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
207278	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
207308	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
207318	Actin
207339	ENV polyprotein (coat polyprotein)
207350	von Willebrand factor type A domain
207357	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
207365	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
207371	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
207385	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
207390	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
207393	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
207408	Sulfatase
207415	Lipoate synthase. Lipoate synthase (or lipoic acid synthetase) catalyses the formation of alpha-(+)-lipoic acid, required for lipoate biosynthesis
207424	HMG (high mobility group) box
207474	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
207474	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
207492	Actin
207497	Clathrin adaptor complex small chain
207497	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
207544	Actin
207557	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
207558	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
207559	Lyase
207593	NAC domain
207595	Ribosomal protein S5, N-terminal domain
207595	Ribosomal protein S5, C-terminal domain
207602	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
207620	Helix-loop-helix DNA-binding domain
207624	HMG (high mobility group) box
207627	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
207629	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
207664	Cyclophilin type peptidyl-prolyl cis-trans isomerase
207667	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
207678	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
207695	Ribosomal protein S5, C-terminal domain
207695	Ribosomal protein S5, N-terminal domain
207704	GTP1/OBG family
207704	GTPase of unknown function
207704	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
207704	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
207728	3'5'-cyclic nucleotide phosphodiesterase
207728	GAF domain. Domain present in phytochromes and cGMP-specific phosphodiesterases
207731	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
207745	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
207757	Ribosomal protein L6, N-terminal domain
207787	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
207790	7 transmembrane receptor (rhodopsin family)
207795	Alkaline phosphatase
207800	7 transmembrane receptor (rhodopsin family)
207806	Helix-loop-helix DNA-binding domain
207809	Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest
207810	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
207810	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
207839	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
207853	Tropomyosin
207867	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
207874	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
207885	Ribosomal protein L21e
207908	Hsp90 protein
207908	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
207911	7 transmembrane receptor (rhodopsin family)
207912	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
207913	ab-hydrolase associated lipase region
207913	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
207914	ab-hydrolase associated lipase region
207920	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
207923	Ribosomal protein L6e
207923	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
207924	Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti
207950	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
207952	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
207952	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
207952	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
207958	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
208000	7 transmembrane receptor (rhodopsin family)
208006	7 transmembrane receptor (rhodopsin family)
208025	7 transmembrane receptor (rhodopsin family)
208030	7 transmembrane receptor (rhodopsin family)
208042	7 transmembrane receptor (rhodopsin family)
208043	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
208043	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
208044	7 transmembrane receptor (rhodopsin family)
208045	7 transmembrane receptor (rhodopsin family)
208079	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
208079	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
208080	Actin
208089	HMG (high mobility group) box
208109	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
208109	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are
208110	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
208110	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
208118	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
208118	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
208119	Ribosomal protein L3
208144	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
208144	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
208154	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
208166	TRAF-type zinc finger
208166	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
208166	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
208166	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
208166	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
208166	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
208169	Cyclic nucleotide-binding domain
208169	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
208188	7 transmembrane receptor (rhodopsin family)
208188	7 transmembrane receptor (rhodopsin family)
208211	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari
208226	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
208231	Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
208231	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
208256	Alkaline phosphatase
208264	ENV polyprotein (coat polyprotein)
208266	Intermediate filament protein
208268	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an ess
208269	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
208269	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
208285	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
208291	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
208292	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
208331	Cyclophilin type peptidyl-prolyl cis-trans isomerase
208409	Core histone H2A/H2B/H3/H4
208416	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
208416	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
208416	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
208419	Protein kinase domain
208424	7 transmembrane receptor (rhodopsin family)
208427	C2 domain
208428	Ribosomal protein L19e
208433	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
208439	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
208440	AMP-binding enzyme
208510	Ribosomal protein S6e
208514	ENV polyprotein (coat polyprotein)
208521	TSC-22/dip/bun family
208522	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
208524	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
208543	Ribosomal protein S6e
208584	Homeobox domain
208584	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
208595	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
208618	Actin interacting protein 3
208634	Tetraspanin family
208635	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
208638	Mitochondrial carrier protein
208641	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
208645	7 transmembrane receptor (rhodopsin family)
208648	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
208650	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
208652	HMG (high mobility group) box
208665	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
208669	Hantavirus glycoprotein G2. The medium (M) genome segment of hantaviruses (family Bunyaviridae) encodes the two virion glycoproteins. G1 and G2, as a precursor protein in the complementary sense RNA
208674	Homeobox domain
208682	7 transmembrane receptor (rhodopsin family)
208691	Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold
208691	Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold
208715	Hydroxymethylglutaryl-coenzyme A synthase
208718	RNB-like protein. The function of this region of similarity is uncertain
208727	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
208731	7 transmembrane receptor (rhodopsin family)
208744	7 transmembrane receptor (rhodopsin family)
208748	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy
208751	7 transmembrane receptor (rhodopsin family)
208757	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
208760	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
208777	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
208782	7 transmembrane receptor (rhodopsin family)
208795	Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length
208824	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
208859	Phosphatidylethanolamine-binding protein
208866	Ribosomal protein S2
208883	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
208894	ADP-ribosylation factor family
208895	ab-hydrolase associated lipase region
208895	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
208896	Ribosomal protein L36e
208919	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
208922	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
208927	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
208934	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
208936	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
208943	DIL domain. The DIL domain has no known function
208946	Translation initiation factor SUI1
208953	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
208967	Shikimate kinase
208967	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
208967	Pyridoxal-phosphate dependent enzyme. Members of this family are all pyridoxal-phosphate dependent enzymes. This family includes: serine dehydratase EC:4.2.1.13 P20132, threonine dehydratase EC:4.2.1.16, tryptophan synthase beta chain EC:4.2.1.20, threoni
208976	ENV polyprotein (coat polyprotein)
208982	HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase
208982	HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase
208999	Ribosomal S17
209003	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
209027	Delta 1-pyrroline-5-carboxylate reductase
209031	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
209033	7 transmembrane receptor (rhodopsin family)
209036	7 transmembrane receptor (rhodopsin family)
209040	7 transmembrane receptor (rhodopsin family)
209043	7 transmembrane receptor (rhodopsin family)
209055	Protein kinase domain
209057	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
209088	Ribosomal protein S5, N-terminal domain
209091	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
209091	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
209098	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
209109	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
209109	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
209120	HMG (high mobility group) box
209131	PX domain. PX domains bind to phosphoinositides
209155	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
209176	Indoleamine 2,3-dioxygenase
209183	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
209200	Zinc finger, C3HC4 type (RING finger)
209202	ENV polyprotein (coat polyprotein)
209202	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
209202	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
209202	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
209203	Hsp90 protein
209230	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
209239	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
209265	Tropomyosin
209268	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
209268	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
209275	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
209281	F-actin capping protein alpha subunit
209294	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
209294	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family
209296	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
209317	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
209324	Orbivirus VP3 (T2) protein. The orbivirus VP3 protein is part of the virus core and makes a 'subcore' shell made up of 120 copies of the 100K protein. VP3 particles can also bind RNA and are fundamental in the early stages of viral core formation. Also fo
209324	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
209324	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family
209326	Orbivirus VP3 (T2) protein. The orbivirus VP3 protein is part of the virus core and makes a 'subcore' shell made up of 120 copies of the 100K protein. VP3 particles can also bind RNA and are fundamental in the early stages of viral core formation. Also fo
209326	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
209326	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family
209334	XPG I-region
209334	XPG I-region
209334	XPG N-terminal domain
209351	WAP-type (Whey Acidic Protein) 'four-disulfide core'
209351	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
209354	Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac
209357	Transcription factor Tfb4
209357	Transcription factor Tfb4
209357	Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex
209361	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
209371	Kinesin motor domain
209384	B-box zinc finger
209384	Zinc finger, C3HC4 type (RING finger)
209384	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
209387	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
209416	KOW motif. This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG
209416	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
209423	Ribosomal protein L19e
209442	Ribosomal L29e protein family
209444	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
209444	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
209446	Helix-loop-helix DNA-binding domain
209448	Homeobox domain
209449	7 transmembrane receptor (rhodopsin family)
209450	7 transmembrane receptor (rhodopsin family)
209456	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
209456	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
209464	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
209474	Cyclophilin type peptidyl-prolyl cis-trans isomerase
209488	SH2 domain
209501	Ribosomal protein L23
209508	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
209510	7 transmembrane receptor (rhodopsin family)
209511	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
209512	7 transmembrane receptor (rhodopsin family)
209513	7 transmembrane receptor (rhodopsin family)
209517	7 transmembrane receptor (rhodopsin family)
209524	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
209524	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
209540	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
209541	Actin
209542	mRNA capping enzyme, C-terminal domain
209542	mRNA capping enzyme, catalytic domain. This family represents the ATP binding catalytic domain of the mRNA capping enzyme
209542	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
209558	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
209584	Uncharacterized ACR, COG1590
209586	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
209586	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
209589	Ribosomal protein S7e
209591	Acyl CoA binding protein
209607	Protein kinase domain
209630	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
209632	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
209632	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
209633	Ribosomal L22e protein family
209636	Ribosomal protein L21e
209637	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
209654	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
209674	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
209685	Ribosomal protein L19e
209692	Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases
209692	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
209714	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
209714	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
209737	Kinesin motor domain
209737	Kinesin motor domain
209742	von Hippel-Lindau disease tumor suppressor protein. VHL forms a ternary complex with the elonginB and elonginC proteins. This complex binds Cul2, which then is involved in regulation of vascular endothelial growth factor mRNA
209749	7 transmembrane receptor (rhodopsin family)
209749	7 transmembrane receptor (rhodopsin family)
209751	7 transmembrane receptor (rhodopsin family)
209753	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
209754	7 transmembrane receptor (rhodopsin family)
209761	7 transmembrane receptor (rhodopsin family)
209762	Class I Histocompatibility antigen, domains alpha 1 and 2
209764	7 transmembrane receptor (rhodopsin family)
209766	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
209768	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
209770	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
209772	Thymidine kinase
209773	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
209773	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
209773	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
209776	7 transmembrane receptor (rhodopsin family)
209782	7 transmembrane receptor (rhodopsin family)
209783	7 transmembrane receptor (rhodopsin family)
209786	7 transmembrane receptor (metabotropic glutamate family)
209787	7 transmembrane receptor (rhodopsin family)
209791	MCM2/3/5 family
209793	7 transmembrane receptor (rhodopsin family)
209794	ABC1 family. This family includes ABC1 from yeast and AarF from E. coli. These proteins have a nuclear or mitochondrial subcellular location in eukaryotes. The exact molecular functions of these proteins is not clear, however yeast ABC1 suppresses a cytoc
209807	7 transmembrane receptor (rhodopsin family)
209808	7 transmembrane receptor (rhodopsin family)
209824	7 transmembrane receptor (rhodopsin family)
209824	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
209824	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
209824	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
209837	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
209857	ENV polyprotein (coat polyprotein)
209861	PMP-22/EMP/MP20/Claudin family
209865	Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars
209871	Ribosomal S3Ae family
209887	ENV polyprotein (coat polyprotein)
209888	Vitamin B12 dependent methionine synthase, activation domain
209901	Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins
209917	HMG (high mobility group) box
209926	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
209988	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
210004	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
210004	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
210009	Flavodoxin
210030	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
210030	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
210044	Adenylate and Guanylate cyclase catalytic domain
210044	ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum
210045	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
210064	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
210066	Hsp90 protein
210094	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
210103	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
210103	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
210103	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
210104	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
210105	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
210105	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
210106	PAP/25A associated domain
210106	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to
210126	LIM domain. This family represents two copies of the LIM structural domain
210129	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
210131	NADH:flavin oxidoreductase / NADH oxidase family
210145	GTPase of unknown function
210145	Protein of unknown function, DUF258
210145	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
210147	Phospholipase A2 inhibitor
210157	Ribosomal protein S6e
210162	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
210162	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
210172	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
210191	ENV polyprotein (coat polyprotein)
210197	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
210198	7 transmembrane receptor (metabotropic glutamate family)
210198	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
210212	ENV polyprotein (coat polyprotein)
210240	ENV polyprotein (coat polyprotein)
210240	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
210240	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
210240	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
210243	Eukaryotic ribosomal protein L18
210243	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
210265	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
210271	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
210271	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
210274	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
210282	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
210291	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
210297	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
210321	NLP/P60 family. The function of this domain is unknown. It is found in several lipoproteins
210351	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
210358	Homeobox domain
210364	Ribosomal protein L23
210381	Protein kinase domain
210389	Aminotransferase class I and II
210411	7 transmembrane receptor (metabotropic glutamate family)
210411	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
210413	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
210413	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
210413	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
210424	Thrombospondin type 1 domain
210429	Core histone H2A/H2B/H3/H4
210457	LIM domain. This family represents two copies of the LIM structural domain
210463	Sugar (and other) transporter
210465	Homeobox domain
210465	Sugar (and other) transporter
210465	7 transmembrane receptor (metabotropic glutamate family)
210465	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
210467	7 transmembrane receptor (metabotropic glutamate family)
210475	ENV polyprotein (coat polyprotein)
210477	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
210482	Ribosomal protein S5, C-terminal domain
210482	Ribosomal protein S5, N-terminal domain
210497	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
210497	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
210503	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
210507	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
210507	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
210510	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
210520	Glycosyl hydrolase family 1
210534	Ribosomal protein L21e
210554	Hus1-like protein. Hus1, Rad1, and Rad9 are three evolutionarily conserved proteins required for checkpoint control in fission yeast. These proteins are known to form a stable complex in vivo. Hus1-Rad1-Rad9 complex may form a PCNA-like ring structure, a
210564	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
210582	Uncharacterised protein family (UPF0083)
210588	Ribosomal protein S12
210599	Cytochrome C oxidase subunit II, periplasmic domain
210602	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
210602	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
210619	Protein kinase domain
210622	CUB domain
210622	Trypsin
210622	CUB domain
210622	Sushi domain (SCR repeat)
210668	Ribosomal protein S24e
210670	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
210708	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
210708	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
210709	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved
210714	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
210722	Ribosomal protein S2
210742	L1 transposable element
210758	7 transmembrane receptor (rhodopsin family)
210758	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
210760	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
210772	Sec34-like family. Sec34 and Sec35 form a sub-complex, in a seven protein complex that includes Dor1 (PFAM:PF04124). This complex is thought to be important for tether vesicles to the Golgi
210775	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
210779	Ribosomal protein L21e
210784	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
210827	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
210853	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
210853	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
210869	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
210876	7 transmembrane receptor (metabotropic glutamate family)
210876	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
210905	Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15
210933	7 transmembrane receptor (Secretin family)
210933	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
210933	7 transmembrane receptor (Secretin family)
210933	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
210933	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
210964	TPR Domain
210973	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
210976	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
210988	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
210994	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
211015	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
211027	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
211036	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
211105	Profilin
211153	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
211187	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
211191	Hsp90 protein
211208	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
211208	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
211223	7 transmembrane receptor (metabotropic glutamate family)
211223	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
211232	C2 domain
211232	C2 domain
211247	Ribosomal protein S8e
211255	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
211255	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
211266	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
211280	Hsp90 protein
211296	Fructose-bisphosphate aldolase class-I
211300	Trypsin
211303	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
211303	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
211305	F-box domain
211305	F-box domain
211321	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
211321	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
211321	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
211331	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
211332	Kinesin motor domain
211347	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
211362	7 transmembrane receptor (metabotropic glutamate family)
211362	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
211370	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
211389	Mo-co oxidoreductase dimerisation domain. This domain is found in molybdopterin cofactor (Mo-co) oxidoreductases. It is involved in dimer formation, and has an Ig-fold structure
211389	Oxidoreductase molybdopterin binding domain. This domain is found in a variety of oxidoreductases. This domain binds to a molybdopterin cofactor. Xanthine dehydrogenases, that also bind molybdopterin, have essentially no similarity
211389	Mo-co oxidoreductase dimerisation domain. This domain is found in molybdopterin cofactor (Mo-co) oxidoreductases. It is involved in dimer formation, and has an Ig-fold structure
211389	Oxidoreductase molybdopterin binding domain. This domain is found in a variety of oxidoreductases. This domain binds to a molybdopterin cofactor. Xanthine dehydrogenases, that also bind molybdopterin, have essentially no similarity
211389	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
211391	Ribosomal protein L11, RNA binding domain
211398	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
211398	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
211398	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
211421	Ribosomal protein S19
211430	C2 domain
211437	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
211437	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
211441	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
211457	7 transmembrane receptor (rhodopsin family)
211468	Cyclic nucleotide-binding domain
211468	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
211468	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
211469	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
211472	7 transmembrane receptor (rhodopsin family)
211480	Inward rectifier potassium channel
211484	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
211484	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
211490	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
211496	Protein kinase domain
211502	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
211539	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
211550	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
211551	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
211551	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
211552	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
211552	AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this
211563	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
211565	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
211566	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
211566	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
211572	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
211572	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
211572	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
211572	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
211577	7 transmembrane receptor (rhodopsin family)
211578	7 transmembrane receptor (rhodopsin family)
211586	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
211586	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
211587	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
211587	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
211598	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
211603	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
211603	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
211603	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
211612	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
211614	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
211622	Polyphosphate kinase. Polyphosphate kinase (Ppk) catalyzes the formation of polyphosphate from ATP, with chain lengths of up to a thousand or more orthophosphate molecules
211652	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
211666	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
211666	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
211669	Protein kinase domain
211669	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
211673	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
211685	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
211700	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
211700	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
211702	Zinc finger, C3HC4 type (RING finger)
211705	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
211706	TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme
211712	Cadherin domain
211725	Glutamine synthetase, catalytic domain
211726	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
211727	Defensin propeptide
211746	Ribosomal protein L21e
211757	Transforming growth factor beta like domain
211757	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
211776	Ribosomal S17
211795	7 transmembrane receptor (rhodopsin family)
211804	Phosphoglycerate mutase family
211832	NAD-dependent glycerol-3-phosphate dehydrogenase
211838	START domain
211857	Pou domain - N-terminal to homeobox domain
211870	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
211895	HMG (high mobility group) box
211903	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
211914	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
211914	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
211924	Cadherin domain
211924	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
211924	Cadherin domain
211924	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
211925	Ribosomal protein S5, N-terminal domain
211925	Ribosomal protein S5, C-terminal domain
211945	PH domain. PH stands for pleckstrin homology
211945	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
211948	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
211970	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
211973	PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is about
211980	7 transmembrane receptor (metabotropic glutamate family)
211980	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
211981	Ribosomal family S4e
211983	Protein kinase domain
212007	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
212030	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
212030	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
212034	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
212034	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
212043	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
212044	Ribosomal L29e protein family
212069	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
212084	Ribosomal protein S19
212085	Zinc finger, C3HC4 type (RING finger)
212111	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
212117	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
212117	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
212117	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
212117	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
212122	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
212129	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
212129	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
212130	Proteasome A-type and B-type
212149	ENV polyprotein (coat polyprotein)
212190	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
212198	WD domain, G-beta repeat
212205	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
212207	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
212211	ENV polyprotein (coat polyprotein)
212211	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
212211	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
212211	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
212218	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
212236	Ribosomal protein S5, N-terminal domain
212237	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
212252	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
212266	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
212266	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
212266	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
212267	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
212271	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
212271	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
212271	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
212276	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
212278	HMG (high mobility group) box
212281	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
212282	7 transmembrane receptor (rhodopsin family)
212282	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
212285	PH domain. PH stands for pleckstrin homology
212285	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
212285	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
212285	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
212289	7 transmembrane receptor (rhodopsin family)
212313	Pyruvate kinase, alpha/beta domain
212313	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
212313	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
212351	ENV polyprotein (coat polyprotein)
212366	MAS20 protein import receptor
212390	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
212399	Actin
212413	Signal recognition particle, alpha subunit, N-terminal. SRP is a complex of six distinct polypeptides and a 7S RNA that is essential for transferring nascent polypeptide chains that are destined for export from the cell to the translocation apparatus of t
212419	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
212419	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
212419	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
212419	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
212429	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
212429	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
212432	Calx-beta domain
212434	Cyclophilin type peptidyl-prolyl cis-trans isomerase
212437	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
212437	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
212446	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
212449	ENV polyprotein (coat polyprotein)
212460	Actin
212464	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
212476	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
212477	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
212503	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
212507	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
212508	GTPase of unknown function
212508	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
212508	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
212515	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
212528	Conserved hypothetical protein 95
212528	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
212528	Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat
212528	N2,N2-dimethylguanosine tRNA methyltransferase. This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA. The TRM1 gene of Saccharomyces cerevisiae is necessary for the N2,N2-dimethylguanosine modification of both mitochondrial and cytoplasmic tRNAs. The e
212539	ADP-ribosylation factor family
212539	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
212541	7 transmembrane receptor (rhodopsin family)
212548	7 transmembrane receptor (rhodopsin family)
212549	7 transmembrane receptor (rhodopsin family)
212555	PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. How
212556	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
212558	7 transmembrane receptor (rhodopsin family)
212561	7 transmembrane receptor (rhodopsin family)
212562	7 transmembrane receptor (rhodopsin family)
212576	7 transmembrane receptor (rhodopsin family)
212607	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
212608	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
212618	Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular m
212632	Intermediate filament protein
212634	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
212647	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
212670	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
212677	Homeobox domain
212677	'Paired box' domain
212679	tRNA synthetases class I (K). This family includes only lysyl tRNA synthetases from prokaryotes
212679	tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included
212679	tRNA synthetases class I (C). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only cysteinyl tRNA synthetases
212684	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
212692	TMS membrane protein/tumour differentially expressed protein (TDE)
212699	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
212700	Domain of unknown function
212704	Ribosomal protein L35Ae
212710	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
212712	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
212718	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
212722	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
212725	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
212725	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
212727	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
212727	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
212741	Thymidine kinase
212744	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
212744	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
212746	Ribosomal protein L31e
212747	WD domain, G-beta repeat
212763	Core histone H2A/H2B/H3/H4
212782	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
212806	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
212806	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
212806	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
212808	Cyclophilin type peptidyl-prolyl cis-trans isomerase
212819	ENV polyprotein (coat polyprotein)
212819	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
212819	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
212819	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
212819	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
212819	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
212819	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
212844	Ribosomal protein L21e
212866	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
212866	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
212870	7 transmembrane receptor (rhodopsin family)
212870	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
212871	Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein
212884	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
212884	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
212915	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
212919	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
212921	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
212923	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
212932	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
212933	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
212941	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
212950	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
212950	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
212950	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
212950	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
212952	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
212958	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
212961	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
212970	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
212974	Glycosyl hydrolase family 65 central catalytic domain. This family of glycosyl hydrolases contains vacuolar acid trehalase and maltose phosphorylase.Maltose phosphorylase (MP) is a dimeric enzyme that catalyzes the conversion of maltose and inorganic pho
212989	Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown
212996	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
212997	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
212997	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
213001	7 transmembrane receptor (rhodopsin family)
213014	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
213030	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
213032	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
213037	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
213038	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
213038	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
213043	[2Fe-2S] binding domain
213043	CO dehydrogenase flavoprotein C-terminal domain
213043	FAD binding domain in molybdopterin dehydrogenase
213043	Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain
213043	Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain
213051	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
213053	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
213070	Chaperonin 10 Kd subunit
213070	Hint module. This is an alignment of the Hint module in the Hedgehog proteins. It does not include any Inteins which also possess the Hint module
213079	HMG (high mobility group) box
213093	Ribosomal protein L34e
213105	Ribosomal protein L19e
213109	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
213119	von Willebrand factor type A domain
213119	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
213120	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
213121	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
213121	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
213160	Ribosomal protein L21e
213177	Translation initiation factor SUI1
213179	Translation initiation factor SUI1
213212	Trefoil (P-type) domain
213212	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
213220	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
213251	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
213268	Ribosomal protein L19e
213272	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
213286	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
213287	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
213288	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
213288	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
213299	Ribonucleotide reductase, small chain
213311	F-box domain
213320	HAT (Half-A-TPR) repeat. The HAT (Half A TPR) repeat is found in several RNA processing proteins
213326	Protein kinase domain
213352	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
213363	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
213363	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
213367	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
213391	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
213404	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
213409	LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1
213411	Protein-tyrosine phosphatase
213411	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
213416	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
213420	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
213422	Ribosomal L28e protein family
213423	7 transmembrane receptor (rhodopsin family)
213426	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
213428	Fibronectin type III domain
213435	Protein kinase domain
213436	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
213436	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
213437	7 transmembrane receptor (metabotropic glutamate family)
213438	7 transmembrane receptor (rhodopsin family)
213438	7 transmembrane receptor (rhodopsin family)
213439	7 transmembrane receptor (rhodopsin family)
213440	HMG (high mobility group) box
213451	Tropomyosin
213454	Zinc finger, C3HC4 type (RING finger)
213469	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function,
213469	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
213474	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
213478	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
213480	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
213481	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
213484	NUDIX domain
213498	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
213499	F-box domain
213499	F-box domain
213499	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
213514	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
213514	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
213522	PH domain. PH stands for pleckstrin homology
213522	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
213527	7 transmembrane receptor (Secretin family)
213527	7 transmembrane receptor (Secretin family)
213533	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
213533	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
213539	BAG domain. Domain present in Hsp70 regulators
213541	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment
213556	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
213556	PH domain. PH stands for pleckstrin homology
213556	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
213556	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
213560	Ribosomal protein L6
213570	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
213584	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
213586	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
213586	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
213587	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
213595	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
213611	Protein-tyrosine phosphatase
213638	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
213684	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
213693	Actin
213694	ENV polyprotein (coat polyprotein)
213701	Ribosomal protein L10
213702	Protein kinase domain
213758	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
213788	7 transmembrane receptor (rhodopsin family)
213826	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
213826	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
213827	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
213856	Core histone H2A/H2B/H3/H4
213878	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
213879	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
213881	CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and
213881	CBL proto-oncogene N-terminus, EF hand-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily con
213881	CBL proto-oncogene N-terminus, SH2-like domain. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserv
213898	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
213923	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
213943	von Willebrand factor type A domain
213943	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
213943	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
213945	von Willebrand factor type A domain
213946	WD domain, G-beta repeat
213973	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
213973	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
213977	HAT (Half-A-TPR) repeat. The HAT (Half A TPR) repeat is found in several RNA processing proteins
213985	Ribosomal protein S6e
213990	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
213994	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
214029	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
214036	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
214038	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuron
214062	Ribosomal protein L6
214066	Zn-finger in Ran binding protein and others
214072	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
214111	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
214137	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
214137	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
214141	Ribosomal protein L11, RNA binding domain
214141	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
214144	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
214149	jmjN domain
214149	jmjC domain
214150	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
214150	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
214151	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
214158	B-box zinc finger
214158	Zinc finger, C3HC4 type (RING finger)
214158	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
214158	ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits a
214162	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
214162	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
214162	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
214162	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
214166	Ribosomal protein L23
214198	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
214230	Protein kinase domain
214230	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
214245	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
214247	ENV polyprotein (coat polyprotein)
214253	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
214253	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
214253	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
214270	Ribosomal protein L3
214278	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
214286	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
214290	PAP/25A associated domain
214292	ENV polyprotein (coat polyprotein)
214292	ENV polyprotein (coat polyprotein)
214299	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
214301	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
214320	Sulfotransferase protein
214321	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
214321	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
214362	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
214377	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
214377	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
214377	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
214384	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
214384	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
214390	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
214401	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
214403	Sushi domain (SCR repeat)
214410	Clathrin light chain
214421	Sushi domain (SCR repeat)
214434	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
214447	WD domain, G-beta repeat
214450	LIM domain. This family represents two copies of the LIM structural domain
214456	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
214461	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
214461	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
214521	Ribosomal protein L21e
214522	ENV polyprotein (coat polyprotein)
214531	Trypsin
214531	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
214534	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
214534	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
214545	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
214545	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
214575	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
214590	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
214590	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
214593	Animal haem peroxidase
214593	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
214593	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous
214603	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, Apolipoprotein E
214603	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
214611	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
214614	Fatty acid desaturase
214616	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
214620	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
214627	PAP/25A associated domain
214627	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to
214652	Zinc finger, C3HC4 type (RING finger)
214658	ENV polyprotein (coat polyprotein)
214663	Mitochondrial carrier protein
214680	ENV polyprotein (coat polyprotein)
214680	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
214682	Myosin head (motor domain)
214686	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
214695	Ribosomal protein S6e
214714	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
214728	Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes
214732	Ribosomal protein S2
214738	Ribosomal protein S6e
214739	Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold
214739	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
214742	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
214748	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
214748	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
214759	Prothymosin/parathymosin family. Prothymosin alpha and parathymosin are two ubiquitous small acidic nuclear proteins that are thought to be involved in cell cycle progression, proliferation, and cell differentiation
214761	Proteasome A-type and B-type
214766	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
214767	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
214767	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
214783	Fibronectin type III domain
214783	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
214795	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
214804	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
214849	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
214880	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
214891	ENV polyprotein (coat polyprotein)
214891	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
214895	Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-2
214897	Protein kinase domain
214899	jmjC domain
214899	jmjN domain
214899	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
214899	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
214899	jmjC domain
214899	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
214899	pfam02928, zf-C5HC2, C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function
214905	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
214913	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
214922	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
214923	Polyprenyl synthetase
214944	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
214944	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
214951	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
214952	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
214967	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results
214968	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
214973	Cyclophilin type peptidyl-prolyl cis-trans isomerase
214976	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
214976	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
215001	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
215028	Somatotropin hormone family
215029	Somatotropin hormone family
215034	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
215052	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
215061	B-box zinc finger
215061	Zinc finger, C3HC4 type (RING finger)
215061	B-box zinc finger
215061	Zinc finger, C3HC4 type (RING finger)
215061	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
215065	ENV polyprotein (coat polyprotein)
215065	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
215065	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
215078	Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti
215085	Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region
215086	Transaldolase
215093	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
215095	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this famil
215098	Ribosomal protein S17
215157	Sushi domain (SCR repeat)
215159	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
215160	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
215166	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
215166	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
215192	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
215196	Ribosomal protein S11
215204	GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins
215208	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
215231	Cyclophilin type peptidyl-prolyl cis-trans isomerase
215232	Cyclophilin type peptidyl-prolyl cis-trans isomerase
215253	Zinc-binding dehydrogenase
215253	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
215282	Galactose binding lectin domain
215288	Proteasome A-type and B-type
215332	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
215335	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
215337	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
215338	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
215338	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
215339	Ribosomal protein S5, N-terminal domain
215339	Ribosomal protein S5, C-terminal domain
215351	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
215378	Giardia variant-specific surface protein
215393	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
215394	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
215405	GTPase of unknown function
215405	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
215405	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
215408	Thrombospondin type 1 domain
215408	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
215411	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
215419	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
215422	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
215422	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
215424	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
215428	CUB domain
215428	Sushi domain (SCR repeat)
215435	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
215446	GDA1/CD39 (nucleoside phosphatase) family
215446	Protein kinase domain
215446	GDA1/CD39 (nucleoside phosphatase) family
215449	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
215452	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
215493	Glycosyltransferase family 6
215495	Ribosomal protein S5, C-terminal domain
215495	Ribosomal protein S5, N-terminal domain
215539	Vinculin family
215539	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
215539	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
215544	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
215562	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
215562	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
215605	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
215615	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
215621	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
215627	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
215632	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
215632	PH domain. PH stands for pleckstrin homology
215632	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
215640	Ribosomal protein L34e
215641	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
215641	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
215653	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
215653	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
215654	Cadherin domain
215654	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
215667	Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family
215678	Ribosomal L29e protein family
215695	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
215705	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vp
215705	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps
215734	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
215777	ENV polyprotein (coat polyprotein)
215791	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
215791	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
215797	7 transmembrane receptor (Secretin family)
215798	CUB domain
215798	Pentaxin family. Pentaxins are also known as pentraxins
215802	HMG (high mobility group) box
215806	Cornichon protein
215823	ENV polyprotein (coat polyprotein)
215823	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
215844	FKBP-type peptidyl-prolyl cis-trans isomerase
215845	Ribosomal L29e protein family
215845	Cyclophilin type peptidyl-prolyl cis-trans isomerase
215852	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
215852	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
215853	Aconitase family (aconitate hydratase)
215854	7 transmembrane receptor (rhodopsin family)
215855	7 transmembrane receptor (rhodopsin family)
215856	7 transmembrane receptor (rhodopsin family)
215857	7 transmembrane receptor (rhodopsin family)
215859	7 transmembrane receptor (rhodopsin family)
215861	7 transmembrane receptor (rhodopsin family)
215865	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
215866	ENV polyprotein (coat polyprotein)
215867	ENV polyprotein (coat polyprotein)
215869	ENV polyprotein (coat polyprotein)
215871	Phosphatidylethanolamine-binding protein
215872	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
215873	ENV polyprotein (coat polyprotein)
215877	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
215890	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
215890	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
215890	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
215890	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
215892	ENV polyprotein (coat polyprotein)
215892	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
215893	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
215893	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
215894	ENV polyprotein (coat polyprotein)
215894	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
215894	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
215894	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
215894	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
215894	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
215895	Sulfotransferase protein
215896	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
215896	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
215905	Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
215906	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
215913	Ribosomal protein S26e
215919	Zinc finger, C3HC4 type (RING finger)
215920	Ribosomal S17
215922	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
215941	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
215944	7 transmembrane receptor (rhodopsin family)
215946	Adenylate kinase
215948	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
215950	Uracil DNA glycosylase superfamily
215962	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
215963	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
215969	Triosephosphate isomerase
215974	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
215974	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
215977	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
215979	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
215979	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
215979	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
215988	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
215988	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
215988	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
216019	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
216020	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
216020	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
216024	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
216033	Vinculin family
216033	Vinculin family
216036	Ribosomal protein S6e
216057	Enolase, C-terminal TIM barrel domain
216081	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
216081	AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this
216082	Ets-domain
216099	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
216115	Sodium:solute symporter family
216134	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an ess
216135	7 transmembrane receptor (metabotropic glutamate family)
216136	Thiamine pyrophosphate enzyme, C-terminal TPP binding domain
216136	Thiamine pyrophosphate enzyme, N-terminal TPP binding domain
216136	Thiamine pyrophosphate enzyme, central domain. The central domain of TPP enzymes contains a 2-fold Rossman fold
216136	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
216136	Thiamine pyrophosphate enzyme, C-terminal TPP binding domain
216136	Thiamine pyrophosphate enzyme, N-terminal TPP binding domain
216136	Thiamine pyrophosphate enzyme, central domain. The central domain of TPP enzymes contains a 2-fold Rossman fold
216136	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
216142	7 transmembrane receptor (rhodopsin family)
216143	7 transmembrane receptor (rhodopsin family)
216148	SH2 domain
216148	Phosphotyrosine interaction domain (PTB/PID)
216150	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
216152	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
216152	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
216154	WD domain, G-beta repeat
216156	WD domain, G-beta repeat
216166	Protein kinase domain
216166	POLO box duplicated region
216177	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
216178	Protein kinase domain
216179	Clathrin adaptor complex small chain
216179	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
216185	Transcription initiation factor TFIID 23-30kDa subunit
216188	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
216190	PH domain. PH stands for pleckstrin homology
216210	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
216223	Fibronectin type III domain
216224	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
216229	WD domain, G-beta repeat
216238	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
216244	Transaldolase
216282	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
216285	Homeobox domain
216294	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
216299	von Willebrand factor type D domain
216301	Ribosomal protein S6e
216317	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
216330	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
216334	Ribosomal protein L21e
216343	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
216344	ADP-ribosylation factor family
216344	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
216347	Ribosomal S3Ae family
216350	Tetraspanin family
216363	Uncharacterized ACR, COG1579
216363	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
216363	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
216390	Ribosomal protein L21e
216394	Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
216394	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
216416	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
216439	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
216439	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
216439	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
216439	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
216441	Sulfate transporter family. Mutations may lead to several human diseases
216443	Vinculin family
216443	WHEP-TRS domain
216443	tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included
216443	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
216453	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
216454	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
216456	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
216456	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
216460	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
216467	7 transmembrane receptor (rhodopsin family)
216472	7 transmembrane receptor (rhodopsin family)
216483	7 transmembrane receptor (rhodopsin family)
216502	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
216502	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
216505	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
216524	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
216530	Adenylate and Guanylate cyclase catalytic domain
216542	ENV polyprotein (coat polyprotein)
216544	Ribosomal protein L11, RNA binding domain
216544	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
216546	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
216546	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
216550	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
216550	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
216551	Galactoside-binding lectin
216558	UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate uri
216600	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
216600	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
216600	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
216600	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
216605	ENV polyprotein (coat polyprotein)
216605	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
216605	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
216605	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
216605	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
216610	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
216622	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
216622	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
216635	Globin
216635	Globin
216635	Globin
216636	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
216636	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
216639	Ribosomal protein S2
216643	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
216643	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
216672	Ribosomal L22e protein family
216674	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
216678	HMG (high mobility group) box
216681	NAC domain
216691	Ribosomal protein S5, N-terminal domain
216696	ENV polyprotein (coat polyprotein)
216704	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
216705	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
216707	Ribosomal L39 protein
216713	7 transmembrane receptor (rhodopsin family)
216715	7 transmembrane receptor (rhodopsin family)
216716	7 transmembrane receptor (rhodopsin family)
216717	7 transmembrane receptor (rhodopsin family)
216720	7 transmembrane receptor (rhodopsin family)
216721	7 transmembrane receptor (rhodopsin family)
216721	1,3-beta-glucan synthase component. This family consists of various 1,3-beta-glucan synthase components including Gls1, Gls2 and Gls3 from yeast. 1,3-beta-glucan synthase EC:2.4.1.34 also known as callose synthase catalyses the formation of a beta-1,3-glu
216724	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
216724	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
216725	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
216725	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
216725	Thrombospondin type 1 domain
216725	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
216726	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
216732	Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone
216749	7 transmembrane receptor (rhodopsin family)
216763	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. M
216767	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
216772	Phospholipase A2 inhibitor
216774	7 transmembrane receptor (rhodopsin family)
216781	B-box zinc finger
216781	Zinc finger, C3HC4 type (RING finger)
216781	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
216782	7 transmembrane receptor (rhodopsin family)
216783	7 transmembrane receptor (rhodopsin family)
216784	7 transmembrane receptor (rhodopsin family)
216785	7 transmembrane receptor (rhodopsin family)
216787	7 transmembrane receptor (rhodopsin family)
216788	7 transmembrane receptor (rhodopsin family)
216790	Protein kinase domain
216792	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
216795	wnt family
216797	Trypsin
216798	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
216799	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
216800	HMG (high mobility group) box
216800	7 transmembrane receptor (rhodopsin family)
216802	7 transmembrane receptor (rhodopsin family)
216803	7 transmembrane receptor (rhodopsin family)
216810	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
216814	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
216817	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
216818	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
216818	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
216825	Ubiquitin carboxyl-terminal hydrolase family 2
216825	Ubiquitin carboxyl-terminal hydrolases family 2
216825	Zn-finger in ubiquitin-hydrolases and other protein
216827	Ribosomal protein S15
216831	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
216834	HIT family
216835	Ubiquitin carboxyl-terminal hydrolase family 2
216835	Ubiquitin carboxyl-terminal hydrolases family 2
216850	jmjC domain
216851	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
216851	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
216856	Carboxylesterase
216858	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
216859	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
216864	Hepatic lectin, N-terminal domain
216869	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
216869	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
216874	CG-1 domain. CG-1 domains are highly conserved domains of about 130 amino-acid residues containing a predicted bipartite NLS and named after a partial cDNA clone isolated from parsley encoding a sequence-specific DNA-binding protein. CG-1 domains are ass
216874	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
216874	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
216877	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
216877	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
216881	Sulfotransferase protein
216881	Sulfotransferase protein
216881	WSC domain. This domain may be involved in carbohydrate binding
216884	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera
216886	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
216900	7 transmembrane receptor (rhodopsin family)
216904	7 transmembrane receptor (rhodopsin family)
216909	7 transmembrane receptor (rhodopsin family)
216927	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
216928	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
216940	7 transmembrane receptor (rhodopsin family)
216953	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
216959	Helix-loop-helix DNA-binding domain
216961	WD domain, G-beta repeat
216963	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
216963	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
216963	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
216976	Protein kinase domain
216991	PH domain. PH stands for pleckstrin homology
216991	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
217031	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
217057	Domain of unknown function UPF0099. This domain has no known function
217066	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
217069	Zinc finger, C3HC4 type (RING finger)
217069	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
217069	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
217092	ENV polyprotein (coat polyprotein)
217114	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
217119	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
217123	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
217123	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
217127	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
217127	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
217127	MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases
217134	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
217151	ADP-ribosylation factor family
217151	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
217151	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
217154	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
217154	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
217164	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
217166	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
217166	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
217167	Connexin
217175	Intermediate filament protein
217175	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
217177	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
217177	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
217194	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
217194	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
217198	PH domain. PH stands for pleckstrin homology
217212	Pancreatic hormone peptide
217229	Ribosomal protein S5, C-terminal domain
217229	Ribosomal protein S5, N-terminal domain
217245	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
217246	Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid
217255	Ribosomal protein S7e
217280	EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains
217302	7 transmembrane receptor (rhodopsin family)
217302	7 transmembrane receptor (rhodopsin family)
217322	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
217328	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
217333	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
217342	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
217342	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
217344	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
217351	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
217351	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
217364	Glycosyl hydrolase family 85. Family of endo-beta-N-acetylglucosaminidases. These enzymes work on a broad spectrum of substrates
217369	7 transmembrane receptor (rhodopsin family)
217376	Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates
217377	Cyclophilin type peptidyl-prolyl cis-trans isomerase
217378	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ con
217379	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
217383	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
217384	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
217390	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
217397	Formate--tetrahydrofolate ligase
217397	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
217404	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
217409	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
217409	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
217451	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
217451	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
217463	PX domain. PX domains bind to phosphoinositides
217463	PXA domain. This domain is associated with PX domains pfam00787
217463	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
217480	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
217480	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
217480	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
217480	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
217493	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
217495	ENV polyprotein (coat polyprotein)
217503	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
217536	Ribosomal RNA adenine dimethylase
217540	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
217540	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
217547	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
217547	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
217558	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
217566	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
217573	Ribosomal protein L21e
217576	Ribosomal protein L31e
217578	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
217584	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
217584	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
217586	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
217593	Mitochondrial carrier protein
217614	Ribosomal protein S5, C-terminal domain
217614	Ribosomal protein S5, N-terminal domain
217616	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
217616	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
217628	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
217628	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
217630	Mak10 subunit, NatC N(alpha)-terminal acetyltransferase. NatC N(alpha)-terminal acetyltransferases contains Mak10p, Mak31p and Mak3p subunits. All three subunits are associated with each other to form the active complex
217632	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
217647	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
217670	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
217674	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
217682	PH domain. PH stands for pleckstrin homology
217684	Sushi domain (SCR repeat)
217690	Protein kinase domain
217690	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
217692	Rap/ran-GAP
217693	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
217695	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
217696	Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene
217702	ATP synthase delta (OSCP) subunit. The ATP D subunit from E. coli is the same as the OSCP subunit which is this family. The ATP D subunit from metazoa are found in family pfam00401
217715	Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fac
217733	Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length
217738	Thrombospondin type 1 domain
217766	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
217767	HMG (high mobility group) box
217777	Phosphoglycerate kinase
217779	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
217779	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
217786	Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
217786	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
217790	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
217810	Ribosomal protein S5, C-terminal domain
217826	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
217835	Vacuolar sorting protein 9 (VPS9) domain
217835	Vacuolar sorting protein 9 (VPS9) domain
217847	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
217849	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
217852	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
217854	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
217864	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
217864	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL
217866	Tropomyosin
217866	Protein kinase domain
217866	Intermediate filament protein
217866	Uncharacterized ACR, COG1579
217866	Protein kinase C terminal domain
217866	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
217866	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
217866	Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length
217866	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
217866	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, Apolipoprotein E
217866	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
217866	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
217866	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
217869	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
217869	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
217869	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
217886	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
217908	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
217916	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
217918	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
217921	HMG (high mobility group) box
217922	HMG (high mobility group) box
217926	HMG (high mobility group) box
217928	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
217929	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
217929	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
217930	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
217931	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
217932	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
217944	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
217957	Transcription initiation factor IIA, gamma subunit, beta-barrel domain. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIA (TFIIA) is a multi
217957	Transcription initiation factor IIA, gamma subunit, helical domain. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIA (TFIIA) is a multimeri
217958	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
217963	Ribosomal L29e protein family
217991	Ribosomal protein L21e
217999	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
218005	Elongation factor 1 gamma, conserved domain
218006	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
218024	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
218025	ADP-ribosylation factor family
218030	Homeobox domain
218038	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
218038	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
218039	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
218051	7 transmembrane receptor (rhodopsin family)
218054	7 transmembrane receptor (rhodopsin family)
218060	Ribosomal L29e protein family
218065	7 transmembrane receptor (rhodopsin family)
218066	7 transmembrane receptor (rhodopsin family)
218068	7 transmembrane receptor (rhodopsin family)
218075	Protein kinase domain
218110	Somatotropin hormone family
218146	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
218171	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
218184	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
218184	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
218184	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
218184	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
218203	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
218203	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
218210	Zn-finger in Ran binding protein and others
218210	Zn-finger in Ran binding protein and others
218214	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
218241	Protein kinase domain
218242	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
218243	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
218243	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
218244	Protein kinase domain
218268	Eukaryotic initiation factor 4E
218269	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and pfam00349. Some members of the family have two copies of each of these domains
218269	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
218271	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
218272	Protein kinase domain
218274	Ribosomal protein L21e
218275	Papain family cysteine protease
218275	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
218276	Malic enzyme, NAD binding domain
218282	Ribosomal protein L35Ae
218297	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
218298	SKIP/SNW domain. This domain is found in chromatin proteins
218299	Cyclophilin type peptidyl-prolyl cis-trans isomerase
218304	Trypsin
218307	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
218307	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
218310	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
218313	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
218314	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
218317	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi
218317	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
218322	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
218322	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
218324	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
218326	Ribosomal protein L10
218332	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
218332	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
218340	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
218341	Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines
218354	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
218357	ENV polyprotein (coat polyprotein)
218357	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
218357	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
218358	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
218358	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
218359	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
218360	Low molecular weight phosphotyrosine protein phosphatase
218362	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
218363	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
218363	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
218369	Actin
218372	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
218385	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
218385	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
218397	SH2 domain
218397	PH domain. PH stands for pleckstrin homology
218397	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
218402	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
218402	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
218411	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
218411	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
218411	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
218411	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
218412	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
218412	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
218419	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
218420	Hsp90 protein
218426	Ribosomal protein L23
218428	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
218428	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
218428	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
218432	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
218432	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
218440	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
218440	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
218441	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
218442	TMS membrane protein/tumour differentially expressed protein (TDE)
218447	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
218447	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
218456	Nucleoside diphosphate kinase
218461	3'5'-cyclic nucleotide phosphodiesterase
218473	EAP30. EAP30 is a subunit of the ELL complex. The ELL is an 80-kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of tr
218476	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
218476	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
218499	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
218501	Calponin family repeat
218501	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
218518	Occludin/ELL family
218518	Occludin/ELL family
218523	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
218523	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
218524	HMG (high mobility group) box
218538	Ets-domain
218569	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
218580	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
218581	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
218606	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
218606	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
218613	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
218614	Protein kinase domain
218614	Zinc finger, C3HC4 type (RING finger)
218617	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
218620	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
218624	Fibronectin type III domain
218630	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
218630	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
218631	Trypsin
218637	Ribosomal L39 protein
218639	ADP-ribosylation factor family
218658	Ribosomal protein L3
218663	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
218693	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA
218699	PX domain. PX domains bind to phosphoinositides
218704	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
218728	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
218728	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
218741	Ribosomal protein L19e
218744	Cadherin domain
218750	HMG (high mobility group) box
218751	7 transmembrane receptor (rhodopsin family)
218757	Protein kinase domain
218759	Protein kinase domain
218761	ENV polyprotein (coat polyprotein)
218769	Ribosomal protein L31e
218769	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
218772	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
218772	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
218779	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
218787	ENV polyprotein (coat polyprotein)
218788	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
218788	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
218792	Squalene/phytoene synthase
218799	ENV polyprotein (coat polyprotein)
218811	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24
218814	Core histone H2A/H2B/H3/H4
218827	Ribosomal protein L31e
218832	RNA polymerase alpha subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Members of this family
218832	RNA polymerase A/beta'/A" subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). This family includ
218840	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
218865	GMC oxidoreductase. This family of proteins bind FAD as a cofactor
218865	GMC oxidoreductase. This family of proteins bind FAD as a cofactor
218865	GMC oxidoreductase. This family of proteins bind FAD as a cofactor
218873	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
218877	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
218878	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is
218878	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
218933	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
218940	Ribosomal protein L6e
218945	CAP protein
218948	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
218955	Ribosomal protein L6
218963	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
218977	Guanylate-kinase-associated protein (GKAP) protein
218979	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
218986	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
218997	Ribosomal protein L19e
219002	7 transmembrane receptor (rhodopsin family)
219007	7 transmembrane receptor (rhodopsin family)
219009	7 transmembrane receptor (rhodopsin family)
219010	7 transmembrane receptor (rhodopsin family)
219011	7 transmembrane receptor (rhodopsin family)
219014	7 transmembrane receptor (rhodopsin family)
219016	7 transmembrane receptor (rhodopsin family)
219019	7 transmembrane receptor (rhodopsin family)
219021	7 transmembrane receptor (rhodopsin family)
219033	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
219033	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
219036	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
219037	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
219042	7 transmembrane receptor (rhodopsin family)
219048	7 transmembrane receptor (rhodopsin family)
219049	Proteasome A-type and B-type
219050	7 transmembrane receptor (rhodopsin family)
219054	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
219056	Enolase, C-terminal TIM barrel domain
219057	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
219058	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
219060	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
219061	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
219081	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
219102	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
219102	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
219103	Uncharacterized ACR, COG1579
219103	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
219106	Putative snoRNA binding domain. This family consists of various Pre RNA processing ribonucleoproteins. The function of the aligned region is unknown however it may be a common RNA or snoRNA or Nop1p binding domain. Nop5p (Nop58p) from yeast is the protein
219107	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
219110	Ribosomal S17
219126	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
219126	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
219133	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
219140	PH domain. PH stands for pleckstrin homology
219140	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
219140	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
219144	ADP-ribosylation factor family
219144	ADP-ribosylation factor family
219144	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
219145	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
219146	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
219147	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
219150	Homeobox domain
219167	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
219180	Ribosomal protein S5, C-terminal domain
219183	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
219183	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
219183	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
219186	HMG (high mobility group) box
219191	Ribosomal protein S6e
219230	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
219236	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
219252	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
219252	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
219254	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
219254	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
219257	Cadherin domain
219257	Cadherin domain
219293	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
219332	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
219333	Ubiquitin carboxyl-terminal hydrolase family 2
219333	Ubiquitin carboxyl-terminal hydrolases family 2
219346	Homeobox domain
219402	Translation initiation factor IF-3
219405	Ribosomal protein L21e
219409	Homeobox domain
219409	Homeobox domain
219414	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
219417	7 transmembrane receptor (rhodopsin family)
219421	7 transmembrane receptor (rhodopsin family)
219424	7 transmembrane receptor (rhodopsin family)
219425	7 transmembrane receptor (rhodopsin family)
219427	7 transmembrane receptor (rhodopsin family)
219428	7 transmembrane receptor (rhodopsin family)
219429	7 transmembrane receptor (rhodopsin family)
219430	7 transmembrane receptor (rhodopsin family)
219431	7 transmembrane receptor (rhodopsin family)
219432	7 transmembrane receptor (rhodopsin family)
219436	7 transmembrane receptor (rhodopsin family)
219437	7 transmembrane receptor (rhodopsin family)
219438	7 transmembrane receptor (rhodopsin family)
219439	7 transmembrane receptor (rhodopsin family)
219440	7 transmembrane receptor (rhodopsin family)
219442	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
219443	7 transmembrane receptor (rhodopsin family)
219444	7 transmembrane receptor (rhodopsin family)
219445	7 transmembrane receptor (rhodopsin family)
219447	7 transmembrane receptor (rhodopsin family)
219449	7 transmembrane receptor (rhodopsin family)
219450	7 transmembrane receptor (rhodopsin family)
219451	7 transmembrane receptor (rhodopsin family)
219453	7 transmembrane receptor (rhodopsin family)
219455	7 transmembrane receptor (rhodopsin family)
219459	7 transmembrane receptor (rhodopsin family)
219463	7 transmembrane receptor (rhodopsin family)
219464	7 transmembrane receptor (rhodopsin family)
219465	7 transmembrane receptor (rhodopsin family)
219467	7 transmembrane receptor (rhodopsin family)
219469	7 transmembrane receptor (rhodopsin family)
219473	7 transmembrane receptor (rhodopsin family)
219475	7 transmembrane receptor (rhodopsin family)
219477	7 transmembrane receptor (rhodopsin family)
219478	7 transmembrane receptor (rhodopsin family)
219479	7 transmembrane receptor (rhodopsin family)
219480	7 transmembrane receptor (rhodopsin family)
219481	7 transmembrane receptor (rhodopsin family)
219482	7 transmembrane receptor (rhodopsin family)
219484	7 transmembrane receptor (rhodopsin family)
219485	7 transmembrane receptor (rhodopsin family)
219487	7 transmembrane receptor (rhodopsin family)
219490	7 transmembrane receptor (rhodopsin family)
219491	7 transmembrane receptor (rhodopsin family)
219493	7 transmembrane receptor (rhodopsin family)
219494	7 transmembrane receptor (rhodopsin family)
219522	Fatty acid desaturase
219522	7 transmembrane receptor (rhodopsin family)
219523	7 transmembrane receptor (rhodopsin family)
219527	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
219527	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
219537	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
219539	Yippee putative zinc-binding protein
219542	Ribosomal protein S26e
219558	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
219558	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
219595	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
219635	Ribosomal protein L44
219662	Ribosomal protein L21e
219712	Hsp90 protein
219756	Protein kinase domain
219770	Connexin
219771	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
219771	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termi
219773	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
219809	Ribosomal L10
219843	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
219858	7 transmembrane receptor (rhodopsin family)
219865	7 transmembrane receptor (rhodopsin family)
219866	7 transmembrane receptor (rhodopsin family)
219868	7 transmembrane receptor (rhodopsin family)
219869	7 transmembrane receptor (rhodopsin family)
219870	7 transmembrane receptor (rhodopsin family)
219871	7 transmembrane receptor (rhodopsin family)
219872	7 transmembrane receptor (rhodopsin family)
219873	7 transmembrane receptor (rhodopsin family)
219874	7 transmembrane receptor (rhodopsin family)
219875	7 transmembrane receptor (rhodopsin family)
219891	Programmed cell death protein 2, C-terminal domain
219918	ENV polyprotein (coat polyprotein)
219927	Ribosomal prokaryotic L21 protein
219927	Ribosomal prokaryotic L21 protein
219928	7 transmembrane receptor (rhodopsin family)
219931	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
219952	7 transmembrane receptor (rhodopsin family)
219953	7 transmembrane receptor (rhodopsin family)
219953	7 transmembrane receptor (metabotropic glutamate family)
219954	7 transmembrane receptor (rhodopsin family)
219955	7 transmembrane receptor (rhodopsin family)
219956	7 transmembrane receptor (rhodopsin family)
219957	7 transmembrane receptor (rhodopsin family)
219958	7 transmembrane receptor (rhodopsin family)
219959	7 transmembrane receptor (rhodopsin family)
219960	7 transmembrane receptor (rhodopsin family)
219961	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
219962	7 transmembrane receptor (rhodopsin family)
219964	7 transmembrane receptor (rhodopsin family)
219965	7 transmembrane receptor (rhodopsin family)
219966	7 transmembrane receptor (rhodopsin family)
219967	7 transmembrane receptor (rhodopsin family)
219968	7 transmembrane receptor (rhodopsin family)
219972	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl
219979	7 transmembrane receptor (rhodopsin family)
219980	7 transmembrane receptor (rhodopsin family)
219981	7 transmembrane receptor (rhodopsin family)
219982	7 transmembrane receptor (rhodopsin family)
219983	7 transmembrane receptor (rhodopsin family)
219985	7 transmembrane receptor (rhodopsin family)
219986	7 transmembrane receptor (rhodopsin family)
219987	7 transmembrane receptor (rhodopsin family)
219995	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
219996	Zona pellucida-like domain
220001	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
220002	Cytochrome b561
220027	Adenylate and Guanylate cyclase catalytic domain
220056	Proteasome A-type and B-type
220061	Actinobacillus constitutively-expressed outer membrane lipoprotein A
220071	ENV polyprotein (coat polyprotein)
220077	PTB domain (IRS-1 type)
220079	NAD:arginine ADP-ribosyltransferase
220115	Protein-tyrosine phosphatase
220115	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
220147	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
220147	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
220148	Pyridoxal-dependent decarboxylase conserved domain
220148	Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined b
220359	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B d
220359	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
220429	Uncharacterized ACR, COG1579
220433	Ribosomal family S4e
220466	PH domain. PH stands for pleckstrin homology
220466	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
220486	Zinc finger, C3HC4 type (RING finger)
220486	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
220522	Gamma-glutamyltranspeptidase
220594	Ubiquitin carboxyl-terminal hydrolase family 2
220635	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
220667	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
220670	Ribosomal protein S2
220672	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
220686	Phosphatidylinositol 3- and 4-kinase
220717	Ribosomal protein L10
220729	FAD binding domain. This family includes members that bind FAD. This family includes the flavoprotein subunits from succinate and fumarate dehydrogenase, aspartate oxidase and the alpha subunit of adenylylsulfate reductase
220758	Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a
220763	Hsp90 protein
220830	PX domain. PX domains bind to phosphoinositides
220830	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
220832	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
220840	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
220848	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
220869	Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans
220895	Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity
220906	L1 transposable element
220959	Intermediate filament protein
220959	IncA protein. Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation
220959	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
220973	7 transmembrane receptor (rhodopsin family)
220984	Ribosomal protein L6
220988	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
220999	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
221009	Poly-adenylate binding protein, unique domain
221017	Intermediate filament protein
221035	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
221037	jmjC domain
221037	jmjC domain
221057	Ribosomal protein S2
221074	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
221078	NOL1/NOP2/sun family
221079	ADP-ribosylation factor family
221079	ADP-ribosylation factor family
221079	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
221079	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
221091	Fibronectin type III domain
221092	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
221092	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
221180	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
221184	C2 domain
221188	7 transmembrane receptor (Secretin family)
221188	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
221191	Trypsin
221223	Carboxylesterase
221259	ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with other
221264	Adenylate kinase
221267	Calponin family repeat
221267	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
221270	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
221278	Zinc finger, C3HC4 type (RING finger)
221278	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
221286	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
221300	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
221311	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
221324	Ribosomal protein L13
221324	7 transmembrane receptor (rhodopsin family)
221341	Regulator of chromosome condensation (RCC1)
221349	Ribosomal L10
221349	Nucleotide-sensitive chloride conductance regulator (ICln)
221356	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
221357	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation
221368	Ribosomal protein S8
221374	Ribosomal S17
221382	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
221382	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
221390	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
221391	7 transmembrane receptor (rhodopsin family)
221392	Ribosomal protein L15
221393	7 transmembrane receptor (Secretin family)
221395	7 transmembrane receptor (Secretin family)
221400	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
221424	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
221456	Kinesin motor domain
221460	Ribosomal protein S5, N-terminal domain
221496	LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1
221504	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
221509	Class II histocompatibility antigen, beta domain
221527	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
221527	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
221544	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
221547	ADP-ribosylation factor family
221547	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
221549	Class I Histocompatibility antigen, domains alpha 1 and 2
221550	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
221557	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
221566	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
221566	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
221567	7 transmembrane receptor (rhodopsin family)
221568	7 transmembrane receptor (rhodopsin family)
221570	7 transmembrane receptor (rhodopsin family)
221571	7 transmembrane receptor (rhodopsin family)
221572	ADP-ribosylation factor family
221572	7 transmembrane receptor (rhodopsin family)
221586	7 transmembrane receptor (rhodopsin family)
221608	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
221622	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
221644	Ribosomal L29e protein family
221654	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh
221654	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
221692	RPEL repeat. The RPEL repeat is named after four conserved amino acids it contains. The function of the RPEL repeat is unknown however it might be a DNA binding repeat based on the observation that one member contains a pfam02037 domain that is also impli
221697	Ribosomal L15
221703	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
221721	Ribosomal protein L21e
221740	NAC domain
221823	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosp
221831	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
221831	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
221831	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
221839	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
221839	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
221875	Tropomyosin
221889	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
221890	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
221914	Glypican
221935	Fibronectin type III domain
221937	Fork head domain
221937	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
221948	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
221955	Lipase (class 3)
221979	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
222005	Mitochondrial carrier protein
222009	Ribosomal protein L31e
222017	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
222017	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
222019	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
222032	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
222052	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
222068	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
222068	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
222103	Intermediate filament protein
222117	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
222258	WD domain, G-beta repeat
222346	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
222487	7 transmembrane receptor (Secretin family)
222487	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
222537	Sulfotransferase protein
222545	7 transmembrane receptor (metabotropic glutamate family)
222545	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
222546	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
222611	7 transmembrane receptor (Secretin family)
222642	TspO/MBR family. Tryptophan-rich sensory protein (TspO) is an integral membrane protein that acts as a negative regulator of the expression of specific photosynthesis genes in response to oxygen/light. It is involved in the efflux of porphyrin intermediat
222643	Death domain
222663	CUB domain
222696	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
222696	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
222699	BTG1 family. A novel family of anti-proliferative proteins
222865	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
222894	Helix-loop-helix DNA-binding domain
222901	Ribosomal protein L14p/L23e
222962	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their
223117	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
223126	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
223127	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
223127	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
223134	Ribosomal S3Ae family
223134	FMN-dependent dehydrogenase
223135	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
223149	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
223149	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
223155	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
223156	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
223156	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
223176	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
223176	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
223176	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
223177	Actin
223181	Actin
223196	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
223199	ENV polyprotein (coat polyprotein)
223200	Hsp90 protein
223202	Ribosomal S17
223227	HMG (high mobility group) box
223227	HMG (high mobility group) box
223227	HMG (high mobility group) box
223254	PH domain. PH stands for pleckstrin homology
223254	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
223254	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
223254	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
223261	Ribosomal protein S17
223262	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Membe
223267	Uncharacterised protein family (UPF0131). This family of proteins are uncharacterised, however BtrG is part of a butirosin-biosynthetic gene cluster from Bacillus circulans
223278	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
223278	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
223283	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
223283	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
223298	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
223319	WD domain, G-beta repeat
223328	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
223335	UDP-glucoronosyl and UDP-glucosyl transferase
223337	UDP-glucoronosyl and UDP-glucosyl transferase
223337	UDP-glucoronosyl and UDP-glucosyl transferase
223346	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
223364	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
223365	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
223365	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
223375	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
223385	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
223397	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
223398	RNA polymerase Rpb4
223398	Protein of unknown function DUF51. This family contains members in prokaryotes and eukaryotes. None of the members has a known function
223399	ENV polyprotein (coat polyprotein)
223404	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
223409	WD domain, G-beta repeat
223417	Cyclophilin type peptidyl-prolyl cis-trans isomerase
223425	Ribosomal protein S5, C-terminal domain
223425	Ribosomal protein S5, N-terminal domain
223435	Protein kinase domain
223435	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
223470	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
223470	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
223481	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
223481	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
223488	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
223496	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results
223499	WD domain, G-beta repeat
223499	Sof1-like domain. Sof1 is essential for cell growth and is a component of the nucleolar rRNA processing machinery
223500	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
223500	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
223502	Ribosomal protein L6e
223510	ENV polyprotein (coat polyprotein)
223510	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
223522	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
223522	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
223524	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
223526	S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure
223536	Ribosomal protein L6e
223539	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
223557	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
223573	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
223573	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
223582	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
223594	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
223595	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
223595	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
223603	NifU-like domain. This is an alignment of the carboxy-terminal domain. This is the only common region between the NifU protein from nitrogen-fixing bacteria and rhodobacterial species. The biochemical function of NifU is unknown
223626	4-hydroxybenzoyl-CoA thioesterase. This enzyme (EC 3.1.2.23) catalyses the final step in the biosynthesis of 4-hydroxybenzoate from 4-chlorobenzoate in the soil dwelling microbe Pseudomonas CBS-3
223628	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
223635	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
223635	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
223635	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
223635	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
223644	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
223649	Protein kinase domain
223650	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
223654	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
223666	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
223666	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
223693	Domain of unknown function
223701	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
223701	RPEL repeat. The RPEL repeat is named after four conserved amino acids it contains. The function of the RPEL repeat is unknown however it might be a DNA binding repeat based on the observation that one member contains a pfam02037 domain that is also impl
223706	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
223722	Acyl transferase domain
223723	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
223740	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
223745	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
223770	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
223770	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
223770	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
223772	BED zinc finger
223772	pfam02892, zf-BED, BED zinc finger
223773	pfam02892, zf-BED, BED zinc finger
223773	pfam02892, zf-BED, BED zinc finger
223775	Protein kinase domain
223776	Uncharacterized ACR, YdiU/UPF0061 family
223782	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
223797	Ribosomal protein L31e
223804	Sugar (and other) transporter
223811	Respiratory-chain NADH dehydrogenase, 49 Kd subunit
223814	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
223819	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
223825	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
223825	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
223827	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
223838	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
223838	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
223838	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
223838	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
223843	Homeobox domain
223855	Ribosomal protein L36e
223862	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
223862	Ribosomal protein S3, C-terminal domain. This family contains a central domain pfam00013, hence the amino and carboxyl terminal domains are stored separately. This is a minimal carboxyl-terminal domain. Some are much longer
223864	Cyclic nucleotide-binding domain
223864	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
223871	7 transmembrane receptor (rhodopsin family)
223874	7 transmembrane receptor (rhodopsin family)
223876	7 transmembrane receptor (rhodopsin family)
223878	7 transmembrane receptor (rhodopsin family)
223903	Ubiquinol-cytochrome C reductase hinge protein. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 'hinge' protein of the complex which is thought to mediate formatio
223908	Intermediate filament protein
223909	Intermediate filament protein
223910	Intermediate filament protein
223911	Intermediate filament protein
223915	Intermediate filament protein
223917	Intermediate filament protein
223918	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
223928	Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
223928	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
223931	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
223932	7 transmembrane receptor (rhodopsin family)
223934	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
223934	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
223981	ENV polyprotein (coat polyprotein)
223995	Cadherin domain
223998	Poly-adenylate binding protein, unique domain
223998	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
224004	ENV polyprotein (coat polyprotein)
224010	Ribosomal L29e protein family
224014	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
224014	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
224014	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
224020	Phosphatidylinositol 3- and 4-kinase
224020	Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation
224023	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
224023	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
224041	7 transmembrane receptor (rhodopsin family)
224041	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
224041	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
224041	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
224044	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
224045	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
224048	Proteasome A-type and B-type
224054	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
224073	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
224078	DNA polymerase epsilon subunit B. DNA polymerase epsilon is essential for cell viability and chromosomal DNA replication in budding yeast. In addition, DNA polymerase epsilon may be involved in DNA repair and cell-cycle checkpoint control. The enzyme cons
224087	TMS membrane protein/tumour differentially expressed protein (TDE)
224087	Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
224087	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
224092	GTPase of unknown function
224092	GTPase of unknown function
224092	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
224097	Ribosomal protein L31e
224105	Protein kinase domain
224105	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
224115	von Willebrand factor type D domain
224115	AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges
224119	Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
224119	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
224133	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
224134	Ribosomal protein S5, C-terminal domain
224134	Ribosomal protein S5, N-terminal domain
224137	Ribosomal L29e protein family
224139	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
224139	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
224145	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
224147	Proteasome A-type and B-type
224156	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
224161	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
224163	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
224173	Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
224173	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
224176	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
224176	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
224185	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
224185	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
224191	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
224201	CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and
224206	MCM2/3/5 family
224207	ENV polyprotein (coat polyprotein)
224211	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
224218	Protein kinase domain
224219	ENV polyprotein (coat polyprotein)
224224	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
224234	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
224243	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
224244	Ribosomal protein L13e
224250	PMP-22/EMP/MP20/Claudin family
224251	7 transmembrane receptor (rhodopsin family)
224256	7 transmembrane receptor (rhodopsin family)
224260	7 transmembrane receptor (rhodopsin family)
224264	7 transmembrane receptor (rhodopsin family)
224265	7 transmembrane receptor (rhodopsin family)
224270	7 transmembrane receptor (rhodopsin family)
224273	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
224273	QXW lectin repeat
224273	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
224281	NOL1/NOP2/sun family
224344	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
224381	FKBP-type peptidyl-prolyl cis-trans isomerase
224392	Ribosomal protein L21e
224418	Ribosomal protein L31e
224428	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
224432	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
224442	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
224443	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
224445	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
224480	Oxidoreductase FAD-binding domain
224480	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous
224485	Ribosomal protein L34e
224487	Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined b
224496	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
224496	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
224496	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
224497	Ribosomal L15
224498	Actin
224503	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
224504	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
224508	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
224508	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
224512	Cyclophilin type peptidyl-prolyl cis-trans isomerase
224532	Golgi 4-transmembrane spanning transporter
224533	Ornithine decarboxylase antizyme
224534	Proteasome A-type and B-type
224543	7 transmembrane receptor (rhodopsin family)
224544	7 transmembrane receptor (metabotropic glutamate family)
224544	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
224544	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
224546	Sugar (and other) transporter
224546	7 transmembrane receptor (metabotropic glutamate family)
224546	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
224547	7 transmembrane receptor (metabotropic glutamate family)
224548	ENV polyprotein (coat polyprotein)
224548	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
224549	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
224552	ENV polyprotein (coat polyprotein)
224552	7 transmembrane receptor (metabotropic glutamate family)
224552	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
224554	L1 transposable element
224564	HMG (high mobility group) box
224564	7 transmembrane receptor (metabotropic glutamate family)
224564	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
224569	7 transmembrane receptor (metabotropic glutamate family)
224569	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
224572	7 transmembrane receptor (metabotropic glutamate family)
224572	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
224573	Sugar (and other) transporter
224573	ENV polyprotein (coat polyprotein)
224573	7 transmembrane receptor (metabotropic glutamate family)
224573	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
224573	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
224574	7 transmembrane receptor (rhodopsin family)
224575	7 transmembrane receptor (rhodopsin family)
224576	7 transmembrane receptor (metabotropic glutamate family)
224576	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
224576	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
224578	7 transmembrane receptor (metabotropic glutamate family)
224578	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
224580	7 transmembrane receptor (metabotropic glutamate family)
224580	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
224582	7 transmembrane receptor (metabotropic glutamate family)
224582	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
224603	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
224605	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
224607	ENV polyprotein (coat polyprotein)
224607	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
224607	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
224607	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
224607	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
224616	Jacalin-like lectin domain. Proteins containing this domain are lectins. It is found in 1 to 6 copies in these proteins. The domain is also found in the animal prostatic spermine-binding protein
224627	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
224630	Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway
224650	Phosphotyrosine interaction domain (PTB/PID)
224650	Phosphotyrosine interaction domain (PTB/PID)
224650	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
224671	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
224687	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
224691	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
224691	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
224692	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
224694	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
224697	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
224697	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
224705	Vps52 / Sac2 family. Vps52 complexes with Vps53 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events
224705	Vps52 / Sac2 family. Vps52 complexes with Vps53 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events
224727	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
224742	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
224748	Class I Histocompatibility antigen, domains alpha 1 and 2
224752	Homeobox domain
224753	Class I Histocompatibility antigen, domains alpha 1 and 2
224753	Class I Histocompatibility antigen, domains alpha 1 and 2
224753	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
224753	Class I Histocompatibility antigen, domains alpha 1 and 2
224753	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
224754	Class I Histocompatibility antigen, domains alpha 1 and 2
224754	Class I Histocompatibility antigen, domains alpha 1 and 2
224756	Class I Histocompatibility antigen, domains alpha 1 and 2
224756	Class I Histocompatibility antigen, domains alpha 1 and 2
224756	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
224756	Class I Histocompatibility antigen, domains alpha 1 and 2
224756	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
224758	ENV polyprotein (coat polyprotein)
224758	Class I Histocompatibility antigen, domains alpha 1 and 2
224759	Class I Histocompatibility antigen, domains alpha 1 and 2
224761	Class I Histocompatibility antigen, domains alpha 1 and 2
224761	Class I Histocompatibility antigen, domains alpha 1 and 2
224761	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
224761	Class I Histocompatibility antigen, domains alpha 1 and 2
224761	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
224763	Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
224763	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
224764	7 transmembrane receptor (rhodopsin family)
224765	7 transmembrane receptor (rhodopsin family)
224766	7 transmembrane receptor (rhodopsin family)
224767	7 transmembrane receptor (rhodopsin family)
224768	7 transmembrane receptor (rhodopsin family)
224769	L1 transposable element
224769	Class I Histocompatibility antigen, domains alpha 1 and 2
224769	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
224771	7 transmembrane receptor (rhodopsin family)
224773	7 transmembrane receptor (rhodopsin family)
224776	7 transmembrane receptor (rhodopsin family)
224781	ENV polyprotein (coat polyprotein)
224782	Protein kinase domain
224792	7 transmembrane receptor (Secretin family)
224792	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
224794	Sulfatase
224794	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
224805	tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases
224807	Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length
224813	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
224836	Ubiquitin carboxyl-terminal hydrolase family 2
224847	ENV polyprotein (coat polyprotein)
224860	C2 domain
224860	PH domain. PH stands for pleckstrin homology
224860	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
224860	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
224870	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
224872	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
224872	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
224875	Protein kinase domain
224876	Protein kinase domain
224883	P53
224893	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
224894	Ribosomal protein L21e
224902	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
224903	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
224910	Adenylate kinase
224910	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
224914	Zinc finger, C3HC4 type (RING finger)
224916	7 transmembrane receptor (metabotropic glutamate family)
224916	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
224920	Protein kinase domain
224923	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
224923	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
224933	Cyclophilin type peptidyl-prolyl cis-trans isomerase
224949	Ribosomal protein S6e
224954	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
224955	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
224997	Guanylate-kinase-associated protein (GKAP) protein
225010	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
225013	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
225013	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
225014	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
225019	Ribosomal protein L13e
225021	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
225022	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
225028	Protein kinase domain
225028	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
225029	Ribosomal protein S2
225030	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
225030	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
225033	Ribosomal protein L31e
225046	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
225055	Putative zinc finger in N-recognin
225058	Ribosomal protein S24e
225065	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
225077	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
225077	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
225077	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
225077	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
225093	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
225095	ENV polyprotein (coat polyprotein)
225096	Actin
225099	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
225099	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
225101	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
225103	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
225104	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
225104	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
225104	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
225105	Cyclophilin type peptidyl-prolyl cis-trans isomerase
225107	Ribosomal protein L3
225114	Cadherin domain
225115	Villin headpiece domain
225118	Eukaryotic protein of unknown function, DUF279
225127	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
225127	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
225127	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
225129	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
225129	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
225129	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
225134	Ribosomal family S4e
225139	SH2 domain
225141	Ribosomal protein S21e
225141	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
225149	ENV polyprotein (coat polyprotein)
225151	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
225151	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
225151	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
225151	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
225151	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
225152	Connexin
225154	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
225154	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
225156	ENV polyprotein (coat polyprotein)
225160	Death domain
225187	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
225187	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
225192	7 transmembrane receptor (rhodopsin family)
225195	Ribosomal protein S2
225201	Ribosomal L29e protein family
225215	Ribosomal protein L24e
225225	Fibrillarin
225227	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
225234	Ribosomal L10
225256	Cadherin domain
225256	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
225257	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
225264	Pyruvate kinase, alpha/beta domain
225264	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
225264	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
225266	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
225273	Ribosomal protein S2
225277	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
225277	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
225307	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
225314	Macrophage migration inhibitory factor (MIF)
225317	Ribosomal protein S8e
225326	Phosphatidylinositol 3- and 4-kinase
225326	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
225326	Phosphatidylinositol 3- and 4-kinase
225326	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
225326	Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation
225337	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
225339	Protein of unknown function DUF51. This family contains members in prokaryotes and eukaryotes. None of the members has a known function
225341	LIM domain. This family represents two copies of the LIM structural domain
225343	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
225348	Utp21 specific WD40 associated domain. Utp21 is a subunit of U3 snoRNP, which is essential for synthesis of 18S rRNA
225350	Glycoprotease family
225353	HMG (high mobility group) box
225362	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
225367	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
225367	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
225372	Phosphotyrosine interaction domain (PTB/PID)
225387	L1 transposable element
225387	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
225399	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
225416	Ribosomal protein S19e
225432	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
225442	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
225445	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
225446	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
225470	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
225482	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
225482	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
225495	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
225495	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
225526	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
225526	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
225527	HMG (high mobility group) box
225543	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
225550	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
225550	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
225552	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
225552	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
225559	Glutamine synthetase, catalytic domain
225559	Glutamine synthetase, beta-Grasp domain
225579	AMP-binding enzyme
225586	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
225586	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
225600	3'5'-cyclic nucleotide phosphodiesterase
225620	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
225620	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
225631	Homeobox domain
225636	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
225636	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
225638	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b
225681	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
225681	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
225728	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
225742	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
225742	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
225742	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
225743	Zinc finger, C3HC4 type (RING finger)
225761	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
225761	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
225763	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
225799	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
225799	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
225805	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
225807	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
225812	SKIP/SNW domain. This domain is found in chromatin proteins
225814	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
225817	Cyclophilin type peptidyl-prolyl cis-trans isomerase
225822	Ribosomal L29e protein family
225826	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
225826	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
225827	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
225845	NLP/P60 family. The function of this domain is unknown. It is found in several lipoproteins
225849	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
225865	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
225870	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
225875	Fibronectin type III domain
225875	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
225876	F-box domain
225876	jmjC domain
225876	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
225898	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes
225903	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
225912	Cytochrome b561
225919	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
225924	Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses
225926	7 transmembrane receptor (rhodopsin family)
225932	7 transmembrane receptor (rhodopsin family)
225934	7 transmembrane receptor (rhodopsin family)
225935	7 transmembrane receptor (rhodopsin family)
225936	7 transmembrane receptor (rhodopsin family)
225939	7 transmembrane receptor (rhodopsin family)
225940	7 transmembrane receptor (rhodopsin family)
225941	7 transmembrane receptor (rhodopsin family)
225943	Trypsin
225951	7 transmembrane receptor (rhodopsin family)
225956	7 transmembrane receptor (rhodopsin family)
225957	ENV polyprotein (coat polyprotein)
225957	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
225958	Protein kinase domain
225959	7 transmembrane receptor (rhodopsin family)
225961	7 transmembrane receptor (rhodopsin family)
225962	7 transmembrane receptor (rhodopsin family)
225963	7 transmembrane receptor (rhodopsin family)
225970	7 transmembrane receptor (rhodopsin family)
225971	7 transmembrane receptor (rhodopsin family)
225972	7 transmembrane receptor (rhodopsin family)
225974	Ribosomal protein L13e
225997	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a
225997	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
225998	metallopeptidase family M24
225998	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
226017	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
226017	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
226025	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
226025	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
226030	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
226030	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
226036	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
226041	Phosphoglucomutase/phosphomannomutase, C-terminal domain
226041	Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I
226041	Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II
226041	pfam02878, PGM_PMM_I, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I
226041	pfam02879, PGM_PMM_II, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II
226041	pfam02880, PGM_PMM_III, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain III
226043	Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans
226047	ADP-ribosylation factor family
226047	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
226049	DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai
226063	Ribosomal protein L36e
226081	Dynein light chain type 1
226082	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
226086	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
226097	Proteasome A-type and B-type
226099	Exocyst complex subunit Sec15-like
226100	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
226101	C2 domain
226101	C2 domain
226101	C2 domain
226102	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
226105	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
226106	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
226108	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
226118	Adenovirus minor core protein PV
226119	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
226121	Ribosomal protein L44
226122	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
226123	MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000
226123	Immunoglobulin A1 protease. This family consists of immunoglobulin A1 protease proteins. The immunoglobulin A1 protease cleaves immunoglobulin IgA and is found in pathogenic bacteria such as Neisseria gonorrhoeae. Not all of the members of this family are
226143	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
226144	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
226160	Ribosomal protein L6, N-terminal domain
226180	Intermediate filament protein
226180	Intermediate filament protein
226182	WD domain, G-beta repeat
226200	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results
226201	Ribosomal protein L13
226212	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
226248	Macrophage migration inhibitory factor (MIF)
226250	PH domain. PH stands for pleckstrin homology
226251	LIM domain. This family represents two copies of the LIM structural domain
226251	Villin headpiece domain
226251	LIM domain. This family represents two copies of the LIM structural domain
226265	Enolase, C-terminal TIM barrel domain
226273	Fibronectin type III domain
226278	7 transmembrane receptor (rhodopsin family)
226292	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
226294	Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site
226294	Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases
226304	7 transmembrane receptor (rhodopsin family)
226310	Ribosomal protein S19
226313	S-adenosyl-L-homocysteine hydrolase
226317	Ribosomal protein S5, C-terminal domain
226322	ENV polyprotein (coat polyprotein)
226325	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
226326	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
226335	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
226335	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
226336	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
226338	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
226343	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
226344	WD domain, G-beta repeat
226348	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
226348	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
226349	Ribosomal S3Ae family
226352	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
226359	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
226376	GDP dissociation inhibitor
226387	NAC domain
226393	7 transmembrane receptor (rhodopsin family)
226409	HNH endonuclease
226409	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
226409	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
226413	Glycosyl hydrolase family 1
226416	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
226421	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
226432	Importin-beta N-terminal domain
226438	Fibronectin type III domain
226439	Helix-loop-helix DNA-binding domain
226443	Porphobilinogen deaminase, C-terminal domain
226443	Porphobilinogen deaminase, dipyromethane cofactor binding domain
226462	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
226472	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
226472	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
226473	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
226473	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
226475	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
226478	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
226479	Core histone H2A/H2B/H3/H4
226491	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
226494	Hsp90 protein
226496	Ribosomal protein S8e
226503	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
226513	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
226519	Laminin B (Domain IV)
226519	Laminin N-terminal (Domain VI)
226519	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
226519	Laminin B (Domain IV)
226519	Laminin N-terminal (Domain VI)
226519	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
226519	Methyl-accepting chemotaxis protein (MCP) signaling domain. This domain is thought to transduce the signal to CheA since it is highly conserved in very diverse MCPs
226519	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
226519	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
226523	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
226527	Zinc-binding dehydrogenase
226541	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
226554	Dynamin family
226554	PH domain. PH stands for pleckstrin homology
226561	Eukaryotic initiation factor 4E
226564	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
226565	Thi4 family. This family includes a putative thiamine biosynthetic enzyme
226565	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
226565	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
226565	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
226591	TIP41-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 interacts with TAP42 and negatively regulates the TOR signaling pathway
226601	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
226601	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
226602	ENV polyprotein (coat polyprotein)
226602	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
226602	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
226604	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
226604	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
226640	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
226641	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
226641	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
226643	Ribosomal protein S24e
226646	Respiratory-chain NADH dehydrogenase, 49 Kd subunit
226654	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
226673	Protein-tyrosine phosphatase
226673	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
226685	Pentaxin family. Pentaxins are also known as pentraxins
226689	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
226689	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
226690	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
226691	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
226695	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
226695	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
226702	7 transmembrane receptor (rhodopsin family)
226705	7 transmembrane receptor (rhodopsin family)
226710	7 transmembrane receptor (rhodopsin family)
226711	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
226720	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
226723	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
226732	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
226733	Ribosomal protein L44
226735	Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen
226740	Ribosomal L29e protein family
226745	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
226745	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
226745	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
226751	Uncharacterized ACR, COG1579
226751	Intermediate filament protein
226751	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
226751	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
226751	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
226751	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
226778	Kinase associated domain 1
226781	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
226783	Homeobox domain
226786	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
226786	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
226791	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
226822	Spumavirus gag protein
226823	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
226830	MYND finger
226849	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
226856	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
226859	Cyclin-dependent kinase regulatory subunit
226861	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
226866	Recombination protein O. Recombination protein O (RecO) is involved in DNA repair and pfam00470 pathway recombination
226871	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
226874	NUDIX domain
226875	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
226877	HMG (high mobility group) box
226878	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
226878	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
226882	Cyclophilin type peptidyl-prolyl cis-trans isomerase
226884	ENV polyprotein (coat polyprotein)
226884	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
226884	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
226884	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
226884	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
226885	HMG (high mobility group) box
226896	Transcription factor AP-2
226922	KCNQ1 voltage-gated potassium channel
226922	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
226922	Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors
226922	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
226928	C2 domain
226936	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
226936	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
226948	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
226948	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
226949	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
226950	Elongation factor 1 gamma, conserved domain
226950	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
226950	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
226952	Protein kinase domain
226952	Uncharacterized protein family (ORF7) DUF. Several members of this family are Borrelia burgdorferi plasmid proteins of uncharacterized function
226954	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
226955	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
226956	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
226970	PH domain. PH stands for pleckstrin homology
226970	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
226970	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
226982	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
226982	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
226985	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
226999	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
227000	Cyclophilin type peptidyl-prolyl cis-trans isomerase
227035	ADP-ribosylation factor family
227036	ADP-ribosylation factor family
227036	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
227036	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
227037	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
227038	FKBP-type peptidyl-prolyl cis-trans isomerase
227042	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
227044	'chromo' (CHRromatin Organization MOdifier) domain
227044	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
227044	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
227045	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
227054	Ribosomal protein L23
227054	Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a
227055	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
227056	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
227058	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
227059	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
227061	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
227078	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
227089	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
227089	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
227090	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
227090	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
227092	HMG (high mobility group) box
227102	ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER
227114	NAC domain
227114	Orbivirus outer capsid protein VP2. VP2 acts as an anchor for VP1 and VP3. VP2 contains a non-specific DNA and RNA binding domain in the N-terminus
227114	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
227119	Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue
227120	C2 domain
227120	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
227134	Homeobox domain
227150	Ribosomal protein L31e
227157	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
227231	D-ala D-ala ligase. This family contains D-alanine--D-alanine ligase enzymes EC:6.3.2.4
227231	Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosyn
227231	Carbamoyl-phosphate synthase L chain, ATP binding domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosy
227231	Carbamoyl-phosphate synthetase large chain, oligomerisation domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. The carbamoyl-phosphate synthase (CPS) enzyme in prokar
227231	MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also i
227234	Kinesin motor domain
227237	Ribosomal protein L21e
227242	Ribosomal protein L21e
227261	Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5
227261	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
227266	Ribosomal protein L21e
227288	7 transmembrane receptor (rhodopsin family)
227288	7 transmembrane receptor (rhodopsin family)
227289	7 transmembrane receptor (rhodopsin family)
227292	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
227292	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
227309	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
227318	L1 transposable element
227320	L1 transposable element
227326	7 transmembrane receptor (rhodopsin family)
227327	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
227333	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
227333	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
227333	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
227333	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
227334	Ubiquitin carboxyl-terminal hydrolase family 2
227334	Ubiquitin carboxyl-terminal hydrolases family 2
227335	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
227336	UDP-glucoronosyl and UDP-glucosyl transferase
227354	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
227354	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
227354	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
227354	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
227367	7 transmembrane receptor (rhodopsin family)
227377	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
227384	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
227388	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
227393	Dienelactone hydrolase family
227393	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
227394	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
227394	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
227396	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
227397	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
227399	Histidine acid phosphatase
227403	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
227403	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
227407	ENV polyprotein (coat polyprotein)
227411	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
227411	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
227411	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
227412	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
227416	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
227421	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
227425	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
227435	Domain of unknown function
227435	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
227435	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
227435	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
227449	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
227458	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
227460	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
227461	Ribosomal protein S6e
227462	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
227485	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
227489	CCR4-Not complex component, Not1. The Ccr4-Not complex is a global regulator of transcription that affects genes positively and negatively and is thought to regulate transcription factor TFIID
227513	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
227513	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
227517	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
227517	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
227517	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
227522	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
227532	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
227559	Ribosomal protein L23
227564	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
227566	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
227567	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
227567	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
227570	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
227570	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
227572	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
227572	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
227588	Eukaryotic initiation factor 1A
227599	AMP-binding enzyme
227606	Transcription factor TFIID (or TATA-binding protein, TBP)
227619	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
227620	UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate urid
227624	ADP-ribosylation factor family
227624	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
227627	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
227630	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
227632	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
227638	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
227656	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
227659	Sugar (and other) transporter
227671	Glycosyltransferase family 6
227677	Homeobox domain
227729	Ribosomal L15
227731	Mitochondrial carrier protein
227733	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal
227736	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
227744	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
227746	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
227757	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
227770	7 transmembrane receptor (rhodopsin family)
227772	7 transmembrane receptor (rhodopsin family)
227775	7 transmembrane receptor (rhodopsin family)
227777	Eukaryotic ribosomal protein L18
227778	7 transmembrane receptor (rhodopsin family)
227780	7 transmembrane receptor (rhodopsin family)
227785	NAC domain
227786	7 transmembrane receptor (rhodopsin family)
227787	7 transmembrane receptor (rhodopsin family)
227788	7 transmembrane receptor (rhodopsin family)
227789	7 transmembrane receptor (rhodopsin family)
227794	7 transmembrane receptor (rhodopsin family)
227796	7 transmembrane receptor (rhodopsin family)
227800	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukary
227801	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
227802	7 transmembrane receptor (Secretin family)
227802	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
227827	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
227827	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
227847	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
227847	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
227852	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
227859	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
227859	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
227885	Hsp90 protein
227888	Inorganic pyrophosphatase
227906	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
227920	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
227920	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
227937	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
227940	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
227940	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
227945	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
227968	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
227968	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
227974	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
227974	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
227989	14-3-3 protein
227992	Ribosomal protein L21e
228003	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
228005	Cyclophilin type peptidyl-prolyl cis-trans isomerase
228026	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
228033	ATP synthase subunit C
228065	GAF domain. Domain present in phytochromes and cGMP-specific phosphodiesterases
228123	Ribosomal family S4e
228124	Cyclophilin type peptidyl-prolyl cis-trans isomerase
228126	Enolase, C-terminal TIM barrel domain
228139	ATP P2X receptor
228145	7 transmembrane receptor (rhodopsin family)
228151	Fatty acid desaturase
228151	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
228152	7 transmembrane receptor (rhodopsin family)
228153	7 transmembrane receptor (rhodopsin family)
228157	7 transmembrane receptor (rhodopsin family)
228161	7 transmembrane receptor (rhodopsin family)
228166	7 transmembrane receptor (rhodopsin family)
228176	7 transmembrane receptor (rhodopsin family)
228178	7 transmembrane receptor (rhodopsin family)
228184	7 transmembrane receptor (rhodopsin family)
228185	7 transmembrane receptor (rhodopsin family)
228186	7 transmembrane receptor (rhodopsin family)
228189	ENV polyprotein (coat polyprotein)
228189	7 transmembrane receptor (rhodopsin family)
228189	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
228196	7 transmembrane receptor (rhodopsin family)
228196	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
228199	Dynamin family
228199	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
228204	7 transmembrane receptor (rhodopsin family)
228206	7 transmembrane receptor (rhodopsin family)
228209	7 transmembrane receptor (rhodopsin family)
228210	7 transmembrane receptor (rhodopsin family)
228212	7 transmembrane receptor (rhodopsin family)
228213	7 transmembrane receptor (rhodopsin family)
228218	7 transmembrane receptor (rhodopsin family)
228228	7 transmembrane receptor (rhodopsin family)
228235	ADP-ribosylation factor family
228237	Protein kinase domain
228238	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
228239	7 transmembrane receptor (rhodopsin family)
228243	7 transmembrane receptor (rhodopsin family)
228252	7 transmembrane receptor (rhodopsin family)
228253	7 transmembrane receptor (rhodopsin family)
228256	7 transmembrane receptor (rhodopsin family)
228260	7 transmembrane receptor (rhodopsin family)
228263	7 transmembrane receptor (rhodopsin family)
228269	7 transmembrane receptor (rhodopsin family)
228271	7 transmembrane receptor (rhodopsin family)
228272	7 transmembrane receptor (rhodopsin family)
228273	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
228274	7 transmembrane receptor (rhodopsin family)
228275	7 transmembrane receptor (rhodopsin family)
228276	7 transmembrane receptor (rhodopsin family)
228278	7 transmembrane receptor (rhodopsin family)
228279	7 transmembrane receptor (rhodopsin family)
228281	7 transmembrane receptor (rhodopsin family)
228282	7 transmembrane receptor (rhodopsin family)
228284	7 transmembrane receptor (rhodopsin family)
228285	7 transmembrane receptor (rhodopsin family)
228287	7 transmembrane receptor (rhodopsin family)
228288	7 transmembrane receptor (rhodopsin family)
228292	7 transmembrane receptor (rhodopsin family)
228294	7 transmembrane receptor (rhodopsin family)
228295	7 transmembrane receptor (rhodopsin family)
228298	7 transmembrane receptor (rhodopsin family)
228299	7 transmembrane receptor (rhodopsin family)
228300	7 transmembrane receptor (rhodopsin family)
228304	7 transmembrane receptor (rhodopsin family)
228307	7 transmembrane receptor (rhodopsin family)
228308	7 transmembrane receptor (rhodopsin family)
228311	7 transmembrane receptor (rhodopsin family)
228318	7 transmembrane receptor (rhodopsin family)
228319	7 transmembrane receptor (rhodopsin family)
228325	7 transmembrane receptor (rhodopsin family)
228337	7 transmembrane receptor (rhodopsin family)
228345	7 transmembrane receptor (rhodopsin family)
228355	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
228355	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
228355	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
228355	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
228355	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
228355	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
228357	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
228364	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
228364	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
228366	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyl
228377	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
228382	Mitochondrial carrier protein
228408	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
228408	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
228413	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy
228418	Mitochondrial carrier protein
228419	S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure
228421	Kinesin motor domain
228430	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
228435	7 transmembrane receptor (rhodopsin family)
228437	7 transmembrane receptor (rhodopsin family)
228438	7 transmembrane receptor (rhodopsin family)
228439	7 transmembrane receptor (rhodopsin family)
228441	7 transmembrane receptor (rhodopsin family)
228442	7 transmembrane receptor (rhodopsin family)
228443	7 transmembrane receptor (rhodopsin family)
228447	7 transmembrane receptor (rhodopsin family)
228448	7 transmembrane receptor (rhodopsin family)
228449	7 transmembrane receptor (rhodopsin family)
228454	7 transmembrane receptor (rhodopsin family)
228464	7 transmembrane receptor (rhodopsin family)
228465	7 transmembrane receptor (rhodopsin family)
228466	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
228466	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
228470	7 transmembrane receptor (rhodopsin family)
228475	7 transmembrane receptor (rhodopsin family)
228482	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
228507	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
228515	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
228549	Core histone H2A/H2B/H3/H4
228550	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
228564	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
228569	SH2 domain
228598	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
228602	Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans
228604	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
228608	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
228662	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
228666	ENV polyprotein (coat polyprotein)
228692	Appr-1"-p processing enzyme family. This domain is found in a number of otherwise unrelated proteins. This domain is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA vi
228710	Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration
228712	Ribosomal protein S5, C-terminal domain
228712	Ribosomal protein S5, N-terminal domain
228714	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
228714	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
228731	Homeobox domain
228731	Homeobox domain
228732	Cyclophilin type peptidyl-prolyl cis-trans isomerase
228756	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
228756	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family
228776	Ribosomal protein L10
228776	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
228785	Protein kinase domain
228785	Phosphatidylinositol 3- and 4-kinase
228796	LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
228796	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
228798	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
228804	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
228809	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
228814	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
228836	Guanylate-kinase-associated protein (GKAP) protein
228861	WAP-type (Whey Acidic Protein) 'four-disulfide core'
228861	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
228869	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
228876	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
228880	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
228880	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
228893	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
228942	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
228960	Syntaxin
228961	Sec63 domain
228961	Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase
228966	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
228983	Oxysterol-binding protein
228993	Sugar (and other) transporter
228994	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment
228994	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f
228998	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
229003	UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near t
229004	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
229005	Protein kinase domain
229013	7 transmembrane receptor (rhodopsin family)
229017	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
229027	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
229038	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
229043	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
229051	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
229051	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
229055	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
229055	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
229064	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
229069	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
229070	Phosphoglycerate mutase family
229070	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
229076	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
229076	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
229087	HMG (high mobility group) box
229089	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
229089	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
229090	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
229091	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
229091	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
229095	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
229096	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment
229098	ENV polyprotein (coat polyprotein)
229105	Hsp90 protein
229109	Phosphoglycerate kinase
229121	Cyclophilin type peptidyl-prolyl cis-trans isomerase
229142	ENV polyprotein (coat polyprotein)
229152	Cullin family
229176	Ribosomal protein S6e
229189	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
229206	Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids
229207	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
229209	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
229209	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
229219	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
229228	NUDIX domain
229228	NUDIX domain
229228	NUDIX domain
229269	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
229270	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
229272	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
229272	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
229274	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
229276	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
229277	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
229278	Calx-beta domain
229280	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
229285	MIT domain
229309	Phosphoglycerate kinase
229323	7 transmembrane receptor (rhodopsin family)
229330	Carboxylesterase
229330	Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl
229333	Carboxylesterase
229333	Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl
229337	Carboxylesterase
229350	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
229352	Hsp90 protein
229354	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
229354	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
229363	Glutamine amidotransferase class-I
229363	tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs
229363	GMP synthase C terminal domain. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. This family is the C-terminal domain specific to the GMP synthases EC:6.3.5.2. In prokaryotes this domain mediates dimerisation. E
229366	7 transmembrane receptor (metabotropic glutamate family)
229367	AMP-binding enzyme
229369	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
229371	Ribosomal protein S6e
229373	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
229384	Ribosomal protein L35Ae
229389	Ribosomal protein S6e
229389	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
229389	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
229391	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
229413	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
229418	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
229419	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
229425	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
229425	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
229458	Cadherin domain
229459	Cadherin domain
229468	Ribosomal L28e protein family
229476	Tc5 transposase
229476	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di
229476	CENP-B protein. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding domain, which binds to a corresponding 17bp CENP-B box sequence. In the C-terminal 59 residues,
229476	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
229482	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
229488	Androgen receptor
229500	Ets-domain
229504	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
229505	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
229506	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
229517	Mitochondrial carrier protein
229517	Mitochondrial carrier protein
229521	C2 domain
229539	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
229541	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
229541	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
229544	Ribosomal protein S5, C-terminal domain
229544	Ribosomal protein S5, N-terminal domain
229546	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
229547	PX domain. PX domains bind to phosphoinositides
229553	Macrophage migration inhibitory factor (MIF)
229560	Ribosomal protein L21e
229563	Ribosomal protein L13e
229569	Homeobox domain
229569	Integral membrane protein DUF110. This archaebacterial protein family has no known function. Some members of this family are annotated as FlaJ, however we can find no supporting evidence for this annotation
229570	14-3-3 protein
229571	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
229580	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
229584	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
229589	DHHA2 domain. This domain is often found adjacent to the DHH domain pfam01368 and is called DHHA2 for DHH associated domain. This domain is diagnostic of DHH subfamily 2 members. The domain is about 120 residues long and contains a conserved DXK motif at
229607	Core histone H2A/H2B/H3/H4
229608	Core histone H2A/H2B/H3/H4
229608	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
229615	pfam02891, zf-MIZ, MIZ zinc finger
229615	pfam02891, zf-MIZ, MIZ zinc finger
229622	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
229626	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
229631	14-3-3 protein
229636	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
229639	L1 transposable element
229644	B-box zinc finger
229644	Filamin/ABP280 repeat
229644	B-box zinc finger
229644	Filamin/ABP280 repeat
229663	'Cold-shock' DNA-binding domain
229665	Adenosine/AMP deaminase
229681	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
229688	Glycosyl hydrolases family 18
229691	7 transmembrane receptor (rhodopsin family)
229697	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whic
229699	Sugar (and other) transporter
229700	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
229706	Sodium:neurotransmitter symporter family
229709	S-adenosyl-L-homocysteine hydrolase
229710	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
229710	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
229714	7 transmembrane receptor (rhodopsin family)
229729	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
229729	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
229731	Mitochondrial carrier protein
229739	Conserved hypothetical protein 95
229739	Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat
229740	Ribosomal protein L34e
229746	Ribosomal family S4e
229751	Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwee
229751	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str
229752	Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwee
229752	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str
229756	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
229756	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
229772	Ets-domain
229776	Protein-tyrosine phosphatase
229776	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
229782	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
229785	Ribosomal protein S19e
229785	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
229791	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
229791	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
229799	Ribosomal protein L14p/L23e
229803	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
229803	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
229810	Enolase, C-terminal TIM barrel domain
229824	Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15
229837	Clathrin adaptor complex small chain
229837	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
229841	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
229841	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
229841	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
229851	Hsp90 protein
229851	Ribosomal L29e protein family
229857	Ribosomal protein L44
229869	Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue
229871	Zinc-binding dehydrogenase
229871	Prephenate dehydrogenase. Members of this family are prephenate dehydrogenases EC:1.3.1.12 involved in tyrosine biosynthesis
229879	Nucleoside diphosphate kinase
229893	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
229898	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
229898	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
229900	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
229900	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
229902	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
229902	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
229905	Aminotransferase class I and II
229905	DegT/DnrJ/EryC1/StrS aminotransferase family. The members of this family are probably all pyridoxal-phosphate-dependent aminotransferase enzymes with a variety of molecular functions. The family includes StsA, StsC and StsS. The aminotransferase activity
229905	Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OAH
229910	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
229910	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
229920	Ribosomal protein L21e
229921	Ribosomal protein L6
229949	Adenylate kinase
229958	Protein-tyrosine phosphatase
229958	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
229979	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
229992	PWI domain
230002	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
230002	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
230002	Glucuronate isomerase. This is a family of Glucuronate isomerases also known as D-glucuronate isomerase, uronic isomerase, uronate isomerase, or uronic acid isomerase, EC:5.3.1.12. This enzyme catalyses the reactions: D-glucuronate <=> D-fructuronate and
230003	Ribosomal protein S17
230006	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
230010	Poly-adenylate binding protein, unique domain
230010	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
230013	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
230013	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
230025	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
230042	Ribosomal protein S6e
230043	GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins
230044	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
230044	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
230045	Hsp90 protein
230045	Macrophage migration inhibitory factor (MIF)
230046	Ribosomal protein S6e
230054	HMG (high mobility group) box
230069	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
230082	Nrap protein. Members of this family are nucleolar RNA-associated proteins (Nrap) which are highly conserved from yeast (Saccharomyces cerevisiae) to human. In the mouse, Nrap is ubiquitously expressed and is specifically localized in the nucleolus. Nrap
230098	PH domain. PH stands for pleckstrin homology
230098	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
230099	Eukaryotic-type carbonic anhydrase
230103	Protein kinase domain
230103	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
230103	Protein kinase domain
230103	Adenylate and Guanylate cyclase catalytic domain
230103	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
230119	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
230125	Mitochondrial carrier protein
230126	SH2 domain
230145	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
230163	Fructose-bisphosphate aldolase class-I
230173	ENV polyprotein (coat polyprotein)
230174	Elongation factor 1 gamma, conserved domain
230175	Ribosomal protein L13e
230185	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
230188	7 transmembrane receptor (rhodopsin family)
230195	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
230197	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
230220	Core histone H2A/H2B/H3/H4
230234	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
230235	DOMON domain. The DOMON (named after dopamine beta-monooxygenase N-terminal) domain is 110-125 residues long. It is predicted to form an all beta fold with 7-8 strands. The domain may mediate extracellular adhesive interactions
230251	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
230251	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
230251	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
230253	Ribosomal protein S2
230257	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
230260	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
230260	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
230280	TNF(Tumor Necrosis Factor) family
230287	Enolase, C-terminal TIM barrel domain
230301	Ribosomal protein S5, N-terminal domain
230310	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
230316	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
230316	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
230337	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
230337	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
230341	Ribosomal protein L3
230344	Calx-beta domain
230346	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
230347	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
230347	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
230358	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
230358	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
230367	Ribosomal RNA adenine dimethylase
230396	Interferon alpha/beta domain
230396	Interferon alpha/beta domain
230397	Interferon alpha/beta domain
230398	Interferon alpha/beta domain
230401	Interferon alpha/beta domain
230405	Interferon alpha/beta domain
230405	Interferon alpha/beta domain
230411	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
230419	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
230419	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
230431	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
230431	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
230437	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
230437	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
230448	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
230459	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
230460	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
230461	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
230465	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
230466	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
230474	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
230483	Homeobox domain
230493	Ubiquitin carboxyl-terminal hydrolase, family 1
230494	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
230494	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
230514	Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protei
230558	Low-density lipoprotein receptor domain class A
230558	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assemb
230562	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
230582	Oxidoreductase FAD-binding domain
230582	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
230582	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous
230592	Ribosomal protein L6
230594	PAP/25A associated domain
230594	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to
230596	PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is abou
230612	Sodium:solute symporter family
230616	Ribosomal protein L3
230621	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
230621	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
230638	Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found
230638	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
230639	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
230661	Protein kinase domain
230664	Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown
230664	Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown
230674	jmjN domain
230674	jmjC domain
230684	7 transmembrane receptor (rhodopsin family)
230690	7 transmembrane receptor (rhodopsin family)
230692	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
230692	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
230709	Peptidase family M48
230710	Palmitoyl protein thioesterase
230714	Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue
230715	Palmitoyl protein thioesterase
230715	ENV polyprotein (coat polyprotein)
230721	Poly-adenylate binding protein, unique domain
230724	Ribosomal protein S5, C-terminal domain
230724	Ribosomal protein S5, N-terminal domain
230727	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa
230727	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. These proteins contain three strongly conserved histidines in the putative trans
230735	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
230735	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
230737	GTPase of unknown function
230758	Ribosomal protein S5, C-terminal domain
230758	Ribosomal protein S5, N-terminal domain
230760	CUB domain
230760	Sushi domain (SCR repeat)
230761	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
230765	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
230771	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
230775	7 transmembrane receptor (Secretin family)
230775	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
230775	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
230775	Thrombospondin type 1 domain
230775	7 transmembrane receptor (rhodopsin family)
230775	7 transmembrane receptor (Secretin family)
230775	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
230775	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
230777	7 transmembrane receptor (rhodopsin family)
230779	TMS membrane protein/tumour differentially expressed protein (TDE)
230796	TPR Domain
230796	WD domain, G-beta repeat
230799	14-3-3 protein
230800	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
230801	Protein of unknown function (DUF409). Family of eukaryotic membrane proteins with unknown function
230801	Protein of unknown function (DUF409). Family of eukaryotic membrane proteins with unknown function
230805	'Cold-shock' DNA-binding domain
230806	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
230810	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
230815	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
230827	Proteasome A-type and B-type
230828	Tissue factor
230837	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
230837	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
230837	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
230843	MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p
230843	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
230857	Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be
230861	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA
230887	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
230890	Carboxylesterase
230891	Carboxylesterase
230896	Ribosomal L29e protein family
230899	Atrial natriuretic peptide
230903	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylate
230904	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated
230936	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
230972	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
230972	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
231002	PH domain. PH stands for pleckstrin homology
231003	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
231003	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
231006	ENV polyprotein (coat polyprotein)
231025	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
231025	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
231025	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
231031	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
231032	Cadherin domain
231032	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
231033	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
231037	Transketolase, thiamine diphosphate binding domain. This family includes transketolase enzymes EC:2.2.1.1. and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit EC:1.2.4.4. Both these enzymes utilise thiamine pyrophosphate as a cofacto
231044	Homeobox domain
231045	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
231047	ENV polyprotein (coat polyprotein)
231050	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
231051	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
231051	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
231059	AMP-binding enzyme
231075	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
231075	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
231078	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
231081	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
231086	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
231098	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
231109	LIM domain. This family represents two copies of the LIM structural domain
231109	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
231118	Homeobox domain
231125	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
231126	ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER
231134	PTB domain (IRS-1 type)
231143	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
231148	Villin headpiece domain
231148	LIM domain. This family represents two copies of the LIM structural domain
231148	Villin headpiece domain
231151	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
231157	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
231178	Mitochondrial carrier protein
231193	Ribosomal L29e protein family
231207	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
231220	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
231227	ADP-ribosylation factor family
231227	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
231234	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
231237	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
231239	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
231241	PAP/25A associated domain
231242	eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues
231248	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
231248	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
231251	Smr domain. This family includes the Smr (Small MutS Related) proteins, and the C-terminal region of the MutS2 protein. It has been suggested that this domain interacts with the MutS1 protein in the case of Smr proteins and with the N-terminal MutS relate
231252	Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
231252	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
231252	Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
231252	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
231269	Ribosomal protein L21e
231294	Ribosomal protein S5, C-terminal domain
231294	Ribosomal protein S5, N-terminal domain
231317	Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5
231317	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
231326	AMP-binding enzyme
231326	Phosphopantetheine attachment site. A 4'-phosphopantetheine prosthetic group is attached through a serine. This prosthetic group acts as a a 'swinging arm' for the attachment of activated fatty acid and amino-acid groups. This domain forms a four helix b
231326	AMP-binding enzyme
231329	RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase com
231332	Death effector domain
231343	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
231343	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
231380	Repeat in ubiquitin-activating (UBA) protein
231380	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
231381	Trypsin
231382	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
231386	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment
231386	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f
231387	UDP-glucoronosyl and UDP-glucosyl transferase
231391	UDP-glucoronosyl and UDP-glucosyl transferase
231392	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
231393	UDP-glucoronosyl and UDP-glucosyl transferase
231394	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
231396	UDP-glucoronosyl and UDP-glucosyl transferase
231396	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
231397	UDP-glucoronosyl and UDP-glucosyl transferase
231398	UDP-glucoronosyl and UDP-glucosyl transferase
231407	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
231408	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
231411	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
231413	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
231430	60Kd inner membrane protein
231444	Eukaryotic porin
231446	Cyclophilin type peptidyl-prolyl cis-trans isomerase
231462	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
231467	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
231470	Calx-beta domain
231470	von Willebrand factor type A domain
231470	Giardia variant-specific surface protein
231470	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
231470	Calx-beta domain
231474	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
231484	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
231484	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
231501	Ribosomal L29e protein family
231506	Tesmin/TSO1-like CXC domain. This family includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family is called a CXC domain in
231547	ENV polyprotein (coat polyprotein)
231552	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
231552	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
231558	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
231563	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
231563	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
231565	Zona pellucida-like domain
231571	Domain of Unknown Function (DUF408)
231573	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
231580	Protein kinase domain
231580	Protein-tyrosine phosphatase
231580	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
231583	Sulfate transporter family. Mutations may lead to several human diseases
231586	7 transmembrane receptor (metabotropic glutamate family)
231586	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
231586	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
231587	7 transmembrane receptor (metabotropic glutamate family)
231588	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
231589	7 transmembrane receptor (metabotropic glutamate family)
231589	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
231591	7 transmembrane receptor (metabotropic glutamate family)
231591	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
231594	Actin
231600	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
231602	ATP P2X receptor
231605	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
231613	Uncharacterized ACR, COG1579
231613	Methyl-accepting chemotaxis protein (MCP) signaling domain. This domain is thought to transduce the signal to CheA since it is highly conserved in very diverse MCPs
231613	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I, Apolipoprotein A-IV, Apolipoprotein E
231613	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
231613	IncA protein. Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation
231613	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
231613	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
231613	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
231655	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
231663	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
231663	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
231666	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
231707	Cyclophilin type peptidyl-prolyl cis-trans isomerase
231715	Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold
231715	Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle
231717	PH domain. PH stands for pleckstrin homology
231724	Rad9. Rad9 is required for transient cell-cycle arrests and transcriptional induction of DNA repair in response to DNA damage
231727	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
231736	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
231760	Fibronectin type III domain
231760	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
231769	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
231772	Ribosomal protein S5, C-terminal domain
231772	Ribosomal protein S5, N-terminal domain
231802	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
231803	O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine
231810	Aminotransferase class I and II
231812	7 transmembrane receptor (metabotropic glutamate family)
231812	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
231821	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
231822	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
231830	LIM domain. This family represents two copies of the LIM structural domain
231832	Domain of unknown function
231834	PX domain. PX domains bind to phosphoinositides
231836	Kinesin motor domain
231837	Core histone H2A/H2B/H3/H4
231858	DIL domain. The DIL domain has no known function
231858	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
231863	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
231864	7 transmembrane receptor (rhodopsin family)
231866	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
231869	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
231869	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
231889	G10 protein
231905	Ribosomal L18ae protein family
231914	Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase
231923	7 transmembrane receptor (metabotropic glutamate family)
231923	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
231931	GTPase of unknown function
231942	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
231949	Ribosomal protein S6e
231959	Nop14-like family. Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production
231999	PH domain. PH stands for pleckstrin homology
232028	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
232031	7 transmembrane receptor (rhodopsin family)
232031	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
232040	ENV polyprotein (coat polyprotein)
232044	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
232044	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
232045	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232047	Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018
232048	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232052	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232053	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232055	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232059	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232060	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232063	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232065	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232066	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232067	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232068	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232070	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
232072	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232073	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
232074	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232077	Fork head domain
232078	Pyridoxal-phosphate dependent enzyme. Members of this family are all pyridoxal-phosphate dependent enzymes. This family includes: serine dehydratase EC:4.2.1.13 P20132, threonine dehydratase EC:4.2.1.16, tryptophan synthase beta chain EC:4.2.1.20, threoni
232094	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
232104	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
232104	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
232110	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
232110	AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this
232118	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
232119	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
232119	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
232143	HMG (high mobility group) box
232156	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
232157	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known stru
232165	Ribosomal protein S8e
232174	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
232180	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
232190	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
232201	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
232201	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
232206	Ribosomal protein L3
232210	Uncharacterized ACR, COG2135
232211	ENV polyprotein (coat polyprotein)
232227	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
232235	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
232235	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
232237	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine posit
232259	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
232259	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
232273	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
232288	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
232288	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
232312	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
232320	Cyclophilin type peptidyl-prolyl cis-trans isomerase
232322	Actin
232327	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
232333	Sodium:neurotransmitter symporter family
232333	Sodium:neurotransmitter symporter family
232336	Lipoxygenase
232336	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
232338	Nebulin repeat
232338	LIM domain. This family represents two copies of the LIM structural domain
232338	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
232339	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
232339	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
232339	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
232345	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
232345	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
232345	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
232345	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
232354	Ribosomal protein S5, C-terminal domain
232354	Ribosomal protein S5, N-terminal domain
232357	Cyclophilin type peptidyl-prolyl cis-trans isomerase
232358	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
232360	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
232361	Apoptosis regulator proteins, Bcl-2 family
232361	7 transmembrane receptor (metabotropic glutamate family)
232363	7 transmembrane receptor (metabotropic glutamate family)
232363	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
232364	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
232367	7 transmembrane receptor (metabotropic glutamate family)
232367	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
232368	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
232371	Trypsin
232371	CUB domain
232378	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
232391	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
232391	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
232399	Ribosomal protein L21e
232400	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
232400	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
232401	VHS domain. Domain present in VPS-27, Hrs and STAM
232401	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
232404	ENV polyprotein (coat polyprotein)
232415	Lectin C-type domain. This family includes both long and short form C-type
232421	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
232431	7 transmembrane receptor (metabotropic glutamate family)
232431	7 transmembrane receptor (metabotropic glutamate family)
232433	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
232433	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
232440	Core histone H2A/H2B/H3/H4
232441	ADP-ribosylation factor family
232441	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
232465	Cullin family
232466	Cullin family
232485	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
232485	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
232487	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
232491	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
232493	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysacchari
232510	Ribosomal protein S6e
232515	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
232515	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
232527	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
232527	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
232534	Ribosomal protein S5, C-terminal domain
232534	Ribosomal protein S5, N-terminal domain
232539	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
232539	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
232539	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
232560	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
232569	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
232578	Polyprenyl synthetase
232581	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
232585	von Willebrand factor type D domain
232587	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
232587	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
232597	AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-
232598	LIM domain. This family represents two copies of the LIM structural domain
232616	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
232619	Nucleoside diphosphate kinase
232626	Ribosomal protein L21e
232637	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
232647	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
232647	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
232649	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
232665	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
232670	Tetraspanin family
232680	Zinc carboxypeptidase
232680	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol
232683	Proteasome A-type and B-type
232706	Fructose-bisphosphate aldolase class-I
232706	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
232710	PMP-22/EMP/MP20/Claudin family
232714	Trefoil (P-type) domain
232714	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
232717	Trypsin
232717	Trypsin
232718	Trypsin
232724	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232727	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232728	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232733	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
232733	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
232744	7 transmembrane receptor (rhodopsin family)
232745	Polysaccharide deacetylase. This domain is found in polysaccharide deacetylase. This family of polysaccharide deacetylases includes NodB (nodulation protein B from Rhizobium) which is a chitooligosaccharide deacetylase. It also includes chitin deacetylase
232745	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
232747	7 transmembrane receptor (rhodopsin family)
232750	7 transmembrane receptor (rhodopsin family)
232752	7 transmembrane receptor (rhodopsin family)
232758	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232759	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
232762	7 transmembrane receptor (rhodopsin family)
232774	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
232776	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
232780	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea
232781	Ribosomal protein L31e
232782	Ribosomal protein L35Ae
232784	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
232786	Ribosomal protein S5, N-terminal domain
232787	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
232791	Not1 N-terminal domain, CCR4-Not complex component
232791	NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5
232798	SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1
232804	ATP synthase subunit C
232815	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
232815	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
232822	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
232823	Actin
232825	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
232826	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
232828	7 transmembrane receptor (metabotropic glutamate family)
232828	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
232838	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
232852	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
232854	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
232860	Protein kinase domain
232860	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
232862	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
232862	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
232862	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
232867	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
232867	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
232868	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
232871	7 transmembrane receptor (metabotropic glutamate family)
232871	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
232873	7 transmembrane receptor (metabotropic glutamate family)
232873	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
232875	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
232878	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
232878	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
232879	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
232879	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
232883	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
232883	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
232886	ENV polyprotein (coat polyprotein)
232887	WD domain, G-beta repeat
232890	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
232892	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
232895	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
232899	Homeobox domain
232900	Homeobox domain
232901	Homeobox domain
232910	Clathrin adaptor complex small chain
232910	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
232920	Cyclophilin type peptidyl-prolyl cis-trans isomerase
232923	ENV polyprotein (coat polyprotein)
232923	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
232926	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
232926	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
232930	GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
232930	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
232937	7 transmembrane receptor (Secretin family)
232941	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
232944	Kinase associated domain 1
232947	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
232953	7 transmembrane receptor (rhodopsin family)
232953	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
232954	Intermediate filament protein
232958	Protein kinase domain
232959	7 transmembrane receptor (rhodopsin family)
232959	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
232961	7 transmembrane receptor (rhodopsin family)
232961	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
232962	7 transmembrane receptor (rhodopsin family)
232962	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
232963	Protein kinase domain
232974	Ets-domain
232978	Protein kinase domain
232984	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to Glc
232993	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
232994	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
232995	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
232998	Ubiquitin carboxyl-terminal hydrolases family 2
233003	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
233005	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
233006	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
233007	ENV polyprotein (coat polyprotein)
233011	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
233011	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
233017	Phosphoglycerate mutase family
233017	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
233022	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
233022	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
233024	Protein kinase domain
233024	Octicosapeptide repeat. Short motif that may bind Ca2+
233024	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
233033	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
233033	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
233033	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
233038	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated
233040	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated
233046	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
233053	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
233055	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
233055	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
233057	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
233061	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
233067	Fibronectin type III domain
233067	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
233067	Fibronectin type III domain
233067	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
233067	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
233069	Protein kinase domain
233071	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
233071	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
233075	Ribosomal L18ae protein family
233079	7 transmembrane receptor (rhodopsin family)
233080	7 transmembrane receptor (rhodopsin family)
233081	7 transmembrane receptor (rhodopsin family)
233086	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
233087	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
233089	Ribosomal protein L23
233098	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
233107	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
233111	CUB domain
233121	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
233121	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
233131	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
233147	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
233166	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
233168	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
233169	ENV polyprotein (coat polyprotein)
233169	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
233173	Fusion glycoprotein F0
233173	ENV polyprotein (coat polyprotein)
233173	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
233174	HMG (high mobility group) box
233176	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
233177	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
233177	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
233179	7 transmembrane receptor (metabotropic glutamate family)
233179	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
233181	ENV polyprotein (coat polyprotein)
233181	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
233182	Ribosomal L18ae protein family
233183	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
233184	Spumavirus gag protein
233184	Granin (chromogranin or secretogranin)
233184	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p
233184	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
233187	PMP-22/EMP/MP20/Claudin family
233187	PMP-22/EMP/MP20/Claudin family
233189	Uncharacterized protein family UPF0021
233190	Trypsin
233196	IMP4 family. This family includes IMP4, which is involved ribosomal RNA processing
233220	7 transmembrane receptor (rhodopsin family)
233221	7 transmembrane receptor (rhodopsin family)
233222	7 transmembrane receptor (rhodopsin family)
233224	7 transmembrane receptor (rhodopsin family)
233227	7 transmembrane receptor (rhodopsin family)
233230	7 transmembrane receptor (rhodopsin family)
233230	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
233231	7 transmembrane receptor (rhodopsin family)
233249	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
233249	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
233254	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
233254	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
233256	ENV polyprotein (coat polyprotein)
233260	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
233272	Ribosomal protein S7e
233276	Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin
233281	Regulator of chromosome condensation (RCC1)
233281	Anaphase-promoting complex, subunit 10 (APC10)
233281	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
233281	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
233297	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
233298	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
233298	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
233298	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
233330	Homeobox domain
233330	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
233332	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
233335	Intermediate filament protein
233336	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
233336	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
233352	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
233352	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
233355	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
233355	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
233401	Uracil DNA glycosylase superfamily
233405	Sec1 family
233405	Sec1 family
233406	Microtubule associated protein (MAP65/ASE1 family)
233412	Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidine
233412	Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins. One member is a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane. One member is a a Na+-dependent nucleoside transporter se
233426	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
233426	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
233427	Ribosomal protein S28e
233436	7 transmembrane receptor (rhodopsin family)
233437	7 transmembrane receptor (metabotropic glutamate family)
233437	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
233438	7 transmembrane receptor (metabotropic glutamate family)
233439	7 transmembrane receptor (metabotropic glutamate family)
233440	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
233441	7 transmembrane receptor (metabotropic glutamate family)
233443	7 transmembrane receptor (metabotropic glutamate family)
233443	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
233444	7 transmembrane receptor (metabotropic glutamate family)
233445	7 transmembrane receptor (metabotropic glutamate family)
233445	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
233448	7 transmembrane receptor (metabotropic glutamate family)
233448	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
233452	7 transmembrane receptor (metabotropic glutamate family)
233452	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
233453	7 transmembrane receptor (rhodopsin family)
233453	7 transmembrane receptor (metabotropic glutamate family)
233453	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
233454	7 transmembrane receptor (rhodopsin family)
233457	7 transmembrane receptor (rhodopsin family)
233462	7 transmembrane receptor (rhodopsin family)
233465	7 transmembrane receptor (rhodopsin family)
233466	7 transmembrane receptor (metabotropic glutamate family)
233466	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
233469	7 transmembrane receptor (metabotropic glutamate family)
233469	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
233471	Ribosomal family S4e
233476	Ribosomal protein S17
233479	Ribosomal protein S17
233485	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
233485	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
233489	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
233493	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
233493	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
233508	14-3-3 protein
233529	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
233531	Ribosomal L15
233532	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
233537	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
233537	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be
233542	ENV polyprotein (coat polyprotein)
233545	ENT domain. This presumed domain is named after Emsy N Terminus (ENT). Emsy is a protein that is amplified in breast cancer and interacts with BRCA2. The N terminus of this protein is found to be similar to other vertebrate and plant proteins of unknown
233549	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
233549	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
233552	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be
233564	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
233564	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
233566	Ribosomal protein S8e
233571	7 transmembrane receptor (rhodopsin family)
233576	7 transmembrane receptor (rhodopsin family)
233578	7 transmembrane receptor (rhodopsin family)
233587	7 transmembrane receptor (rhodopsin family)
233592	7 transmembrane receptor (rhodopsin family)
233595	7 transmembrane receptor (rhodopsin family)
233596	7 transmembrane receptor (rhodopsin family)
233600	7 transmembrane receptor (rhodopsin family)
233601	7 transmembrane receptor (rhodopsin family)
233609	7 transmembrane receptor (rhodopsin family)
233613	7 transmembrane receptor (rhodopsin family)
233616	7 transmembrane receptor (rhodopsin family)
233620	7 transmembrane receptor (rhodopsin family)
233634	Ribonucleotide reductase, small chain
233635	7 transmembrane receptor (rhodopsin family)
233636	7 transmembrane receptor (rhodopsin family)
233636	Ubiquitin carboxyl-terminal hydrolase family 2
233636	Ubiquitin carboxyl-terminal hydrolases family 2
233636	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
233637	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
233642	7 transmembrane receptor (rhodopsin family)
233644	7 transmembrane receptor (rhodopsin family)
233645	7 transmembrane receptor (rhodopsin family)
233646	7 transmembrane receptor (rhodopsin family)
233649	Cyclic nucleotide-binding domain
233649	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
233650	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
233651	Cadherin domain
233665	7 transmembrane receptor (rhodopsin family)
233666	7 transmembrane receptor (rhodopsin family)
233669	'chromo' (CHRromatin Organization MOdifier) domain
233669	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
233670	7 transmembrane receptor (rhodopsin family)
233681	7 transmembrane receptor (rhodopsin family)
233688	Cyclophilin type peptidyl-prolyl cis-trans isomerase
233690	7 transmembrane receptor (rhodopsin family)
233695	7 transmembrane receptor (rhodopsin family)
233695	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
233699	7 transmembrane receptor (rhodopsin family)
233708	7 transmembrane receptor (rhodopsin family)
233709	7 transmembrane receptor (rhodopsin family)
233711	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
233712	7 transmembrane receptor (rhodopsin family)
233721	Aminotransferase class-V
233726	Importin-beta N-terminal domain
233726	Importin-beta N-terminal domain
233733	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
233733	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
233735	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
233744	Reeler domain
233744	Thrombospondin type 1 domain
233747	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
233781	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
233781	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
233789	FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal
233789	Bunyavirus glycoprotein G1. Bunyavirus has three genomic segments: small (S), middle-sized (M), and large (L). The S segment encodes the nucleocapsid and a non-structural protein. The M segment codes for two glycoproteins, G1 and G2, and another non-struc
233801	AMP-binding enzyme
233802	pfam02926, THUMP, THUMP domain. The THUMP domain is named after after thiouridine synthases, methylases and PSUSs. The THUMP domain consists of about 110 amino acid residues. It is predicted that this domain is an RNA-binding domain that adopts an alpha/
233805	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
233808	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
233810	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
233813	von Willebrand factor type A domain
233815	Cyclophilin type peptidyl-prolyl cis-trans isomerase
233830	Tubulin binding cofactor A
233832	Zinc finger, C3HC4 type (RING finger)
233832	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
233836	Sodium:solute symporter family
233858	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
233858	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
233859	Glutamate/Leucine/Phenylalanine/Valine dehydrogenase
233861	Ribosomal protein S7e
233877	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
233878	Sushi domain (SCR repeat)
233885	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea
233888	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
233890	Vinculin family
233892	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
233902	F-box domain
233902	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
233904	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET dom
233904	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
233908	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
233910	B-box zinc finger
233910	Zinc finger, C3HC4 type (RING finger)
233910	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
233919	7 transmembrane receptor (rhodopsin family)
233921	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
233926	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
233926	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
233938	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
233952	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
233952	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
233960	Ribosomal protein L21e
233977	Uncharacterized ACR, COG1579
233977	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
233977	Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length
233977	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
233979	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
233987	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
233991	Ribosomal L22e protein family
233992	Ribosomal protein L21e
233995	Phosphoglucose isomerase. Phosphoglucose isomerase catalyses the interconversion of glucose-6-phosphate and fructose-6-phosphate
234004	Eukaryotic porin
234072	ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisi
234073	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
234097	Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5
234097	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
234100	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
234107	CUB domain
234120	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
234129	HMG14 and HMG17
234132	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
234135	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
234158	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
234158	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
234187	Eukaryotic porin
234188	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
234188	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
234190	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
234190	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
234201	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
234201	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
234209	Cyclophilin type peptidyl-prolyl cis-trans isomerase
234214	Sorbin homologous domain
234214	Sorbin homologous domain
234214	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
234217	Actin
234218	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
234219	Helix-loop-helix DNA-binding domain
234219	Helix-loop-helix DNA-binding domain
234247	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
234247	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
234249	ENV polyprotein (coat polyprotein)
234254	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
234267	Myelin proteolipid protein (PLP or lipophilin)
234286	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
234292	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
234309	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
234313	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
234319	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
234319	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
234321	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
234329	B-box zinc finger
234333	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
234353	PH domain. PH stands for pleckstrin homology
234358	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
234362	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
234365	YjeF-related protein N-terminus
234373	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
234373	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
234374	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
234374	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
234378	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
234378	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
234378	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
234378	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
234384	Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is
234400	14-3-3 protein
234404	SCO1/SenC. This family is involved in biogenesis of respiratory and photosynthetic systems. SCO1 is required for a post-translational step in the accumulation of subunits COXI and COXII of cytochrome c oxidase. SenC is required for optimal cytochrome c o
234407	Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of va
234415	7 transmembrane receptor (rhodopsin family)
234417	7 transmembrane receptor (rhodopsin family)
234419	Tropomyosin
234419	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
234425	Intermediate filament protein
234427	Sushi domain (SCR repeat)
234429	Protein of unknown function, DUF259
234438	Phosphoribulokinase / Uridine kinase family
234440	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
234440	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
234441	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
234445	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
234448	Ribosomal protein L13e
234456	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
234460	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
234463	Domain of unknown function
234474	Ribosomal protein L21e
234483	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
234486	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction a
234501	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
234501	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
234505	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
234505	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
234520	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
234520	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
234542	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anc
234546	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
234550	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
234563	Carboxylesterase
234564	Carboxylesterase
234576	Clathrin adaptor complex small chain
234576	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
234577	C2 domain
234593	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known t
234594	CCR4-Not complex component, Not1. The Ccr4-Not complex is a global regulator of transcription that affects genes positively and negatively and is thought to regulate transcription factor TFIID
234595	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
234611	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
234612	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
234612	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
234622	DNA directed RNA polymerase, 7 kDa subunit
234628	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
234628	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
234628	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
234628	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
234635	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
234640	Ribosomal protein L10
234663	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
234664	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
234667	Carboxylesterase
234668	Carboxylesterase
234669	Carboxylesterase
234671	Carboxylesterase
234672	Carboxylesterase
234673	Carboxylesterase
234675	Carboxylesterase
234676	Carboxylesterase
234677	Carboxylesterase
234678	Uncharacterised protein family (UPF0183). This family of proteins includes Lin-10 from C. elegans
234684	F-box domain
234690	Ribosomal protein S5, C-terminal domain
234690	Ribosomal protein S5, N-terminal domain
234699	WD domain, G-beta repeat
234702	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
234703	Ribosomal L10
234703	Ribosomal protein L16
234723	Mitosis protein DIM1
234723	Mitosis protein DIM1
234723	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
234725	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
234728	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in compl
234751	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
234779	SH2 domain
234779	PH domain. PH stands for pleckstrin homology
234779	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
234793	Adenosine-deaminase (editase) domain
234796	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
234796	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
234826	ENV polyprotein (coat polyprotein)
234848	Ribosomal protein L13e
234852	Eukaryotic protein of unknown function, DUF279
234854	Protein kinase domain
234854	Uncharacterized protein family (ORF7) DUF. Several members of this family are Borrelia burgdorferi plasmid proteins of uncharacterized function
234858	ENV polyprotein (coat polyprotein)
234858	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
234858	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
234858	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
234865	Nup133 nucleoporin. RNA undergoing nuclear export first encounters the basket of the nuclear pore. Nup133 is a nucleoporin accessible on the basket side of the pore
234877	Transcription factor TFIIB repeat
234878	Protein kinase domain
234882	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
234885	7 transmembrane receptor (rhodopsin family)
234886	ENV polyprotein (coat polyprotein)
234886	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
234886	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
234886	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
234886	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
234886	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
234886	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
234889	Adenylate and Guanylate cyclase catalytic domain
234892	ENV polyprotein (coat polyprotein)
234893	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
234897	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
234902	Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II
234902	pfam02879, PGM_PMM_II, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II
234921	ENV polyprotein (coat polyprotein)
234922	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
234928	ENV polyprotein (coat polyprotein)
234929	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
234929	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
234929	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
234930	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
234930	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
234930	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
234932	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
234939	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
234955	Macrophage migration inhibitory factor (MIF)
234961	ENV polyprotein (coat polyprotein)
234967	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
234973	Cadherin domain
234981	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
234987	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
234997	7 transmembrane receptor (rhodopsin family)
235005	7 transmembrane receptor (rhodopsin family)
235019	7 transmembrane receptor (rhodopsin family)
235019	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
235019	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
235021	7 transmembrane receptor (rhodopsin family)
235025	7 transmembrane receptor (rhodopsin family)
235026	7 transmembrane receptor (rhodopsin family)
235033	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
235036	Second step splicing factor 1. SSF is thought to bind rRNA and similarity with peter pan in Drosophila suggests a role in cell proliferation
235040	Peptidase family C54
235044	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
235048	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
235072	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
235075	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
235075	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
235078	ENV polyprotein (coat polyprotein)
235086	Fibronectin type III domain
235086	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
235087	Protein kinase domain
235104	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
235106	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
235130	Thrombospondin type 1 domain
235130	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
235132	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
235145	Triosephosphate isomerase
235146	Ribosomal protein L21e
235175	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
235186	7 transmembrane receptor (rhodopsin family)
235195	7 transmembrane receptor (rhodopsin family)
235196	7 transmembrane receptor (rhodopsin family)
235206	7 transmembrane receptor (rhodopsin family)
235207	7 transmembrane receptor (rhodopsin family)
235221	7 transmembrane receptor (rhodopsin family)
235226	7 transmembrane receptor (rhodopsin family)
235231	7 transmembrane receptor (rhodopsin family)
235234	7 transmembrane receptor (rhodopsin family)
235235	Actin
235236	7 transmembrane receptor (rhodopsin family)
235237	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
235237	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
235237	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
235241	7 transmembrane receptor (rhodopsin family)
235243	7 transmembrane receptor (rhodopsin family)
235246	7 transmembrane receptor (rhodopsin family)
235247	7 transmembrane receptor (rhodopsin family)
235248	7 transmembrane receptor (rhodopsin family)
235254	7 transmembrane receptor (rhodopsin family)
235255	7 transmembrane receptor (rhodopsin family)
235256	7 transmembrane receptor (rhodopsin family)
235257	7 transmembrane receptor (rhodopsin family)
235260	7 transmembrane receptor (rhodopsin family)
235266	7 transmembrane receptor (rhodopsin family)
235277	7 transmembrane receptor (rhodopsin family)
235280	7 transmembrane receptor (rhodopsin family)
235283	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
235293	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues.
235307	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
235320	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
235320	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
235323	Ubiquitin carboxyl-terminal hydrolase family 2
235323	Ubiquitin carboxyl-terminal hydrolase family 2
235323	Ubiquitin carboxyl-terminal hydrolases family 2
235336	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
235336	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
235339	2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain
235341	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
235344	Protein kinase domain
235353	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
235353	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
235364	NAC domain
235379	Olfactomedin-like domain
235379	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
235379	Olfactomedin-like domain
235379	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
235384	ADP-ribosylation factor family
235384	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
235384	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
235385	Aconitase family (aconitate hydratase)
235385	Aconitase C-terminal domain. Members of this family usually also match to pfam00330. This domain undergoes conformational change in the enzyme mechanism
235386	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
235392	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
235392	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
235394	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
235406	PX domain. PX domains bind to phosphoinositides
235406	PX domain. PX domains bind to phosphoinositides
235406	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
235412	Ribosomal protein S8e
235424	Ribosomal protein L5
235424	ribosomal L5P family C-terminus. This region is found associated with pfam00281
235427	GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
235427	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
235435	Glycosyl hydrolase family 1
235440	WD domain, G-beta repeat
235440	Regulator of chromosome condensation (RCC1)
235440	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
235442	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
235444	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
235444	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
235459	Transcription initiation factor IIA, gamma subunit, beta-barrel domain. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIA (TFIIA) is a multi
235459	Transcription initiation factor IIA, gamma subunit, helical domain. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIA (TFIIA) is a multimeri
235472	Fibronectin type III domain
235472	Fibronectin type III domain
235472	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
235474	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
235480	C2 domain
235486	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
235497	Daxx Family. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis, but precise role played by Daxx remians to be determined
235497	Leo1-like protein. Members of this family are part of the Paf1/RNA polymerase II complex. The Paf1 complex probably functions during the elongation phase of transcription
235501	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
235505	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
235505	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
235506	Cyclophilin type peptidyl-prolyl cis-trans isomerase
235509	Fructose-bisphosphate aldolase class-I
235513	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
235517	HMG (high mobility group) box
235523	Ribosomal protein L6
235523	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
235526	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved
235527	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involve
235527	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved
235533	FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain
235533	FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain
235534	Histidine acid phosphatase
235554	ADP-ribosylation factor family
235554	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
235555	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
235559	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
235559	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
235567	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
235574	E1-E2 ATPase
235574	Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
235574	Cation transporting ATPase, C-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
235574	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
235575	Ribosomal L10
235575	Ribosomal protein L16
235580	von Willebrand factor type A domain
235581	Ribosomal protein L6e
235584	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
235587	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ri
235587	pfam02877, PARP_reg, Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affini
235611	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
235611	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
235612	F-box domain
235623	WD domain, G-beta repeat
235623	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
235624	Protein-tyrosine phosphatase
235626	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
235633	Vacuolar sorting protein 9 (VPS9) domain
235636	Microtubule associated protein (MAP65/ASE1 family)
235641	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
235663	Ribosomal protein S5, N-terminal domain
235663	Ribosomal protein S5, C-terminal domain
235680	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
235693	7 transmembrane receptor (rhodopsin family)
235712	7 transmembrane receptor (rhodopsin family)
235714	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
235718	Protein kinase domain
235741	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
235748	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
235748	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
235758	GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
235758	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
235759	6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each
235762	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
235790	Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family
235795	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
235796	Defensin propeptide
235800	Interferon alpha/beta domain
235805	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
235806	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
235812	Phosphoglycerate mutase family
235812	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
235820	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
235841	Homeobox domain
235847	7 transmembrane receptor (rhodopsin family)
235852	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
235854	7 transmembrane receptor (rhodopsin family)
235856	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
235869	7 transmembrane receptor (rhodopsin family)
235870	7 transmembrane receptor (rhodopsin family)
235872	7 transmembrane receptor (metabotropic glutamate family)
235876	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
235881	Protein kinase domain
235882	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
235887	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
235890	Sugar (and other) transporter
235891	Sulfotransferase protein
235896	Ribosomal protein S19e
235896	Domain of unknown function DUF128. This archaebacterial protein family has no known function. The domain is found duplicated in some members
235897	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
235900	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
235904	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
235904	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
235904	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
235904	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
235908	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
235909	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
235910	7 transmembrane receptor (metabotropic glutamate family)
235910	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
235916	7 transmembrane receptor (metabotropic glutamate family)
235917	Ribosomal protein L3
235923	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
235925	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
235927	7 transmembrane receptor (metabotropic glutamate family)
235927	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
235928	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
235932	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
235932	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
235934	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
235938	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
235949	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
235950	7 transmembrane receptor (rhodopsin family)
235950	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
235952	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
235953	14-3-3 protein
235956	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
235962	7 transmembrane receptor (rhodopsin family)
235964	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
235966	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
235976	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
235979	Ribosomal protein S5, C-terminal domain
235979	Ribosomal protein S5, N-terminal domain
235981	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
235982	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
235984	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
235985	HMG (high mobility group) box
235992	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
235992	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
235994	7 transmembrane receptor (metabotropic glutamate family)
235997	7 transmembrane receptor (metabotropic glutamate family)
236010	Uncharacterised protein family (UPF0083)
236011	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
236011	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
236024	7 transmembrane receptor (rhodopsin family)
236025	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
236032	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
236033	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
236035	Polyprenyl synthetase
236039	Ribosomal protein S7e
236047	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
236060	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
236066	Ribosomal protein L10
236068	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
236077	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
236079	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
236082	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
236111	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
236117	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
236127	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
236133	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
236137	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
236145	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
236149	Sugar (and other) transporter
236150	Protein kinase domain
236150	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
236151	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
236152	ENV polyprotein (coat polyprotein)
236160	Ribosomal protein S7e
236164	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
236182	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
236193	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
236196	Sulfotransferase protein
236207	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
236210	Somatotropin hormone family
236212	7 transmembrane receptor (metabotropic glutamate family)
236214	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
236217	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
236223	Proteasome A-type and B-type
236225	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
236226	Protein kinase domain
236227	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
236228	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
236232	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
236232	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
236238	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
236251	Palmitoyl protein thioesterase
236271	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
236271	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
236272	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
236286	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
236293	Sugar (and other) transporter
236294	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
236300	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
236300	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
236302	7 transmembrane receptor (rhodopsin family)
236302	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
236304	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
236304	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
236307	HMG (high mobility group) box
236312	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
236312	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
236317	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
236321	ENV polyprotein (coat polyprotein)
236325	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
236334	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
236334	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
236342	Protein kinase domain
236347	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
236347	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
236359	Eukaryotic initiation factor 1A
236364	ENV polyprotein (coat polyprotein)
236364	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
236364	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
236364	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
236369	Ribosomal protein L21e
236370	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
236372	7 transmembrane receptor (metabotropic glutamate family)
236393	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
236394	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
236395	Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family
236401	7 transmembrane receptor (metabotropic glutamate family)
236407	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
236413	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
236415	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
236416	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
236420	Disintegrin
236420	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
236420	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
236427	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
236428	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
236428	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
236437	7 transmembrane receptor (rhodopsin family)
236437	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
236452	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
236452	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
236456	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
236463	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
236469	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
236471	Somatotropin hormone family
236473	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
236476	Cadherin domain
236479	Tropomyosin
236490	7 transmembrane receptor (rhodopsin family)
236509	7 transmembrane receptor (rhodopsin family)
236511	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
236511	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
236517	ENV polyprotein (coat polyprotein)
236518	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
236520	Uncharacterized ACR, COG1579
236524	Ribosomal protein L6e
236526	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
236533	Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown
236534	Ribosomal protein L21e
236535	7 transmembrane receptor (rhodopsin family)
236535	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
236552	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
236554	Ribosomal protein L21e
236557	Zinc carboxypeptidase
236565	Ribosomal protein L3
236573	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
236573	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
236573	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
236573	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
236574	Protein kinase domain
236579	Ribosomal protein S7e
236583	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
236605	Ribosomal protein S7e
236612	Ribosomal protein L3
236622	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
236627	Proteasome A-type and B-type
236629	Actin
236632	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
236640	ENV polyprotein (coat polyprotein)
236643	C2 domain
236643	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
236643	C2 domain
236644	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
236649	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
236650	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
236651	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
236657	Cyclophilin type peptidyl-prolyl cis-trans isomerase
236663	Core histone H2A/H2B/H3/H4
236663	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
236676	Cyclophilin type peptidyl-prolyl cis-trans isomerase
236682	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
236683	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
236690	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
236698	Ribosomal protein L6e
236702	Skp1 family, tetramerisation domain
236704	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
236704	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
236720	Zinc-binding dehydrogenase
236721	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
236725	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
236727	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
236727	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
236729	Core histone H2A/H2B/H3/H4
236730	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
236737	Sodium / potassium ATPase beta chain
236742	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
236742	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
236747	Zn-finger in Ran binding protein and others
236747	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
236748	Cyclophilin type peptidyl-prolyl cis-trans isomerase
236749	LIM domain. This family represents two copies of the LIM structural domain
236750	ENV polyprotein (coat polyprotein)
236762	Homeobox domain
236765	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
236766	Ribosomal protein S2
236767	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
236770	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
236774	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
236775	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
236775	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
236778	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
236781	7 transmembrane receptor (rhodopsin family)
236781	7 transmembrane receptor (rhodopsin family)
236784	7 transmembrane receptor (rhodopsin family)
236785	7 transmembrane receptor (rhodopsin family)
236786	Ribosomal protein L36e
236787	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
236787	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
236794	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
236798	7 transmembrane receptor (Secretin family)
236798	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
236799	7 transmembrane receptor (Secretin family)
236805	Fructose-bisphosphate aldolase class-I
236807	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
236820	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
236823	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
236831	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
236842	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
236843	Ribosomal protein L21e
236844	Ribosomal L22e protein family
236845	RanBP1 domain
236846	Eukaryotic ribosomal protein L18
236852	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
236858	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
236863	Ribosomal protein S5, C-terminal domain
236863	Ribosomal protein S5, N-terminal domain
236864	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
236866	7 transmembrane receptor (rhodopsin family)
236868	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
236869	Ribosomal protein S19
236871	HMG (high mobility group) box
236872	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
236874	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
236875	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
236876	ENV polyprotein (coat polyprotein)
236876	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
236876	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
236876	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
236876	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
236876	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
236882	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
236882	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
236882	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
236882	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
236882	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
236882	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
236884	Ribosomal L10
236887	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
236887	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
236891	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
236892	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
236893	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
236899	Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase
236900	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
236904	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
236912	Ribosomal protein L13e
236913	Proteasome A-type and B-type
236913	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
236914	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
236915	PH domain. PH stands for pleckstrin homology
236915	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
236915	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
236918	Hsp90 protein
236920	START domain
236920	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
236922	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
236928	Translationally controlled tumor protein
236929	HMG (high mobility group) box
236932	Ribosomal protein S6e
236961	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
236968	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
236969	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
236973	Zinc finger, C3HC4 type (RING finger)
236973	YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named
236973	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
236975	NAD-dependent glycerol-3-phosphate dehydrogenase
236978	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
236978	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
236983	S-adenosyl-L-homocysteine hydrolase
236986	Fructose-bisphosphate aldolase class-I
236991	Hsp90 protein
236994	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
236994	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
237004	HMG (high mobility group) box
237005	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
237008	Phosphorylase family 2
237009	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
237010	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
237012	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
237012	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
237012	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
237014	metallopeptidase family M24
237016	Pyruvate kinase, alpha/beta domain
237016	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
237016	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
237018	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
237024	Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration
237025	DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh
237025	DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds
237025	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
237029	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
237029	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
237030	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
237032	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
237035	Fructose-bisphosphate aldolase class-I
237036	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
237038	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
237050	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
237068	Ribosomal family S4e
237081	WD domain, G-beta repeat
237083	Ribosomal protein L14p/L23e
237085	Ribosomal protein L10
237093	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids, some interacts with c-Myc and repress its transcriptional activity. Suggesting that other members of this family may also regulate transcription by intera
237093	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription b
237096	Actin
237105	Ribosomal protein L13e
237105	Sodium:neurotransmitter symporter family
237107	GTPase of unknown function
237107	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
237108	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
237115	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
237116	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
237119	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
237119	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
237124	Protein kinase domain
237130	Fork head domain
237132	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
237132	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
237140	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
237155	Fructose-bisphosphate aldolase class-I
237156	Ribosomal protein L6
237159	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domai
237162	Ribosomal protein S2
237172	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
237175	7 transmembrane receptor (Secretin family)
237175	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
237175	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are
237178	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
237178	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
237188	Protein kinase domain
237200	Helix-loop-helix DNA-binding domain
237201	Ribosomal L29e protein family
237212	Macrophage migration inhibitory factor (MIF)
237213	Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
237213	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
237213	Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
237213	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
237228	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
237241	Core histone H2A/H2B/H3/H4
237250	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
237252	B-box zinc finger
237275	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
237275	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
237284	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
237285	Ribosomal protein L23
237285	Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a
237295	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
237296	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
237300	Protein kinase domain
237303	Triosephosphate isomerase
237305	Enolase, N-terminal domain
237305	Enolase, C-terminal TIM barrel domain
237310	Tissue factor
237310	Tissue factor
237313	Tissue factor
237319	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
237319	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
237320	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
237320	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
237321	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
237323	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
237331	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
237331	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
237335	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
237335	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
237335	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
237342	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
237343	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
237343	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
237360	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
237360	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
237361	Ribosomal protein L15
237362	7 transmembrane receptor (rhodopsin family)
237369	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
237369	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
237369	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
237374	Sodium:solute symporter family
237374	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
237391	Protein-tyrosine phosphatase
237391	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
237396	ENV polyprotein (coat polyprotein)
237396	7 transmembrane receptor (metabotropic glutamate family)
237396	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
237396	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
237396	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
237396	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
237396	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
237397	C2 domain
237403	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
237408	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
237409	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
237410	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
237410	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
237412	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
237415	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
237415	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
237415	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
237415	IMP dehydrogenase / GMP reductase C terminus. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is
237427	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
237432	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
237433	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
237436	Growth-Arrest-Specific Protein 2 Domain
237436	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
237438	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
237438	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
237448	Translation initiation factor SUI1
237459	Protein kinase domain
237482	Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast member of this family
237482	pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast mem
237483	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
237483	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
237499	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
237500	TPR Domain
237501	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
237512	Ribosomal protein S8
237518	AMP-binding enzyme
237523	L1 transposable element
237523	Fibronectin type III domain
237523	Protein-tyrosine phosphatase
237523	von Willebrand factor type D domain
237523	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
237536	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
237542	Oxysterol-binding protein
237542	PH domain. PH stands for pleckstrin homology
237546	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
237547	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
237549	SCP-like extracellular protein
237549	SCP-like extracellular protein. This domain is also found in prokaryotes
237553	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
237564	Kinesin motor domain
237564	Intermediate filament protein
237564	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
237577	Ribosomal protein S26e
237581	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
237604	Ribosomal protein S19e
237611	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
237611	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
237611	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
237612	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
237626	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
237630	PH domain. PH stands for pleckstrin homology
237636	AcrB/AcrD/AcrF family. Members of this family are integral membrane proteins. Some are involved in drug resistance
237636	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
237666	Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold
237666	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
237668	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
237696	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
237699	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
237703	Orotidine 5'-phosphate decarboxylase / HUMPS family. This family includes Orotidine 5'-phosphate decarboxylase enzymes EC:4.1.1.23 that are involved in the final step of pyrimidine biosynthesis. The family also includes enzymes such as hexulose-6-phosphat
237704	ENV polyprotein (coat polyprotein)
237704	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
237704	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
237704	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
237711	WD domain, G-beta repeat
237720	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
237730	Fibrillarin
237751	GTPase of unknown function
237751	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
237755	7 transmembrane receptor (rhodopsin family)
237757	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
237761	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
237761	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
237765	Ribosomal protein S6e
237767	7 transmembrane receptor (rhodopsin family)
237769	7 transmembrane receptor (rhodopsin family)
237769	C. elegans Sre G protein-coupled chemoreceptor. Caenorhabditis elegans Sre proteins are candidate chemosensory receptors. There are four main recognized groups of such receptors: Odr-10, Sra, Sro, and Srg. Sre (this family), Sra pfam02117 and Srb pfam0217
237769	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
237769	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
237771	B-box zinc finger
237771	Zinc finger, C3HC4 type (RING finger)
237771	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
237775	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
237780	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
237780	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
237806	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
237818	PMP-22/EMP/MP20/Claudin family
237823	AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt
237828	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
237831	Sodium:sulfate symporter transmembrane region. Some members in this family belong to the subfamily SODIT1
237831	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
237837	7 transmembrane receptor (rhodopsin family)
237837	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
237837	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
237880	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
237889	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
237891	Growth-Arrest-Specific Protein 2 Domain
237891	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
237898	Ubiquitin carboxyl-terminal hydrolase family 2
237909	Ribosomal protein L21e
237926	Oxygen-independent Coproporphyrinogen III oxidase. This family of bacterial enzymes catalyses the anaerobic transformation of coproporphyrinogen III to protoporphyrinogen IX required for porphyrin biosynthesis
237930	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
237934	Intermediate filament protein
237934	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
237940	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
237940	Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
237940	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge
237958	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
237958	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
237960	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
237987	Protein of unknown function, DUF270
237989	B-box zinc finger
237989	Zinc finger, C3HC4 type (RING finger)
237989	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
238011	Sulfatase
238011	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
238023	Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold
238024	Fructosamine kinase. This family includes eukaryotic fructosamine-3-kinase enzymes. The family also includes bacterial members that have not been characterised but probably have a similar or identical function
238024	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
238029	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
238057	Transforming growth factor beta like domain
238057	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
238074	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
238076	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
238076	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
238091	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
238091	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
238094	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
238096	Ribosomal protein L13e
238096	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
238096	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
238099	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
238119	Pyruvate kinase, alpha/beta domain
238119	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
238119	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
238143	HMG (high mobility group) box
238153	Ribosomal protein S17
238162	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
238174	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
238176	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Membe
238198	Ribosomal protein L21e
238199	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
238205	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
238217	Ribosomal L10
238217	Ribosomal protein L16
238220	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238251	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
238252	7 transmembrane receptor (rhodopsin family)
238252	7 transmembrane receptor (rhodopsin family)
238257	LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in resp
238257	Eukaryotic protein of unknown function, DUF284. Members of this family have no known function. They have been predicted to contain transmembrane helices
238257	LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo
238266	C2 domain
238271	Cyclic nucleotide-binding domain
238271	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
238271	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
238276	Protein kinase A anchor
238294	Galactoside-binding lectin
238294	Ribosomal L18ae protein family
238320	Transposase. Transposase proteins are necessary for efficient DNA transposition. This family consists of various E. coli insertion elements and other bacterial transposases some of which are members of the IS3 family
238323	MIT domain
238324	Ribosomal protein S12
238329	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
238331	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
238333	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
238335	Eukaryotic initiation factor 1A
238338	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
238350	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
238351	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
238351	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
238351	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
238353	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps
238354	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
238377	7 transmembrane receptor (rhodopsin family)
238379	Kinesin motor domain
238379	Uncharacterized ACR, COG1579
238379	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
238379	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
238384	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
238386	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
238393	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
238394	GTP1/OBG family
238394	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
238394	TGS domain. The TGS domain is named after ThrRS, GTPase, and SpoT. Interestingly, TGS domain was detected also at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochae
238395	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
238400	S25 ribosomal protein
238406	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
238406	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
238409	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238410	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238412	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238413	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238414	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238418	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238420	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238424	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238425	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238427	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238433	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238437	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238440	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238443	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238444	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238447	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238448	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
238462	ENV polyprotein (coat polyprotein)
238463	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
238463	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
238465	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
238467	Carbohydrate phosphorylase. The members of this family catalyse the formation of glucose 1-phosphate from one of the following polyglucoses
238489	AMP-binding enzyme
238505	Vitamin B12 dependent methionine synthase, activation domain
238505	B12 binding domain. This domain binds to B12 (adenosylcobamide), it is found in several enzymes, such as glutamate mutase, methionine synthase and methylmalonyl-CoA mutase
238505	Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13,
238505	B12 binding domain. This B12 binding domain is found in methionine synthase EC:2.1.1.13, and other shorter proteins that bind to B12. This domain is always found to the N-terminus of pfam02310. The structure of this domain is known, it is a 4 helix bundle
238505	Pterin binding enzyme. This family includes a variety of pterin binding enzymes that all adopt a TIM barrel fold. The family includes dihydropteroate synthase EC:2.5.1.15 as well as a group methyltransferase enzymes including methyltetrahydrofolate, corri
238507	Ribonucleotide reductase, small chain
238533	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
238533	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
238547	Core histone H2A/H2B/H3/H4
238550	Core histone H2A/H2B/H3/H4
238550	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
238555	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
238564	Protein kinase domain
238568	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
238569	7 transmembrane receptor (metabotropic glutamate family)
238569	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
238569	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
238569	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
238569	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
238569	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
238572	Sugar (and other) transporter
238586	Chaperonin 10 Kd subunit
238599	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
238619	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
238619	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
238623	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
238637	Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018
238637	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
238639	Cyclophilin type peptidyl-prolyl cis-trans isomerase
238643	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
238656	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
238657	Actin
238661	Ribosomal protein L13e
238672	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
238678	ArgK protein. The ArgK protein acts as an ATPase enzyme and as a kinase, and phosphorylates periplasmic binding proteins involved in the LAO (lysine, arginine, ornithine)/AO transport systems
238679	Ribosomal protein L23
238680	Fibrinogen beta and gamma chains, C-terminal globular domain
238680	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
238681	Translation initiation factor SUI1
238688	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
238688	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
238690	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
238692	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
238693	tRNA synthetases class I (K). This family includes only lysyl tRNA synthetases from prokaryotes
238703	NADH dehydrogenase
238710	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
238722	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
238724	ENV polyprotein (coat polyprotein)
238725	7 transmembrane receptor (rhodopsin family)
238725	7 transmembrane receptor (rhodopsin family)
238756	Ribosomal protein S5, C-terminal domain
238756	Ribosomal protein S5, N-terminal domain
238761	HMG (high mobility group) box
238765	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
238766	Ribosomal protein L21e
238786	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
238787	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
238787	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
238787	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
238790	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
238790	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
238804	Ribosomal L15
238836	Core histone H2A/H2B/H3/H4
238871	3'5'-cyclic nucleotide phosphodiesterase
238871	3'5'-cyclic nucleotide phosphodiesterase
238880	Actin
238880	Actin
238881	Electron transfer flavoprotein alpha subunit. This protein is distantly related to and forms a heterodimer with pfam01012
238896	WD domain, G-beta repeat
238927	Mnd1 family. This family of proteins includes MND1 from S. cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair
238939	Cadherin domain
238943	Enolase, N-terminal domain
238943	Enolase, C-terminal TIM barrel domain
238952	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
238955	Homeobox domain
238955	Pou domain - N-terminal to homeobox domain
238963	MIT domain
238963	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
238974	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
239006	Ribosomal protein S7e
239006	Orbivirus outer capsid protein VP2. VP2 acts as an anchor for VP1 and VP3. VP2 contains a non-specific DNA and RNA binding domain in the N-terminus
239013	Core histone H2A/H2B/H3/H4
239017	Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases
239017	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
239037	Fibronectin type III domain
239038	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
239052	ENV polyprotein (coat polyprotein)
239053	ENV polyprotein (coat polyprotein)
239054	Protein-tyrosine phosphatase
239054	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
239055	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
239055	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
239076	Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone
239080	7 transmembrane receptor (rhodopsin family)
239081	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
239085	Arginyl tRNA synthetase N terminal domain. This domain is found at the amino terminus of Arginyl tRNA synthetase, also called additional domain 1 (Add-1). It is about 140 residues long and it has been suggested that this domain will be involved in tRNA re
239087	Transcription initiation factor TFIID 23-30kDa subunit
239089	7 transmembrane receptor (metabotropic glutamate family)
239090	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
239091	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
239097	Leishmanolysin
239099	Homeobox domain
239106	Trypsin
239107	Fructose-bisphosphate aldolase class-I
239118	Ribosomal protein L13e
239121	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
239121	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
239122	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
239125	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
239126	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
239126	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
239138	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
239151	GTPase of unknown function
239157	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
239157	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
239158	'Cold-shock' DNA-binding domain
239167	ENV polyprotein (coat polyprotein)
239177	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
239177	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
239182	Ribosomal L29e protein family
239186	FKBP-type peptidyl-prolyl cis-trans isomerase
239189	PH domain. PH stands for pleckstrin homology
239191	C-5 cytosine-specific DNA methylase
239192	Olfactomedin-like domain
239193	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
239199	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
239217	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
239217	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
239245	Ribosomal protein L15
239250	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
239250	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
239273	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
239273	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
239283	7 transmembrane receptor (rhodopsin family)
239294	Ribosomal protein L19e
239314	Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length
239318	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
239318	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
239318	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
239319	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
239324	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
239324	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
239336	7 transmembrane receptor (rhodopsin family)
239336	7 transmembrane receptor (rhodopsin family)
239337	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
239337	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
239337	Ribosomal protein L23
239337	Thrombospondin type 1 domain
239337	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
239337	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
239341	Elongation factor 1 gamma, conserved domain
239343	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
239343	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
239343	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
239351	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
239364	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
239365	ENV polyprotein (coat polyprotein)
239366	ENV polyprotein (coat polyprotein)
239369	ENV polyprotein (coat polyprotein)
239378	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
239383	Ribonucleotide reductase, small chain
239390	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
239390	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
239392	Core histone H2A/H2B/H3/H4
239393	CUB domain
239406	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
239406	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
239411	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
239418	Ribosomal L18ae protein family
239420	CUB domain
239420	Sushi domain (SCR repeat)
239422	CUB domain
239422	Sushi domain (SCR repeat)
239424	CUB domain
239428	ENV polyprotein (coat polyprotein)
239428	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
239428	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
239428	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
239428	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
239434	Ribosomal L15
239454	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
239472	Ribosomal protein L24e
239488	ENV polyprotein (coat polyprotein)
239499	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
239502	Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
239502	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
239510	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
239515	Glycosyl hydrolase family 14. This family are beta amylases
239525	DHHA2 domain. This domain is often found adjacent to the DHH domain pfam01368 and is called DHHA2 for DHH associated domain. This domain is diagnostic of DHH subfamily 2 members. The domain is about 120 residues long and contains a conserved DXK motif at
239526	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
239528	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
239530	7 transmembrane receptor (rhodopsin family)
239536	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
239537	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
239540	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
239554	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
239554	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
239555	Mab-21 protein
239555	Mab-21 protein
239556	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
239564	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
239564	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
239591	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
239593	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
239598	S25 ribosomal protein
239598	Poly-adenylate binding protein, unique domain
239603	Sugar (and other) transporter
239606	Sugar (and other) transporter
239609	von Willebrand factor type D domain
239609	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
239611	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
239615	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
239618	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
239634	ENV polyprotein (coat polyprotein)
239651	7 transmembrane receptor (rhodopsin family)
239652	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
239652	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
239658	Ribosomal protein L36e
239659	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
239667	AMP-binding enzyme
239673	Intermediate filament protein
239673	Intermediate filament protein
239673	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
239673	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
239674	Intermediate filament protein
239679	Homeobox domain
239685	ENV polyprotein (coat polyprotein)
239685	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
239685	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
239685	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
239690	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
239707	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
239708	PWI domain
239710	ENV polyprotein (coat polyprotein)
239714	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
239719	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
239724	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
239724	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
239728	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
239731	Fibronectin type III domain
239731	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
239739	Lysosome-associated membrane glycoprotein (Lamp)
239740	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
239740	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
239743	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
239743	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
239748	Elongation factor 1 gamma, conserved domain
239748	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
239749	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
239755	Ribosomal protein L11, RNA binding domain
239755	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
239759	Lipase
239759	Lipase
239761	Ribosomal protein L35Ae
239761	Core histone H2A/H2B/H3/H4
239791	Eukaryotic porin
239796	Mab-21 protein
239796	Mab-21 protein
239814	S25 ribosomal protein
239826	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
239827	Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability
239828	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1
239833	Leishmanolysin
239835	Protein kinase domain
239835	Fibronectin type III domain
239845	7 transmembrane receptor (metabotropic glutamate family)
239845	7 transmembrane receptor (metabotropic glutamate family)
239846	Cyclophilin type peptidyl-prolyl cis-trans isomerase
239852	Zona pellucida-like domain
239853	7 transmembrane receptor (Secretin family)
239853	7 transmembrane receptor (Secretin family)
239853	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
239855	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
239857	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
239857	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
239863	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
239878	Glycyl-tRNA synthetase beta subunit
239878	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
239893	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
239916	Phosphoglycerate kinase
239931	PMP-22/EMP/MP20/Claudin family
239931	PMP-22/EMP/MP20/Claudin family
239933	Ribosomal protein L13e
239940	Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up int
239950	Galactose binding lectin domain
239955	Ribosomal protein S6e
239995	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
239998	Clathrin adaptor complex small chain
240003	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
240003	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
240007	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
240009	Ribosomal protein L19e
240027	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240028	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase act
240029	7 transmembrane receptor (metabotropic glutamate family)
240029	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
240030	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
240031	Arp2/3 complex, 34kD subunit p34-Arc. Arp2/3 protein complex has been implicated in the control of actin polymerization in cells. The human complex consists of seven subunits which include the actin related Arp2 and Arp3, and five others referred to as p4
240034	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240038	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240039	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240040	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240041	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240042	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
240045	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
240047	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
240047	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
240047	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases
240053	7 transmembrane receptor (rhodopsin family)
240054	NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in at least one member, proteolysis has shown that the Peptid
240055	Ribosomal protein S26e
240057	C2 domain
240057	Tropomyosin
240057	PH domain. PH stands for pleckstrin homology
240057	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
240057	ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits a
240058	C2 domain
240061	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
240063	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240064	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240064	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240067	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240068	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240069	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
240072	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
240073	Class II histocompatibility antigen, alpha domain
240074	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
240074	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
240075	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
240075	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
240087	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
240089	Core histone H2A/H2B/H3/H4
240091	Protein kinase domain
240091	Class I Histocompatibility antigen, domains alpha 1 and 2
240093	Zinc finger, C3HC4 type (RING finger)
240094	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
240095	Class I Histocompatibility antigen, domains alpha 1 and 2
240095	Class I Histocompatibility antigen, domains alpha 1 and 2
240097	7 transmembrane receptor (rhodopsin family)
240100	7 transmembrane receptor (rhodopsin family)
240110	Ribosomal L29e protein family
240117	Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play
240119	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
240119	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invo
240120	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240121	Fibronectin type III domain
240121	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
240123	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
240124	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
240125	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
240155	Ribosomal protein L11, RNA binding domain
240156	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
240160	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-stero
240168	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
240168	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
240171	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
240172	Ribosomal L29e protein family
240182	Cullin family
240182	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
240183	Zinc finger, C3HC4 type (RING finger)
240188	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
240188	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
240195	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
240198	Proteasome A-type and B-type
240204	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
240204	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
240205	Triosephosphate isomerase
240215	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
240216	Kinesin motor domain
240219	Glutamine synthetase, beta-Grasp domain
240219	Glutamine synthetase, catalytic domain
240222	Ribosomal protein S5, C-terminal domain
240222	Ribosomal protein S5, N-terminal domain
240232	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
240236	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
240236	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 is related to this family
240239	7 transmembrane receptor (rhodopsin family)
240255	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
240255	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f
240261	Glycosyl hydrolase family 26
240261	Uncharacterized ACR, COG1579
240269	Ribosomal protein L6e
240277	Enolase, C-terminal TIM barrel domain
240284	WD domain, G-beta repeat
240289	Proteasome A-type and B-type
240303	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
240303	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
240322	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
240322	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
240322	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
240332	Sodium:neurotransmitter symporter family
240343	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
240350	Ribosomal protein S7e
240350	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
240360	Galactose-1-phosphate uridyl transferase, C-terminal domain. SCOP reports fold duplication with N-terminal domain. Both involved in Zn and Fe binding
240367	Ribosomal L29e protein family
240411	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
240444	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
240444	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
240456	Ribosomal protein L36e
240456	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
240478	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
240479	Protein kinase domain
240482	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
240490	HMG (high mobility group) box
240493	PTB domain (IRS-1 type)
240493	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
240495	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
240496	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
240496	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
240498	ENV polyprotein (coat polyprotein)
240498	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
240498	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
240516	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
240519	Microtubule associated protein (MAP65/ASE1 family)
240525	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
240527	Mitochondrial carrier protein
240531	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
240537	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
240539	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
240541	Hsp90 protein
240543	7 transmembrane receptor (rhodopsin family)
240544	7 transmembrane receptor (rhodopsin family)
240545	7 transmembrane receptor (rhodopsin family)
240551	ENV polyprotein (coat polyprotein)
240551	7 transmembrane receptor (rhodopsin family)
240551	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
240551	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
240551	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
240551	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
240569	2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways . This family also contains members with functions other than HCC
240569	2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways. This family also contains members with functions other than HCCA
240582	Hyaluronidase
240590	DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown
240590	DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown
240590	DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai
240595	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
240595	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
240632	ab-hydrolase associated lipase region
240632	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
240633	ab-hydrolase associated lipase region
240641	Kinesin motor domain
240656	HMG (high mobility group) box
240657	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
240660	Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region
240660	Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region
240665	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termi
240667	WD domain, G-beta repeat
240669	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
240670	Ribosomal protein L31e
240672	Protein-tyrosine phosphatase
240672	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
240672	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
240675	von Willebrand factor type A domain
240677	ENV polyprotein (coat polyprotein)
240677	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
240677	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
240677	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
240677	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
240679	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
240679	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
240689	Proteasome A-type and B-type
240690	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
240693	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
240693	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
240693	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
240710	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
240722	Mitochondrial carrier protein
240725	Sulfatase
240726	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
240731	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
240731	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
240752	C2 domain
240752	Phosphatidylinositol 3- and 4-kinase
240752	PX domain. PX domains bind to phosphoinositides
240752	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
240752	Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation
240752	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
240752	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains (unpublished observa
240753	PH domain. PH stands for pleckstrin homology
240753	PH domain. PH stands for pleckstrin homology
240756	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
240756	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
240756	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
240756	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
240761	GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
240761	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
240762	7 transmembrane receptor (rhodopsin family)
240763	Macrophage migration inhibitory factor (MIF)
240773	Sushi domain (SCR repeat)
240776	Uncharacterized protein family, UPF0065
240776	Calcium-activated BK potassium channel alpha subunit
240776	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
240779	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
240779	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
240779	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
240783	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
240798	ENV polyprotein (coat polyprotein)
240798	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
240799	Ribosomal protein S6e
240811	Ribosomal protein S5, C-terminal domain
240811	Ribosomal protein S5, N-terminal domain
240841	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
240843	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl
240867	7 transmembrane receptor (rhodopsin family)
240869	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
240869	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
240871	Cyclic nucleotide-binding domain
240871	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
240880	Protein kinase domain
240888	7 transmembrane receptor (rhodopsin family)
240891	Protein of unknown function DUF47. This family includes prokaryotic proteins of unknown function, as well as a protein annotated as the pit accessory protein from Sinorhizobium meliloti. However, the function of this protein is also unknown (Pit stands fo
240892	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
240894	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
240895	SKIP/SNW domain. This domain is found in chromatin proteins
240898	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
240903	Translation initiation factor SUI1
240913	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
240921	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
240921	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
240923	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
240957	Fatty acid desaturase
240958	Fatty acid desaturase
240958	Protein kinase domain
240961	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
240970	Ribosomal protein L21e
241003	Fibronectin type III domain
241004	Fibronectin type III domain
241005	Fibronectin type III domain
241020	Deuterolysin metalloprotease (M35) family
241025	Ribosomal protein S17
241035	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
241041	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
241042	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
241042	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
241042	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
241042	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
241051	Protein kinase domain
241053	Ribosomal protein L11, RNA binding domain
241053	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
241054	Nucleoside diphosphate kinase
241056	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
241059	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
241060	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
241060	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
241063	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
241063	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
241063	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
241070	7 transmembrane receptor (rhodopsin family)
241073	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
241074	Cyclophilin type peptidyl-prolyl cis-trans isomerase
241075	PH domain. PH stands for pleckstrin homology
241077	Ribosomal protein L31e
241079	Protein kinase domain
241080	Protein kinase domain
241090	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
241091	Mitochondrial carrier protein
241097	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
241111	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
241118	Amiloride-sensitive sodium channel
241119	SKIP/SNW domain. This domain is found in chromatin proteins
241119	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
241140	Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes
241151	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
241152	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
241153	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
241154	ADP-ribosylation factor family
241154	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
241157	7 transmembrane receptor (rhodopsin family)
241158	MYND finger
241158	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
241176	ENV polyprotein (coat polyprotein)
241177	Proteasome A-type and B-type
241193	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
241197	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
241201	Cadherin domain
241201	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophili
241213	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
241215	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
241215	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
241217	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
241218	Microtubule associated protein (MAP65/ASE1 family)
241219	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
241219	Up-frameshift suppressor 2. Transcripts harboring premature signals for translation termination are recognized and rapidly degraded by eukaryotic cells through a pathway known as nonsense-mediated mRNA decay. In Saccharomyces cerevisiae, three trans-actin
241226	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
241228	CUB domain
241228	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
241230	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
241242	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
241242	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
241242	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
241242	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
241242	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
241263	7 transmembrane receptor (metabotropic glutamate family)
241263	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
241270	Zinc finger, C3HC4 type (RING finger)
241272	ENV polyprotein (coat polyprotein)
241294	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
241301	Cyclophilin type peptidyl-prolyl cis-trans isomerase
241308	PH domain. PH stands for pleckstrin homology
241311	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
241311	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
241320	Ribosomal protein L36e
241322	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
241324	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
241325	Thymosin beta-4 family
241327	Olfactomedin-like domain
241329	Uncharacterised protein family (UPF0108)
241329	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
241366	ENV polyprotein (coat polyprotein)
241366	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
241366	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
241366	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
241366	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
241385	Papain family cysteine protease
241390	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
241391	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
241422	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
241424	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
241432	LIM domain. This family represents two copies of the LIM structural domain
241440	ENV polyprotein (coat polyprotein)
241447	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
241452	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
241452	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
241462	Translation initiation factor SUI1
241487	Ribosomal protein S6e
241489	3'5'-cyclic nucleotide phosphodiesterase
241489	GAF domain. Domain present in phytochromes and cGMP-specific phosphodiesterases
241490	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternati
241504	Ribosomal L10
241506	Ribosomal protein L6
241512	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
241512	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
241515	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
241516	High molecular weight glutenin subunit. Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin
241521	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
241521	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
241524	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
241525	Yippee putative zinc-binding protein
241528	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
241528	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
241530	7 transmembrane receptor (rhodopsin family)
241531	7 transmembrane receptor (rhodopsin family)
241532	7 transmembrane receptor (rhodopsin family)
241536	7 transmembrane receptor (rhodopsin family)
241537	7 transmembrane receptor (rhodopsin family)
241539	7 transmembrane receptor (rhodopsin family)
241540	Homeobox domain
241552	Cyclophilin type peptidyl-prolyl cis-trans isomerase
241556	Tetraspanin family
241568	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
241568	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
241572	Pyruvate kinase, alpha/beta domain
241572	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
241572	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
241587	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
241593	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
241593	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
241612	Spore germination protein
241612	Sodium:solute symporter family
241621	Ribosomal protein L15
241625	Core histone H2A/H2B/H3/H4
241626	Transglutaminase family
241626	Transglutaminase family, C-terminal ig like domain
241626	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
241633	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
241634	SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop
241635	TFIIE alpha subunit. The general transcription factor TFIIE has an essential role in eukaryotic transcription initiation together with RNA polymerase II and other general factors. Human TFIIE consists of two subunits TFIIE-alpha and TFIIE-beta and joins t
241636	Transglutaminase family
241636	Transglutaminase family, C-terminal ig like domain
241636	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
241636	Transglutaminase family
241636	Transglutaminase family, C-terminal ig like domain
241636	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
241639	PH domain. PH stands for pleckstrin homology
241658	Ribosomal L10
241672	Calreticulin family
241681	Kinesin motor domain
241690	Ribosomal L10
241694	Rap/ran-GAP
241697	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
241697	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
241709	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
241714	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
241716	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
241723	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
241725	DNA gyrase B. This family represents the second domain of DNA gyrase B which has a ribosomal S5 domain 2-like fold. This family is structurally related to PF01119
241725	DNA gyrase B. This family represents the second domain of DNA gyrase B which has a ribosomal S5 domain 2-like fold. This family is structurally related to pfam01119
241725	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
241732	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
241735	MYND finger
241737	Ribosomal protein L6
241764	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
241764	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
241764	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
241765	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
241765	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
241769	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
241770	C2 domain
241786	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
241794	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
241794	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
241810	Ribosomal protein S5, N-terminal domain
241830	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
241850	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
241853	ENV polyprotein (coat polyprotein)
241853	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
241853	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
241854	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
241854	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
241854	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
241857	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
241857	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
241860	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
241860	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
241860	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
241863	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
241864	Protein kinase domain
241865	Zinc finger, C3HC4 type (RING finger)
241866	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
241866	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
241874	Cyclophilin type peptidyl-prolyl cis-trans isomerase
241877	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa
241878	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
241879	ENV polyprotein (coat polyprotein)
241882	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
241883	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
241883	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
241883	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
241890	Amino acid permease
241890	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
241890	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
241890	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
241897	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
241898	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
241899	ENV polyprotein (coat polyprotein)
241901	ENV polyprotein (coat polyprotein)
241901	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
241901	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
241915	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
241923	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
241942	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
241944	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
241950	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
241950	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
241957	Ribosomal protein L21e
241965	Cadherin domain
241965	Cadherin domain
241965	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
241967	Cyclophilin type peptidyl-prolyl cis-trans isomerase
241980	Cyclophilin type peptidyl-prolyl cis-trans isomerase
241983	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
241983	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
241983	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
241985	Pyruvate kinase, alpha/beta domain
241985	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
241989	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
241992	14-3-3 protein
241994	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
241994	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
241996	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
241996	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
242025	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
242036	Fructose-bisphosphate aldolase class-I
242037	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
242039	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
242039	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
242050	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
242050	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
242051	Poly-adenylate binding protein, unique domain
242051	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
242052	Carboxylesterase
242054	Enolase, C-terminal TIM barrel domain
242060	Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown
242061	ENV polyprotein (coat polyprotein)
242065	PH domain. PH stands for pleckstrin homology
242083	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
242084	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
242088	Cadherin domain
242093	7 transmembrane receptor (rhodopsin family)
242093	7 transmembrane receptor (rhodopsin family)
242096	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
242097	N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere
242100	N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere
242101	Ribosomal protein L21e
242107	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
242107	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
242111	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
242112	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
242112	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
242112	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
242122	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
242125	Mab-21 protein
242126	Sugar (and other) transporter
242126	Sugar (and other) transporter
242151	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
242151	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
242153	14-3-3 protein
242175	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
242175	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
242175	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
242185	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
242195	Cyclophilin type peptidyl-prolyl cis-trans isomerase
242199	Zinc-binding dehydrogenase
242200	Cyclophilin type peptidyl-prolyl cis-trans isomerase
242202	3'5'-cyclic nucleotide phosphodiesterase
242205	14-3-3 protein
242206	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
242206	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
242206	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
242220	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b
242221	Ribosomal protein L19e
242244	Cyclophilin type peptidyl-prolyl cis-trans isomerase
242257	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
242259	Domain of unknown function DUF20. This transmembrane region is found in putative permeases and predicted transmembrane proteins it has no known function. It is not clear what source suggested that these proteins may be permeases and this information shoul
242280	Protein kinase domain
242285	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
242285	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
242285	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
242286	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
242291	Inositol monophosphatase family
242311	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
242316	Transforming growth factor beta like domain
242316	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
242317	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
242317	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
242318	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
242319	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
242321	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
242339	Cyclic nucleotide-binding domain
242340	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
242341	ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae
242341	ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae
242363	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
242363	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
242363	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
242373	Ribosomal protein L23
242377	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
242384	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
242384	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
242384	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
242386	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
242388	Ribosomal protein S6e
242410	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
242410	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
242412	HMG (high mobility group) box
242425	7 transmembrane receptor (metabotropic glutamate family)
242425	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
242443	Bacterial extracellular solute-binding proteins, family 3
242443	Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors
242447	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
242451	NAC domain
242459	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
242474	Domain of unknown function DUF20. This transmembrane region is found in putative permeases and predicted transmembrane proteins it has no known function. It is not clear what source suggested that these proteins may be permeases and this information shoul
242481	Paralemmin
242496	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
242497	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
242499	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
242499	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
242502	Ribonucleotide reductase, small chain
242505	Uncharacterized ACR, COG1579
242505	Intermediate filament protein
242505	ADP-ribosylation factor family
242505	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
242505	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
242505	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
242505	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
242505	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
242506	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
242517	Interferon alpha/beta domain
242519	Interferon alpha/beta domain
242519	Interferon alpha/beta domain
242521	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
242523	DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown
242523	DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown
242523	DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai
242525	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
242536	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
242540	Ribosomal Proteins L2, C-terminal domain
242540	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
242540	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
242546	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
242555	Ribosomal family S4e
242557	Peptidase family C54
242570	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
242597	Ribosomal L29e protein family
242599	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
242603	CUB domain
242607	Sodium:dicarboxylate symporter family
242620	DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown
242643	Ribosomal protein L36e
242646	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
242646	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
242646	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
242651	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
242651	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
242653	PMP-22/EMP/MP20/Claudin family
242662	C2 domain
242663	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
242663	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
242665	Palmitoyl protein thioesterase
242667	Guanylate-kinase-associated protein (GKAP) protein
242669	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
242681	ADP-ribosylation factor family
242681	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
242702	Fibronectin type III domain
242702	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
242703	Core histone H2A/H2B/H3/H4
242705	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associ
242705	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
242711	Trypsin
242711	Protein of unknown function DUF100. This prokaryotic family has no known function
242712	CUB domain
242721	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
242726	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-
242733	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
242735	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
242737	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
242740	Carboxylesterase
242741	Carboxylesterase
242742	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
242744	HMG (high mobility group) box
242744	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
242745	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
242747	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
242748	AcrB/AcrD/AcrF family. Members of this family are integral membrane proteins. Some are involved in drug resistance
242748	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
242767	Sugar (and other) transporter
242785	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
242785	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
242789	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
242789	Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up int
242805	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
242805	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
242809	Ribosomal protein L3
242819	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
242823	ENV polyprotein (coat polyprotein)
242823	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
242823	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
242823	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
242825	Protein kinase domain
242825	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
242827	Ribosomal protein L21e
242842	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
242842	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
242842	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
242848	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
242851	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
242852	ADP-ribosylation factor family
242852	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
242891	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
242894	Ribosomal protein L31e
242904	Cyclophilin type peptidyl-prolyl cis-trans isomerase
242905	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
242914	Kinesin motor domain
242939	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
242940	Protein kinase domain
242940	POLO box duplicated region
242960	F-box domain
242960	F-box domain
242965	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
242987	Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site
242987	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
242987	Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found
242987	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
242997	Uncharacterized ACR, COG1579
242997	Intermediate filament protein
242997	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
242997	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
243011	Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat
243014	ADP-ribosylation factor family
243014	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
243016	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
243040	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
243043	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
243043	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
243044	Adenylate kinase
243044	Thymidylate kinase
243060	ADP-ribosylation factor family
243060	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
243062	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
243065	Protein kinase domain
243068	Methylenetetrahydrofolate reductase. This family includes the 5,10-methylenetetrahydrofolate reductase EC:1.7.99.5 from bacteria and methylenetetrahydrofolate reductase EC: 1.5.1.20 from eukaryotes. The structure for this domain is known to be a TIM barre
243077	Ribosomal protein L31e
243078	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-ster
243083	Trypsin
243083	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
243085	UDP-glucoronosyl and UDP-glucosyl transferase
243099	Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold
243100	Serum albumin family
243119	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
243126	HMG (high mobility group) box
243150	Ribosomal L22e protein family
243162	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
243163	Ribosomal protein L31e
243176	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
243176	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
243176	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243183	Ribosomal L15
243187	Homeobox domain
243200	7 transmembrane receptor (metabotropic glutamate family)
243205	7 transmembrane receptor (metabotropic glutamate family)
243206	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243208	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243209	Ribosomal protein L19e
243209	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243210	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243220	Sulfotransferase protein
243220	WSC domain. This domain may be involved in carbohydrate binding
243241	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
243241	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
243245	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243246	Protein kinase domain
243261	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
243270	7 transmembrane receptor (rhodopsin family)
243277	7 transmembrane receptor (Secretin family)
243277	7 transmembrane receptor (Secretin family)
243277	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
243302	S25 ribosomal protein
243305	Glypican
243306	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
243308	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243311	HMG (high mobility group) box
243312	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
243326	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
243328	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their
243332	Core histone H2A/H2B/H3/H4
243339	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
243341	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
243349	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
243352	Nucleoside diphosphate kinase
243368	7 transmembrane receptor (metabotropic glutamate family)
243368	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
243369	Thrombospondin type 1 domain
243369	von Willebrand factor type D domain
243369	Low-density lipoprotein receptor domain class A
243369	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
243369	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
243369	Thrombospondin type 1 domain
243369	Low-density lipoprotein receptor domain class A
243369	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
243369	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
243373	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
243373	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
243374	GTPase of unknown function
243375	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
243375	Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
243375	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge
243376	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
243376	Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
243376	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge
243377	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
243377	Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitou
243377	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrog
243378	Prp18 domain. The splicing factor Prp18 is required for the second step of pre-mRNA splicing. The structure of a large fragment of the Saccharomyces cerevisiae Prp18 is known. This fragment is fully active in yeast splicing in vitro and includes the seque
243382	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
243388	ENV polyprotein (coat polyprotein)
243407	7 transmembrane receptor (rhodopsin family)
243419	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243420	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243422	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243423	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243425	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243427	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
243428	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243430	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243431	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243433	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243434	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243436	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243437	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243438	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243439	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243440	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243442	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243444	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243445	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243446	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243448	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243449	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243450	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243455	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243457	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243458	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243459	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243460	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243461	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243462	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243463	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243464	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
243465	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243467	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243468	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243469	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243479	7 transmembrane receptor (rhodopsin family)
243484	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
243484	Polysaccharide biosynthesis protein. This is a family of diverse bacterial polysaccharide biosynthesis proteins including the CapD protein, WalL protein mannosyl-transferase and several putative epimerases (e.g. WbiI)
243484	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
243497	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
243514	Eukaryotic ribosomal protein L18
243523	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
243529	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
243536	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
243543	Ribosomal protein L23
243547	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
243548	LIM domain. This family represents two copies of the LIM structural domain
243574	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
243574	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
243575	Elongation factor 1 gamma, conserved domain
243606	NAC domain
243612	DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds
243616	Sodium:neurotransmitter symporter family
243617	Ribosomal protein L15
243621	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
243628	7 transmembrane receptor (metabotropic glutamate family)
243628	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
243628	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
243628	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
243628	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
243630	Amiloride-sensitive sodium channel
243635	Cyclophilin type peptidyl-prolyl cis-trans isomerase
243642	Ribosomal protein S5, C-terminal domain
243642	Ribosomal protein S5, N-terminal domain
243645	ADP-ribosylation factor family
243645	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
243645	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
243645	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
243645	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
243645	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
243658	Lectin C-type domain. This family includes both long and short form C-type
243665	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
243665	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
243665	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
243676	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
243676	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
243677	ENV polyprotein (coat polyprotein)
243689	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
243715	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
243716	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
243728	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
243737	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
243743	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans
243743	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: T
243743	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
243748	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243753	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predict
243755	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
243764	7 transmembrane receptor (rhodopsin family)
243766	Transcription factor TFIIB repeat
243781	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243784	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243785	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243786	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243790	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
243790	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
243790	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
243791	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
243794	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243796	VHS domain. Domain present in VPS-27, Hrs and STAM
243801	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
243811	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
243813	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
243815	ENV polyprotein (coat polyprotein)
243815	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
243815	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
243815	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
243815	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
243821	Protein kinase domain
243823	Isochorismatase family. This family are hydrolase enzymes
243824	Protein kinase domain
243827	ENV polyprotein (coat polyprotein)
243827	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
243827	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
243827	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
243831	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243831	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
243833	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243834	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243834	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
243835	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop
243836	Ribosomal protein S8e
243841	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
243841	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
243841	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
243849	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
243851	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
243858	Ribosomal L15
243863	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
243864	Class I Histocompatibility antigen, domains alpha 1 and 2
243864	Class I Histocompatibility antigen, domains alpha 1 and 2
243865	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
243865	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
243867	F-box domain
243869	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
243869	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
243869	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
243873	Ribosomal protein S19e
243874	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
243874	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
243875	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
243878	Ribosomal L15
243880	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
243881	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
243884	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243884	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
243889	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
243902	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243902	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
243905	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243906	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243906	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243908	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
243912	Hsp20/alpha crystallin family
243914	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function,
243914	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
243917	Ribosomal protein L23
243917	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
243918	Glycosyltransferase family 6
243918	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
243918	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
243937	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
243944	ENV polyprotein (coat polyprotein)
243948	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243951	7 transmembrane receptor (metabotropic glutamate family)
243951	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
243955	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243956	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243959	Trypsin
243961	Sterile alpha motif (SAM)/Pointed domain
243961	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
243961	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
243961	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
243961	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
243963	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243963	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
243964	Sodium:neurotransmitter symporter family
243965	Sodium:neurotransmitter symporter family
243967	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
243967	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
243968	Protein kinase domain
243973	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
243976	Phosphoglucose isomerase. Phosphoglucose isomerase catalyses the interconversion of glucose-6-phosphate and fructose-6-phosphate
243978	7 transmembrane receptor (rhodopsin family)
243979	7 transmembrane receptor (rhodopsin family)
243979	7 transmembrane receptor (rhodopsin family)
243981	7 transmembrane receptor (rhodopsin family)
244000	Zinc finger, C3HC4 type (RING finger)
244000	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
244003	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
244023	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
244023	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
244023	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
244027	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
244031	ENV polyprotein (coat polyprotein)
244041	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
244041	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
244049	C2 domain
244050	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
244050	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
244052	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
244052	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
244059	'chromo' (CHRromatin Organization MOdifier) domain
244059	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
244059	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
244059	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u
244061	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
244067	PH domain. PH stands for pleckstrin homology
244067	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
244089	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
244089	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
244089	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
244090	Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidine
244090	Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins. One member is a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane. One member is a a Na+-dependent nucleoside transporter se
244099	HMG (high mobility group) box
244111	Translation initiation factor SUI1
244112	7 transmembrane receptor (metabotropic glutamate family)
244112	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
244114	7 transmembrane receptor (metabotropic glutamate family)
244114	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
244115	7 transmembrane receptor (metabotropic glutamate family)
244115	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
244116	7 transmembrane receptor (metabotropic glutamate family)
244116	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
244121	NAC domain
244137	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
244144	Ubiquitin carboxyl-terminal hydrolase family 2
244144	Ubiquitin carboxyl-terminal hydrolases family 2
244145	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
244153	Protein kinase domain
244153	Adenylate and Guanylate cyclase catalytic domain
244153	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
244153	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
244167	Ribosomal L15
244171	HMG (high mobility group) box
244178	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
244178	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
244179	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
244183	B-box zinc finger
244183	Zinc finger, C3HC4 type (RING finger)
244187	7 transmembrane receptor (rhodopsin family)
244195	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
244197	7 transmembrane receptor (rhodopsin family)
244198	Olfactomedin-like domain
244200	7 transmembrane receptor (rhodopsin family)
244202	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
244202	L1 transposable element
244202	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
244202	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
244202	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
244209	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
244212	Sulfotransferase protein
244214	GDP dissociation inhibitor
244218	Ciliary neurotrophic factor
244220	von Willebrand factor type A domain
244226	Ribosomal protein L31e
244229	Ribosomal protein S5, C-terminal domain
244229	Ribosomal protein S5, N-terminal domain
244232	7 transmembrane receptor (rhodopsin family)
244234	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
244236	von Willebrand factor type D domain
244236	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
244236	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
244238	7 transmembrane receptor (rhodopsin family)
244238	7 transmembrane receptor (rhodopsin family)
244239	Actin
244243	Lectin C-type domain. This family includes both long and short form C-type
244250	7 transmembrane receptor (metabotropic glutamate family)
244258	Ribosomal protein S2
244258	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
244292	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
244310	Guanylate-kinase-associated protein (GKAP) protein
244321	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
244324	CUB domain
244329	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
244344	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
244345	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
244348	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
244349	MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases
244349	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
244361	Ribosomal L29e protein family
244371	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
244373	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
244375	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
244375	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
244389	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
244414	Ribosomal protein L19e
244414	Ribosomal protein L31e
244416	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase,
244416	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
244417	Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis
244417	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
244421	Zinc finger, C3HC4 type (RING finger)
244421	ATP-dependent protease La (LON) domain
244422	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
244428	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
244433	ENV polyprotein (coat polyprotein)
244433	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
244433	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
244433	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
244435	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
244442	HMG (high mobility group) box
244443	Disintegrin
244443	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
244443	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
244448	Zinc finger, C3HC4 type (RING finger)
244448	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
244486	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
244486	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
244486	Disintegrin
244486	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
244486	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
244487	Ribosomal protein S7e
244496	HMG (high mobility group) box
244509	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
244512	Ribosomal protein L3
244516	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
244532	Acyl-ACP thioesterase. This family consists of various acyl-acyl carrier protein (ACP) thioesterases (TE) these terminate fatty acyl group extension via hydrolyzing an acyl group on a fatty acid
244533	Possible metal-binding domain in RNase L inhibitor, RLI. Possible metal-binding domain in endoribonuclease RNase L inhibitor. Found at the N-terminal end of RNase L inhibitor proteins, adjacent to the 4Fe-4S binding domain, fer4, pfam00037. Also often fou
244537	Calx-beta domain
244555	ATP synthase subunit C
244556	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
244584	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
244585	C2 domain
244585	Intermediate filament protein
244585	Microtubule associated protein (MAP65/ASE1 family)
244585	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
244588	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
244588	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
244595	Carboxylesterase
244598	Carboxylesterase
244614	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
244616	Thymidine kinase
244625	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
244628	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
244637	ENV polyprotein (coat polyprotein)
244646	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
244646	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
244650	PH domain. PH stands for pleckstrin homology
244650	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
244651	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
244653	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
244653	Ribosomal protein L31e
244668	Rap/ran-GAP
244668	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
244671	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
244671	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
244677	ENV polyprotein (coat polyprotein)
244680	Fibronectin type III domain
244682	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
244683	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
244684	Ribosomal protein S5, N-terminal domain
244685	ENV polyprotein (coat polyprotein)
244685	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
244693	jmjN domain
244693	jmjC domain
244693	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
244694	jmjC domain
244698	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
244701	7 transmembrane receptor (rhodopsin family)
244710	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
244710	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
244710	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
244714	7 transmembrane receptor (rhodopsin family)
244723	Olfactomedin-like domain
244731	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
244754	HMG (high mobility group) box
244754	Herpesvirus UL6 like. This family consists of various proteins from the herpesviridae that are similar to herpes simplex virus type I UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome
244757	Glycosyl hydrolases family 35
244758	ADP-ribosylation factor family
244763	Helix-turn-helix. This large family of DNA binding helix-turn helix proteins includes Cro and CI
244810	7 transmembrane receptor (rhodopsin family)
244811	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
244813	Homeobox domain
244813	Homeobox domain
244838	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
244838	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
244859	Protein kinase domain
244864	Lectin C-type domain. This family includes both long and short form C-type
244869	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
244869	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
244885	SH2 domain
244886	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
244893	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
244894	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
244913	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
244917	HMG (high mobility group) box
244919	Leo1-like protein. Members of this family are part of the Paf1/RNA polymerase II complex. The Paf1 complex probably functions during the elongation phase of transcription
244923	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
244925	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
244925	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
244954	Trypsin
244955	Homeobox domain
244962	PX domain. PX domains bind to phosphoinositides
244967	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
244967	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
245000	Phosphatidylinositol 3- and 4-kinase
245000	FAT domain. The FAT domain is named after FRAP, ATM and TRRAP
245001	FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal
245007	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
245007	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
245007	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
245010	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
245020	Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region
245022	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
245022	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
245023	Ribosomal protein S26e
245026	von Willebrand factor type A domain
245027	von Willebrand factor type A domain
245029	ENV polyprotein (coat polyprotein)
245029	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
245029	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
245029	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
245045	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
245045	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
245047	Ribosomal S17
245049	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
245050	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
245055	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
245056	Core histone H2A/H2B/H3/H4
245062	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
245062	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
245066	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
245067	Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins
245068	Protein kinase domain
245074	Protein kinase domain
245075	Protein kinase domain
245086	Marek's disease glycoprotein A
245088	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
245091	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
245094	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
245095	ENV polyprotein (coat polyprotein)
245097	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
245098	7 transmembrane receptor (metabotropic glutamate family)
245109	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
245113	7 transmembrane receptor (metabotropic glutamate family)
245118	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
245128	Amino acid permease
245137	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
245143	Protein kinase domain
245143	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
245148	7 transmembrane receptor (rhodopsin family)
245155	7 transmembrane receptor (rhodopsin family)
245155	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
245161	Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins
245163	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
245171	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
245185	Protein kinase domain
245190	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
245193	3'5'-cyclic nucleotide phosphodiesterase
245202	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
245208	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
245211	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
245217	FAD linked oxidases, C-terminal domain. This domain has a ferredoxin-like fold
245217	pfam02913, FAD-oxidase_C, FAD linked oxidases, C-terminal domain. This domain has a ferredoxin-like fold
245238	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
245239	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
245251	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
245251	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
245251	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
245253	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
245258	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
245263	Homeobox domain
245266	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
245269	Protein kinase domain
245278	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
245290	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
245290	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
245290	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
245293	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
245294	Defensin propeptide
245295	Skp1 family, tetramerisation domain
245299	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
245310	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
245317	Core histone H2A/H2B/H3/H4
245318	Core histone H2A/H2B/H3/H4
245328	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
245335	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
245335	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
245335	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
245347	Uncharacterised protein family (UPF0041)
245350	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
245353	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
245355	Protein kinase domain
245355	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results
245357	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
245358	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
245358	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
245358	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
245358	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
245368	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
245372	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
245372	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
245372	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
245376	ENV polyprotein (coat polyprotein)
245376	LIM domain. This family represents two copies of the LIM structural domain
245376	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
245376	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
245377	PH domain. PH stands for pleckstrin homology
245380	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
245381	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
245381	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
245381	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
245391	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
245393	ENV polyprotein (coat polyprotein)
245403	WD domain, G-beta repeat
245404	WD domain, G-beta repeat
245405	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
245418	Homeobox domain
245418	Uncharacterized ACR, COG1579
245418	Pou domain - N-terminal to homeobox domain
245418	Archaeal ATPase. This family contain a conserved P-loop motif that is involved in binding ATP. This family is only found in archaebacteria and particularly in Methanococcus jannaschii that encodes sixteen members of this family
245418	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
245423	Endomembrane protein 70
245424	7 transmembrane receptor (rhodopsin family)
245436	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
245440	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
245442	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
245446	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
245446	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
245450	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
245453	Fork head domain
245456	S-adenosyl-L-homocysteine hydrolase
245456	Shikimate / quinate 5-dehydrogenase. This family contains both shikimate and quinate dehydrogenases. Shikimate 5-dehydrogenase catalyses the conversion of shikimate to 5-dehydroshikimate. This reaction is part of the shikimate pathway which is involved in
245470	EB1-like C-terminal motif. This motif is found at the C-terminus of proteins that are related to the EB1 protein. The EB1 proteins contain an N-terminal CH domain pfam00307. The human EB1 protein was originally discovered as a protein interacting with the
245472	Synaptophysin / synaptoporin
245474	PUA domain. The PUA domain named after PseudoUridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi
245474	PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi
245474	TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s
245480	ATP synthase
245483	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
245483	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
245484	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
245494	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
245494	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
245498	Zn-finger in Ran binding protein and others
245498	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
245501	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
245502	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
245506	ATP synthase
245508	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
245515	Peptidase family M41
245515	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
245525	HSF-type DNA-binding
245526	7 transmembrane receptor (rhodopsin family)
245528	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
245528	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
245532	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
245532	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
245533	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
245537	Carboxylesterase
245541	Ribosomal protein S2
245545	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
245548	Zinc finger, C3HC4 type (RING finger)
245564	Homeobox domain
245569	Fibronectin type III domain
245569	Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran
245573	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
245573	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
245574	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
245574	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
245575	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
245578	Cadherin domain
245580	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
245582	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
245586	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
245586	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
245589	ENV polyprotein (coat polyprotein)
245589	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
245589	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
245590	Ribosomal protein L15
245595	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
245596	Homeobox domain
245606	Ribosomal protein S5, C-terminal domain
245606	Ribosomal protein S5, N-terminal domain
245610	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucle
245614	NAC domain
245618	Protein-tyrosine phosphatase
245619	Protein kinase domain
245623	Ribosomal protein S7e
245630	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
245638	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
245643	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
245643	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
245644	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
245644	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
245645	Ribosomal protein S7e
245650	Protein kinase domain
245650	Adenylate and Guanylate cyclase catalytic domain
245650	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
245651	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
245656	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
245660	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
245663	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
245666	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
245667	Arenavirus glycoprotein
245667	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
245668	Ubiquitin carboxyl-terminal hydrolase family 2
245668	Ubiquitin carboxyl-terminal hydrolases family 2
245668	Zn-finger in ubiquitin-hydrolases and other protein
245670	ADP-ribosylation factor family
245670	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
245676	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
245682	ENV polyprotein (coat polyprotein)
245683	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
245683	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
245684	PH domain. PH stands for pleckstrin homology
245684	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
245684	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
245685	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
245708	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
245757	Ribosomal protein S2
245802	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
245828	Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses
245838	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
245841	RNA polymerase Rpb8. Rpb8 is a subunit common to the three yeast RNA polymerases, pol I, II and III. Rpb8 interacts with the largest subunit Rpb1, and with Rpb3 and Rpb11, two smaller subunits
245857	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
245860	Autophagy protein Apg9. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg9 plays a direct role in the formation of the cytoplasm to vacuole targetin
245867	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
245880	WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin
245881	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
245886	Vacuolar sorting protein 9 (VPS9) domain
245897	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
245903	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
245919	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
245920	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
245921	Protein kinase domain
245921	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with
245959	Mitochondrial carrier protein
245961	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
245964	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
245965	ATP synthase, Delta/Epsilon chain, beta-sandwich domain. Part of the ATP synthase CF(1). These subunits are part of the head unit of the ATP synthase. The subunits are called delta and epsilon in human and metazoan species but in bacterial species the de
245972	ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae
245973	V-ATPase subunit C
245977	7 transmembrane receptor (Secretin family)
245977	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
245979	LIM domain. This family represents two copies of the LIM structural domain
245980	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
245980	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
245981	Ribosomal L15
246048	Lectin C-type domain. This family includes both long and short form C-type
246049	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
246060	Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase
246073	Fork head domain
246075	7 transmembrane receptor (rhodopsin family)
246076	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
246086	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
246097	Death domain
246104	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth f
246115	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
246133	Slow voltage-gated potassium channel
246143	Death domain
246145	Phosphatidylethanolamine-binding protein
246149	Homeobox domain
246152	WD domain, G-beta repeat
246153	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
246153	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
246172	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
246174	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
246177	Myosin head (motor domain)
246185	Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrou
246211	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precurso
246212	Rap/ran-GAP
246214	Fibronectin type III domain
246215	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
246228	Fibronectin type III domain
246228	Fibronectin type III domain
246228	von Willebrand factor type A domain
246234	Na+/Pi-cotransporter. This is a family of mainly mammalian type II renal Na+/Pi-cotransporters with other related sequences from lower eukaryotes and bacteria some of which are also Na+/Pi-cotransporters. In the kidney the type II renal Na+/Pi-cotranspor
246235	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
246237	PH domain. PH stands for pleckstrin homology
246239	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
246240	Protein kinase domain
246243	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
246245	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
246247	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
246248	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
246249	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
246252	Carboxylesterase
246253	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
246257	Domain of unknown function (DUF341)
246267	Triosephosphate isomerase
246271	Ets-domain
246274	Uncharacterized protein family UPF0005
246277	Pyridoxal-dependent decarboxylase conserved domain
246280	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
246281	O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine
246293	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
246297	RPEL repeat. The RPEL repeat is named after four conserved amino acids it contains. The function of the RPEL repeat is unknown however it might be a DNA binding repeat based on the observation that one member contains a pfam02037 domain that is also impli
246301	Helix-loop-helix DNA-binding domain
246310	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
246313	7 transmembrane receptor (rhodopsin family)
246316	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function,
246316	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
246317	CUB domain
246317	Low-density lipoprotein receptor domain class A
246324	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
246325	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
246325	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
246327	DNA primase small subunit. DNA primase synthesizes the RNA primers for the Okazaki fragments in lagging strand DNA synthesis. DNA primase is a heterodimer of large and small subunits
246328	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
246329	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
246331	CUB domain
246331	MAM domain. An extracellular domain found in many receptors
246331	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
246332	Protein kinase domain
246334	P53
246358	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
246381	Protein kinase domain
246696	Mitochondrial carrier protein
246703	YjeF-related protein N-terminus
246710	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
246721	RNA polymerases L / 13 to 16 kDa subunit
246721	RNA polymerases L / 13 to 16 kDa subunit
246721	RNA polymerases L / 13 to 16 kDa subunit
246757	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
246757	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
246759	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
246760	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
246766	Glycosyltransferase family 6
246767	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
246770	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
246771	Mitochondrial carrier protein
246787	Sodium:solute symporter family
246788	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
246791	Homeobox domain
246792	Homeobox domain
252829	Homeobox domain
252830	Homeobox domain
252833	Transforming growth factor beta like domain
252833	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
252834	Transforming growth factor beta like domain
252834	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
252837	7 transmembrane receptor (rhodopsin family)
252837	7 transmembrane receptor (rhodopsin family)
252837	Thiamine biosynthesis protein (ThiI). ThiI is required for thiazole synthesis, required for thiamine biosynthesis
252856	Helix-loop-helix DNA-binding domain
252859	7 transmembrane receptor (rhodopsin family)
252859	7 transmembrane receptor (rhodopsin family)
252868	PMP-22/EMP/MP20/Claudin family
252870	Ubiquitin carboxyl-terminal hydrolase family 2
252870	Ubiquitin carboxyl-terminal hydrolases family 2
252870	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
252876	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic d
252880	OAR domain
252880	Homeobox domain
252882	MAM domain. An extracellular domain found in many receptors
252885	Homeobox domain
252886	Fork head domain
252888	Cadherin domain
252892	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function,
252892	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
252894	Cadherin domain
252895	Cadherin domain
252898	Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA
252904	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
252905	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
252906	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
252907	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
252908	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
252909	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
252910	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
252912	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
252917	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
252917	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
252919	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
252922	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
252922	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
252923	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
252923	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
252923	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
252924	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
252924	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
252926	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
252926	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
252927	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
252928	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Memb
252931	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
252934	Cytochrome c oxidase subunit Va. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit Va
252939	7 transmembrane receptor (rhodopsin family)
252941	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
252942	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
252960	7 transmembrane receptor (rhodopsin family)
252972	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
252974	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function,
252995	Fibronectin type III domain
253012	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
253013	Ribosomal protein L31e
253017	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-stero
253026	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
253141	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
253141	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
253228	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
253251	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
253272	Zinc finger, C3HC4 type (RING finger)
253289	Regulatory subunit of type II PKA R-subunit
253314	Eukaryotic initiation factor 4E
253323	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
253342	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
253387	Ribosomal protein L3
253430	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
253482	Ribosomal protein S6e
253512	Mitochondrial carrier protein
253558	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
253558	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
253559	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
253567	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
253659	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
253659	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
253665	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
253738	Fimbrial Usher protein. This protein is involved in biogenesis of gram negative bacterial pili
253738	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
253769	WD domain, G-beta repeat
253782	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
253825	Ribosomal protein L21e
253827	SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four decad
253839	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
253853	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
253853	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
253924	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
253933	7 transmembrane receptor (rhodopsin family)
253935	Fibrinogen beta and gamma chains, C-terminal globular domain
253936	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
253936	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
253943	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment
253947	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
253957	Ribosomal S3Ae family
253959	Rap/ran-GAP
253966	Ribosomal protein L21e
253980	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
253980	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
253986	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
254042	metallopeptidase family M24
254049	Actin
254088	Intermediate filament protein
254102	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
254109	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
254111	Intermediate filament protein
254116	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
254122	PX domain. PX domains bind to phosphoinositides
254145	Actin
254173	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
254240	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
254263	Cornichon protein
254263	Cornichon protein
254272	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
254287	Ribosomal protein L31e
254359	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
254394	MCM2/3/5 family
254395	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
254428	Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples ha
254552	NUDIX domain
254552	NUDIX domain
254556	Zinc finger, C3HC4 type (RING finger)
254571	Kinesin motor domain
254573	Core histone H2A/H2B/H3/H4
254620	Intermediate filament protein
254665	Protein kinase domain
254700	Ubiquitin carboxyl-terminal hydrolase family 2
254705	TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway
254773	Transglycosylase SLT domain. This family is distantly related to pfam00062
254783	7 transmembrane receptor (rhodopsin family)
254784	7 transmembrane receptor (rhodopsin family)
254786	7 transmembrane receptor (rhodopsin family)
254787	7 transmembrane receptor (rhodopsin family)
254791	Intermediate filament protein
254863	Protein of unknown function (DUF423). Potential integral membrane protein
254879	7 transmembrane receptor (rhodopsin family)
254953	PH domain. PH stands for pleckstrin homology
254953	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
254953	BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region
254954	Ribosomal protein L6e
254973	7 transmembrane receptor (rhodopsin family)
254974	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
255173	Ribosomal protein S7e
255189	C2 domain
255189	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop
255220	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
255239	Protein kinase domain
255239	Protein kinase domain
255239	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
255240	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
255290	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
255308	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
255308	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
255319	PH domain. PH stands for pleckstrin homology
255319	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
255319	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
255332	ADP-ribosylation factor family
255374	Metallo-beta-lactamase superfamily
255395	Ribosomal protein S21e
255447	Actin
255488	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
255519	HMG (high mobility group) box
255576	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
255576	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
255608	Ubiquitin carboxyl-terminal hydrolase family 2
255608	Ubiquitin carboxyl-terminal hydrolases family 2
255609	Ubiquitin carboxyl-terminal hydrolase family 2
255609	Ubiquitin carboxyl-terminal hydrolases family 2
255610	Ubiquitin carboxyl-terminal hydrolase family 2
255610	Ubiquitin carboxyl-terminal hydrolases family 2
255631	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
255681	Ribosomal protein L31e
255701	Ribosomal protein L44
255713	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
255723	7 transmembrane receptor (rhodopsin family)
255725	7 transmembrane receptor (rhodopsin family)
255726	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
255726	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
255731	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
255738	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold con
255743	MAM domain. An extracellular domain found in many receptors
255758	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
255761	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
255777	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
255812	FAD binding domain. This family includes members that bind FAD. This family includes the flavoprotein subunits from succinate and fumarate dehydrogenase, aspartate oxidase and the alpha subunit of adenylylsulfate reductase
255814	Ribosomal S17
255870	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
255877	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
255877	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
255896	Radical SAM superfamily
255896	MoaC family. Members of this family are involved in molybdenum cofactor biosynthesis. However their molecular function is not known
255926	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
255928	C2 domain
255997	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
256000	ADP-ribosylation factor family
256000	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
256006	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
256042	7 transmembrane receptor (rhodopsin family)
256051	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
256055	Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
256143	7 transmembrane receptor (rhodopsin family)
256144	7 transmembrane receptor (rhodopsin family)
256148	7 transmembrane receptor (rhodopsin family)
256158	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
256190	7 transmembrane receptor (rhodopsin family)
256233	Eukaryotic ribosomal protein L18
256248	Adenylate kinase
256281	NUDIX domain
256297	Helix-loop-helix DNA-binding domain
256329	Intermediate filament tail domain
256355	Ribosomal protein S5, N-terminal domain
256364	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes
256368	LIM domain. This family represents two copies of the LIM structural domain
256372	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
256374	Cyclophilin type peptidyl-prolyl cis-trans isomerase
256380	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
256394	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
256401	Protein-tyrosine phosphatase
256401	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
256435	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
256441	Ribosomal L22e protein family
256447	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
256457	Ribosomal protein L21e
256470	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
256471	Sugar (and other) transporter
256536	FF domain. This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions
256568	XRCC1 N terminal domain
256576	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
256597	7 transmembrane receptor (rhodopsin family)
256691	MAM domain. An extracellular domain found in many receptors
256755	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
256892	7 transmembrane receptor (rhodopsin family)
256949	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
256970	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
256987	TMS membrane protein/tumour differentially expressed protein (TDE)
256987	TMS membrane protein/tumour differentially expressed protein (TDE)
257019	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
257022	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
257039	Ribosomal S17
257054	pfam02913, FAD-oxidase_C, FAD linked oxidases, C-terminal domain. This domain has a ferredoxin-like fold
257067	14-3-3 protein
257068	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
257079	Glutaredoxin
257089	7 transmembrane receptor (rhodopsin family)
257101	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
257106	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
257137	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
257202	Glutathione peroxidase
257202	Glutathione peroxidase
257210	Cyclophilin type peptidyl-prolyl cis-trans isomerase
257218	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
257218	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
257240	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
257240	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
257340	Trehalase
257364	PX domain. PX domains bind to phosphoinositides
257397	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
257408	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
257408	BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region
257632	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
257632	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
257634	Homeobox domain
257634	LIM domain. This family represents two copies of the LIM structural domain
257643	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
257646	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
257647	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
257648	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
257663	7 transmembrane receptor (rhodopsin family)
257665	7 transmembrane receptor (rhodopsin family)
257667	7 transmembrane receptor (rhodopsin family)
257913	7 transmembrane receptor (rhodopsin family)
258266	7 transmembrane receptor (rhodopsin family)
258267	7 transmembrane receptor (rhodopsin family)
258268	7 transmembrane receptor (rhodopsin family)
258269	7 transmembrane receptor (rhodopsin family)
258270	7 transmembrane receptor (rhodopsin family)
258271	7 transmembrane receptor (rhodopsin family)
258272	7 transmembrane receptor (rhodopsin family)
258273	7 transmembrane receptor (rhodopsin family)
258274	7 transmembrane receptor (rhodopsin family)
258275	7 transmembrane receptor (rhodopsin family)
258276	7 transmembrane receptor (rhodopsin family)
258277	7 transmembrane receptor (rhodopsin family)
258278	7 transmembrane receptor (rhodopsin family)
258279	7 transmembrane receptor (rhodopsin family)
258280	7 transmembrane receptor (rhodopsin family)
258281	7 transmembrane receptor (rhodopsin family)
258282	7 transmembrane receptor (rhodopsin family)
258283	7 transmembrane receptor (rhodopsin family)
258284	7 transmembrane receptor (rhodopsin family)
258285	7 transmembrane receptor (rhodopsin family)
258286	7 transmembrane receptor (rhodopsin family)
258287	7 transmembrane receptor (rhodopsin family)
258288	7 transmembrane receptor (rhodopsin family)
258289	7 transmembrane receptor (rhodopsin family)
258290	7 transmembrane receptor (rhodopsin family)
258291	7 transmembrane receptor (rhodopsin family)
258292	7 transmembrane receptor (rhodopsin family)
258293	7 transmembrane receptor (rhodopsin family)
258294	7 transmembrane receptor (rhodopsin family)
258295	7 transmembrane receptor (rhodopsin family)
258296	7 transmembrane receptor (rhodopsin family)
258297	7 transmembrane receptor (rhodopsin family)
258298	7 transmembrane receptor (rhodopsin family)
258300	7 transmembrane receptor (rhodopsin family)
258301	7 transmembrane receptor (rhodopsin family)
258302	7 transmembrane receptor (rhodopsin family)
258303	7 transmembrane receptor (rhodopsin family)
258304	7 transmembrane receptor (rhodopsin family)
258305	7 transmembrane receptor (rhodopsin family)
258306	7 transmembrane receptor (rhodopsin family)
258307	7 transmembrane receptor (rhodopsin family)
258308	7 transmembrane receptor (rhodopsin family)
258309	7 transmembrane receptor (rhodopsin family)
258311	7 transmembrane receptor (rhodopsin family)
258313	7 transmembrane receptor (rhodopsin family)
258314	7 transmembrane receptor (rhodopsin family)
258315	7 transmembrane receptor (rhodopsin family)
258316	7 transmembrane receptor (rhodopsin family)
258317	7 transmembrane receptor (rhodopsin family)
258318	7 transmembrane receptor (rhodopsin family)
258319	7 transmembrane receptor (rhodopsin family)
258320	7 transmembrane receptor (rhodopsin family)
258321	7 transmembrane receptor (rhodopsin family)
258322	7 transmembrane receptor (rhodopsin family)
258323	7 transmembrane receptor (rhodopsin family)
258324	7 transmembrane receptor (rhodopsin family)
258325	7 transmembrane receptor (rhodopsin family)
258326	7 transmembrane receptor (rhodopsin family)
258327	7 transmembrane receptor (rhodopsin family)
258328	7 transmembrane receptor (rhodopsin family)
258329	7 transmembrane receptor (rhodopsin family)
258330	7 transmembrane receptor (rhodopsin family)
258331	7 transmembrane receptor (rhodopsin family)
258333	7 transmembrane receptor (rhodopsin family)
258334	7 transmembrane receptor (rhodopsin family)
258335	7 transmembrane receptor (rhodopsin family)
258336	7 transmembrane receptor (rhodopsin family)
258338	7 transmembrane receptor (rhodopsin family)
258339	7 transmembrane receptor (rhodopsin family)
258340	7 transmembrane receptor (rhodopsin family)
258341	7 transmembrane receptor (rhodopsin family)
258342	7 transmembrane receptor (rhodopsin family)
258343	7 transmembrane receptor (rhodopsin family)
258344	7 transmembrane receptor (rhodopsin family)
258345	7 transmembrane receptor (rhodopsin family)
258346	7 transmembrane receptor (rhodopsin family)
258347	7 transmembrane receptor (rhodopsin family)
258348	7 transmembrane receptor (rhodopsin family)
258350	7 transmembrane receptor (rhodopsin family)
258351	7 transmembrane receptor (rhodopsin family)
258352	7 transmembrane receptor (rhodopsin family)
258353	7 transmembrane receptor (rhodopsin family)
258354	7 transmembrane receptor (rhodopsin family)
258355	7 transmembrane receptor (rhodopsin family)
258356	7 transmembrane receptor (rhodopsin family)
258357	7 transmembrane receptor (rhodopsin family)
258358	7 transmembrane receptor (rhodopsin family)
258359	7 transmembrane receptor (rhodopsin family)
258360	7 transmembrane receptor (rhodopsin family)
258361	7 transmembrane receptor (rhodopsin family)
258362	7 transmembrane receptor (rhodopsin family)
258363	7 transmembrane receptor (rhodopsin family)
258364	7 transmembrane receptor (rhodopsin family)
258365	7 transmembrane receptor (rhodopsin family)
258366	7 transmembrane receptor (rhodopsin family)
258368	7 transmembrane receptor (rhodopsin family)
258369	7 transmembrane receptor (rhodopsin family)
258370	7 transmembrane receptor (rhodopsin family)
258371	7 transmembrane receptor (rhodopsin family)
258372	7 transmembrane receptor (rhodopsin family)
258373	7 transmembrane receptor (rhodopsin family)
258374	7 transmembrane receptor (rhodopsin family)
258375	7 transmembrane receptor (rhodopsin family)
258377	7 transmembrane receptor (rhodopsin family)
258378	7 transmembrane receptor (rhodopsin family)
258379	7 transmembrane receptor (rhodopsin family)
258380	7 transmembrane receptor (rhodopsin family)
258381	7 transmembrane receptor (rhodopsin family)
258382	7 transmembrane receptor (rhodopsin family)
258383	7 transmembrane receptor (rhodopsin family)
258384	7 transmembrane receptor (rhodopsin family)
258385	7 transmembrane receptor (rhodopsin family)
258385	7 transmembrane receptor (rhodopsin family)
258386	7 transmembrane receptor (rhodopsin family)
258387	7 transmembrane receptor (rhodopsin family)
258387	7 transmembrane receptor (rhodopsin family)
258388	7 transmembrane receptor (rhodopsin family)
258389	7 transmembrane receptor (rhodopsin family)
258390	7 transmembrane receptor (rhodopsin family)
258391	7 transmembrane receptor (rhodopsin family)
258392	7 transmembrane receptor (rhodopsin family)
258393	7 transmembrane receptor (rhodopsin family)
258394	7 transmembrane receptor (rhodopsin family)
258395	7 transmembrane receptor (rhodopsin family)
258396	7 transmembrane receptor (rhodopsin family)
258397	7 transmembrane receptor (rhodopsin family)
258398	7 transmembrane receptor (rhodopsin family)
258399	7 transmembrane receptor (rhodopsin family)
258400	7 transmembrane receptor (rhodopsin family)
258401	7 transmembrane receptor (rhodopsin family)
258402	7 transmembrane receptor (rhodopsin family)
258404	7 transmembrane receptor (rhodopsin family)
258405	7 transmembrane receptor (rhodopsin family)
258406	7 transmembrane receptor (rhodopsin family)
258407	7 transmembrane receptor (rhodopsin family)
258408	7 transmembrane receptor (rhodopsin family)
258409	7 transmembrane receptor (rhodopsin family)
258410	7 transmembrane receptor (rhodopsin family)
258410	7 transmembrane receptor (rhodopsin family)
258411	7 transmembrane receptor (rhodopsin family)
258412	7 transmembrane receptor (rhodopsin family)
258412	7 transmembrane receptor (rhodopsin family)
258413	7 transmembrane receptor (rhodopsin family)
258414	7 transmembrane receptor (rhodopsin family)
258415	7 transmembrane receptor (rhodopsin family)
258415	7 transmembrane receptor (rhodopsin family)
258416	7 transmembrane receptor (rhodopsin family)
258417	7 transmembrane receptor (rhodopsin family)
258418	7 transmembrane receptor (rhodopsin family)
258418	7 transmembrane receptor (rhodopsin family)
258419	7 transmembrane receptor (rhodopsin family)
258420	7 transmembrane receptor (rhodopsin family)
258421	7 transmembrane receptor (rhodopsin family)
258422	7 transmembrane receptor (rhodopsin family)
258423	7 transmembrane receptor (rhodopsin family)
258424	7 transmembrane receptor (rhodopsin family)
258425	7 transmembrane receptor (rhodopsin family)
258426	7 transmembrane receptor (rhodopsin family)
258427	7 transmembrane receptor (rhodopsin family)
258428	7 transmembrane receptor (rhodopsin family)
258429	7 transmembrane receptor (rhodopsin family)
258430	7 transmembrane receptor (rhodopsin family)
258431	7 transmembrane receptor (rhodopsin family)
258432	7 transmembrane receptor (rhodopsin family)
258433	7 transmembrane receptor (rhodopsin family)
258434	7 transmembrane receptor (rhodopsin family)
258435	7 transmembrane receptor (rhodopsin family)
258436	7 transmembrane receptor (rhodopsin family)
258436	7 transmembrane receptor (rhodopsin family)
258437	7 transmembrane receptor (rhodopsin family)
258438	7 transmembrane receptor (rhodopsin family)
258439	7 transmembrane receptor (rhodopsin family)
258440	7 transmembrane receptor (rhodopsin family)
258441	7 transmembrane receptor (rhodopsin family)
258442	7 transmembrane receptor (rhodopsin family)
258442	7 transmembrane receptor (rhodopsin family)
258443	7 transmembrane receptor (rhodopsin family)
258444	7 transmembrane receptor (rhodopsin family)
258444	7 transmembrane receptor (rhodopsin family)
258445	7 transmembrane receptor (rhodopsin family)
258445	7 transmembrane receptor (rhodopsin family)
258445	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
258445	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
258445	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
258446	7 transmembrane receptor (rhodopsin family)
258447	7 transmembrane receptor (rhodopsin family)
258448	7 transmembrane receptor (rhodopsin family)
258449	7 transmembrane receptor (rhodopsin family)
258450	7 transmembrane receptor (rhodopsin family)
258451	7 transmembrane receptor (rhodopsin family)
258453	7 transmembrane receptor (rhodopsin family)
258454	7 transmembrane receptor (rhodopsin family)
258455	7 transmembrane receptor (rhodopsin family)
258456	7 transmembrane receptor (rhodopsin family)
258457	7 transmembrane receptor (rhodopsin family)
258458	7 transmembrane receptor (rhodopsin family)
258459	7 transmembrane receptor (rhodopsin family)
258460	7 transmembrane receptor (rhodopsin family)
258462	7 transmembrane receptor (rhodopsin family)
258463	7 transmembrane receptor (rhodopsin family)
258464	7 transmembrane receptor (rhodopsin family)
258465	7 transmembrane receptor (rhodopsin family)
258465	7 transmembrane receptor (rhodopsin family)
258466	7 transmembrane receptor (rhodopsin family)
258468	7 transmembrane receptor (rhodopsin family)
258469	7 transmembrane receptor (rhodopsin family)
258470	7 transmembrane receptor (rhodopsin family)
258471	7 transmembrane receptor (rhodopsin family)
258472	7 transmembrane receptor (rhodopsin family)
258472	7 transmembrane receptor (rhodopsin family)
258473	7 transmembrane receptor (rhodopsin family)
258474	7 transmembrane receptor (rhodopsin family)
258475	7 transmembrane receptor (rhodopsin family)
258477	7 transmembrane receptor (rhodopsin family)
258478	7 transmembrane receptor (rhodopsin family)
258479	7 transmembrane receptor (rhodopsin family)
258480	7 transmembrane receptor (rhodopsin family)
258481	7 transmembrane receptor (rhodopsin family)
258482	7 transmembrane receptor (rhodopsin family)
258483	7 transmembrane receptor (rhodopsin family)
258484	7 transmembrane receptor (rhodopsin family)
258484	7 transmembrane receptor (rhodopsin family)
258485	7 transmembrane receptor (rhodopsin family)
258486	7 transmembrane receptor (rhodopsin family)
258487	7 transmembrane receptor (rhodopsin family)
258488	7 transmembrane receptor (rhodopsin family)
258488	7 transmembrane receptor (rhodopsin family)
258489	7 transmembrane receptor (rhodopsin family)
258490	7 transmembrane receptor (rhodopsin family)
258491	7 transmembrane receptor (rhodopsin family)
258492	7 transmembrane receptor (rhodopsin family)
258493	7 transmembrane receptor (rhodopsin family)
258494	7 transmembrane receptor (rhodopsin family)
258495	7 transmembrane receptor (rhodopsin family)
258497	7 transmembrane receptor (rhodopsin family)
258498	7 transmembrane receptor (rhodopsin family)
258499	7 transmembrane receptor (rhodopsin family)
258499	7 transmembrane receptor (rhodopsin family)
258500	7 transmembrane receptor (rhodopsin family)
258500	7 transmembrane receptor (rhodopsin family)
258501	7 transmembrane receptor (rhodopsin family)
258502	7 transmembrane receptor (rhodopsin family)
258503	7 transmembrane receptor (rhodopsin family)
258504	7 transmembrane receptor (rhodopsin family)
258505	7 transmembrane receptor (rhodopsin family)
258506	7 transmembrane receptor (rhodopsin family)
258507	7 transmembrane receptor (rhodopsin family)
258508	7 transmembrane receptor (rhodopsin family)
258509	7 transmembrane receptor (rhodopsin family)
258509	7 transmembrane receptor (rhodopsin family)
258511	7 transmembrane receptor (rhodopsin family)
258511	7 transmembrane receptor (rhodopsin family)
258512	7 transmembrane receptor (rhodopsin family)
258513	7 transmembrane receptor (rhodopsin family)
258513	7 transmembrane receptor (rhodopsin family)
258515	7 transmembrane receptor (rhodopsin family)
258515	7 transmembrane receptor (rhodopsin family)
258516	7 transmembrane receptor (rhodopsin family)
258516	7 transmembrane receptor (rhodopsin family)
258517	7 transmembrane receptor (rhodopsin family)
258517	7 transmembrane receptor (rhodopsin family)
258518	7 transmembrane receptor (rhodopsin family)
258518	7 transmembrane receptor (rhodopsin family)
258519	7 transmembrane receptor (rhodopsin family)
258520	7 transmembrane receptor (rhodopsin family)
258521	7 transmembrane receptor (rhodopsin family)
258522	7 transmembrane receptor (rhodopsin family)
258523	7 transmembrane receptor (rhodopsin family)
258524	7 transmembrane receptor (rhodopsin family)
258525	7 transmembrane receptor (rhodopsin family)
258526	7 transmembrane receptor (rhodopsin family)
258527	7 transmembrane receptor (rhodopsin family)
258528	7 transmembrane receptor (rhodopsin family)
258529	7 transmembrane receptor (rhodopsin family)
258530	7 transmembrane receptor (rhodopsin family)
258531	7 transmembrane receptor (rhodopsin family)
258532	7 transmembrane receptor (rhodopsin family)
258533	7 transmembrane receptor (rhodopsin family)
258534	7 transmembrane receptor (rhodopsin family)
258536	7 transmembrane receptor (rhodopsin family)
258537	7 transmembrane receptor (rhodopsin family)
258538	7 transmembrane receptor (rhodopsin family)
258539	7 transmembrane receptor (rhodopsin family)
258539	7 transmembrane receptor (rhodopsin family)
258540	7 transmembrane receptor (rhodopsin family)
258541	7 transmembrane receptor (rhodopsin family)
258542	7 transmembrane receptor (rhodopsin family)
258543	7 transmembrane receptor (rhodopsin family)
258544	7 transmembrane receptor (rhodopsin family)
258545	7 transmembrane receptor (rhodopsin family)
258546	7 transmembrane receptor (rhodopsin family)
258547	7 transmembrane receptor (rhodopsin family)
258548	7 transmembrane receptor (rhodopsin family)
258549	7 transmembrane receptor (rhodopsin family)
258551	7 transmembrane receptor (rhodopsin family)
258552	7 transmembrane receptor (rhodopsin family)
258555	7 transmembrane receptor (rhodopsin family)
258557	7 transmembrane receptor (rhodopsin family)
258558	7 transmembrane receptor (rhodopsin family)
258559	7 transmembrane receptor (rhodopsin family)
258560	7 transmembrane receptor (rhodopsin family)
258561	7 transmembrane receptor (rhodopsin family)
258562	7 transmembrane receptor (rhodopsin family)
258563	7 transmembrane receptor (rhodopsin family)
258564	7 transmembrane receptor (rhodopsin family)
258565	7 transmembrane receptor (rhodopsin family)
258566	7 transmembrane receptor (rhodopsin family)
258568	7 transmembrane receptor (rhodopsin family)
258569	7 transmembrane receptor (rhodopsin family)
258570	7 transmembrane receptor (rhodopsin family)
258571	7 transmembrane receptor (rhodopsin family)
258572	7 transmembrane receptor (rhodopsin family)
258573	7 transmembrane receptor (rhodopsin family)
258574	7 transmembrane receptor (rhodopsin family)
258575	7 transmembrane receptor (rhodopsin family)
258577	7 transmembrane receptor (rhodopsin family)
258578	7 transmembrane receptor (rhodopsin family)
258579	7 transmembrane receptor (rhodopsin family)
258580	7 transmembrane receptor (rhodopsin family)
258581	7 transmembrane receptor (rhodopsin family)
258582	7 transmembrane receptor (rhodopsin family)
258583	7 transmembrane receptor (rhodopsin family)
258584	7 transmembrane receptor (rhodopsin family)
258585	7 transmembrane receptor (rhodopsin family)
258586	7 transmembrane receptor (rhodopsin family)
258587	7 transmembrane receptor (rhodopsin family)
258589	7 transmembrane receptor (rhodopsin family)
258590	7 transmembrane receptor (rhodopsin family)
258591	7 transmembrane receptor (rhodopsin family)
258592	7 transmembrane receptor (rhodopsin family)
258593	7 transmembrane receptor (rhodopsin family)
258594	7 transmembrane receptor (rhodopsin family)
258595	7 transmembrane receptor (rhodopsin family)
258597	7 transmembrane receptor (rhodopsin family)
258598	7 transmembrane receptor (rhodopsin family)
258600	7 transmembrane receptor (rhodopsin family)
258601	7 transmembrane receptor (rhodopsin family)
258602	7 transmembrane receptor (rhodopsin family)
258603	7 transmembrane receptor (rhodopsin family)
258603	7 transmembrane receptor (rhodopsin family)
258604	7 transmembrane receptor (rhodopsin family)
258604	7 transmembrane receptor (rhodopsin family)
258605	7 transmembrane receptor (rhodopsin family)
258606	7 transmembrane receptor (rhodopsin family)
258607	7 transmembrane receptor (rhodopsin family)
258608	7 transmembrane receptor (rhodopsin family)
258609	7 transmembrane receptor (rhodopsin family)
258609	7 transmembrane receptor (rhodopsin family)
258610	7 transmembrane receptor (rhodopsin family)
258611	7 transmembrane receptor (rhodopsin family)
258611	7 transmembrane receptor (rhodopsin family)
258612	7 transmembrane receptor (rhodopsin family)
258612	7 transmembrane receptor (rhodopsin family)
258613	7 transmembrane receptor (rhodopsin family)
258614	7 transmembrane receptor (rhodopsin family)
258615	7 transmembrane receptor (rhodopsin family)
258616	7 transmembrane receptor (rhodopsin family)
258617	7 transmembrane receptor (rhodopsin family)
258618	7 transmembrane receptor (rhodopsin family)
258620	7 transmembrane receptor (rhodopsin family)
258621	7 transmembrane receptor (rhodopsin family)
258623	7 transmembrane receptor (rhodopsin family)
258624	7 transmembrane receptor (rhodopsin family)
258625	7 transmembrane receptor (rhodopsin family)
258626	7 transmembrane receptor (rhodopsin family)
258627	7 transmembrane receptor (rhodopsin family)
258628	7 transmembrane receptor (rhodopsin family)
258629	7 transmembrane receptor (rhodopsin family)
258630	7 transmembrane receptor (rhodopsin family)
258631	7 transmembrane receptor (rhodopsin family)
258632	7 transmembrane receptor (rhodopsin family)
258633	7 transmembrane receptor (rhodopsin family)
258634	7 transmembrane receptor (rhodopsin family)
258635	7 transmembrane receptor (rhodopsin family)
258636	7 transmembrane receptor (rhodopsin family)
258638	7 transmembrane receptor (rhodopsin family)
258639	7 transmembrane receptor (rhodopsin family)
258640	7 transmembrane receptor (rhodopsin family)
258641	7 transmembrane receptor (rhodopsin family)
258642	7 transmembrane receptor (rhodopsin family)
258643	7 transmembrane receptor (rhodopsin family)
258644	7 transmembrane receptor (rhodopsin family)
258645	7 transmembrane receptor (rhodopsin family)
258647	7 transmembrane receptor (rhodopsin family)
258647	7 transmembrane receptor (rhodopsin family)
258648	7 transmembrane receptor (rhodopsin family)
258649	7 transmembrane receptor (rhodopsin family)
258649	7 transmembrane receptor (rhodopsin family)
258650	7 transmembrane receptor (rhodopsin family)
258650	7 transmembrane receptor (rhodopsin family)
258651	7 transmembrane receptor (rhodopsin family)
258652	7 transmembrane receptor (rhodopsin family)
258653	7 transmembrane receptor (rhodopsin family)
258654	7 transmembrane receptor (rhodopsin family)
258655	7 transmembrane receptor (rhodopsin family)
258656	7 transmembrane receptor (rhodopsin family)
258657	7 transmembrane receptor (rhodopsin family)
258658	7 transmembrane receptor (rhodopsin family)
258659	7 transmembrane receptor (rhodopsin family)
258660	7 transmembrane receptor (rhodopsin family)
258661	7 transmembrane receptor (rhodopsin family)
258661	7 transmembrane receptor (rhodopsin family)
258661	7TM chemoreceptor. This large family of proteins are related to pfam00001. They are 7 transmembrane receptors. This family does not include all known members, as there are problems with overlapping specificity with pfam00001. This family is greatly expand
258662	7 transmembrane receptor (rhodopsin family)
258663	7 transmembrane receptor (rhodopsin family)
258665	7 transmembrane receptor (rhodopsin family)
258666	7 transmembrane receptor (rhodopsin family)
258667	7 transmembrane receptor (rhodopsin family)
258668	7 transmembrane receptor (rhodopsin family)
258669	7 transmembrane receptor (rhodopsin family)
258670	7 transmembrane receptor (rhodopsin family)
258671	7 transmembrane receptor (rhodopsin family)
258672	7 transmembrane receptor (rhodopsin family)
258673	7 transmembrane receptor (rhodopsin family)
258674	7 transmembrane receptor (rhodopsin family)
258675	7 transmembrane receptor (rhodopsin family)
258676	7 transmembrane receptor (rhodopsin family)
258677	7 transmembrane receptor (rhodopsin family)
258678	7 transmembrane receptor (rhodopsin family)
258679	7 transmembrane receptor (rhodopsin family)
258680	7 transmembrane receptor (rhodopsin family)
258681	7 transmembrane receptor (rhodopsin family)
258682	7 transmembrane receptor (rhodopsin family)
258683	7 transmembrane receptor (rhodopsin family)
258684	7 transmembrane receptor (rhodopsin family)
258685	7 transmembrane receptor (rhodopsin family)
258686	7 transmembrane receptor (rhodopsin family)
258687	7 transmembrane receptor (rhodopsin family)
258688	7 transmembrane receptor (rhodopsin family)
258689	7 transmembrane receptor (rhodopsin family)
258690	7 transmembrane receptor (rhodopsin family)
258691	7 transmembrane receptor (rhodopsin family)
258692	7 transmembrane receptor (rhodopsin family)
258693	7 transmembrane receptor (rhodopsin family)
258694	7 transmembrane receptor (rhodopsin family)
258695	7 transmembrane receptor (rhodopsin family)
258696	7 transmembrane receptor (rhodopsin family)
258697	7 transmembrane receptor (rhodopsin family)
258698	7 transmembrane receptor (rhodopsin family)
258699	7 transmembrane receptor (rhodopsin family)
258700	7 transmembrane receptor (rhodopsin family)
258701	7 transmembrane receptor (rhodopsin family)
258702	7 transmembrane receptor (rhodopsin family)
258703	7 transmembrane receptor (rhodopsin family)
258704	7 transmembrane receptor (rhodopsin family)
258705	7 transmembrane receptor (rhodopsin family)
258706	7 transmembrane receptor (rhodopsin family)
258707	7 transmembrane receptor (rhodopsin family)
258708	7 transmembrane receptor (rhodopsin family)
258709	7 transmembrane receptor (rhodopsin family)
258710	7 transmembrane receptor (rhodopsin family)
258711	7 transmembrane receptor (rhodopsin family)
258712	7 transmembrane receptor (rhodopsin family)
258712	7 transmembrane receptor (rhodopsin family)
258713	7 transmembrane receptor (rhodopsin family)
258714	7 transmembrane receptor (rhodopsin family)
258715	7 transmembrane receptor (rhodopsin family)
258716	7 transmembrane receptor (rhodopsin family)
258717	7 transmembrane receptor (rhodopsin family)
258718	7 transmembrane receptor (rhodopsin family)
258719	7 transmembrane receptor (rhodopsin family)
258720	7 transmembrane receptor (rhodopsin family)
258721	7 transmembrane receptor (rhodopsin family)
258722	7 transmembrane receptor (rhodopsin family)
258723	7 transmembrane receptor (rhodopsin family)
258723	7 transmembrane receptor (rhodopsin family)
258724	7 transmembrane receptor (rhodopsin family)
258725	7 transmembrane receptor (rhodopsin family)
258726	7 transmembrane receptor (rhodopsin family)
258727	7 transmembrane receptor (rhodopsin family)
258728	7 transmembrane receptor (rhodopsin family)
258729	7 transmembrane receptor (rhodopsin family)
258730	7 transmembrane receptor (rhodopsin family)
258731	7 transmembrane receptor (rhodopsin family)
258732	7 transmembrane receptor (rhodopsin family)
258733	7 transmembrane receptor (rhodopsin family)
258734	7 transmembrane receptor (rhodopsin family)
258735	7 transmembrane receptor (rhodopsin family)
258736	7 transmembrane receptor (rhodopsin family)
258737	7 transmembrane receptor (rhodopsin family)
258738	7 transmembrane receptor (rhodopsin family)
258739	7 transmembrane receptor (rhodopsin family)
258740	7 transmembrane receptor (rhodopsin family)
258741	7 transmembrane receptor (rhodopsin family)
258742	7 transmembrane receptor (rhodopsin family)
258742	7 transmembrane receptor (rhodopsin family)
258743	7 transmembrane receptor (rhodopsin family)
258744	7 transmembrane receptor (rhodopsin family)
258745	7 transmembrane receptor (rhodopsin family)
258746	7 transmembrane receptor (rhodopsin family)
258747	7 transmembrane receptor (rhodopsin family)
258748	7 transmembrane receptor (rhodopsin family)
258749	7 transmembrane receptor (rhodopsin family)
258750	7 transmembrane receptor (rhodopsin family)
258751	7 transmembrane receptor (rhodopsin family)
258752	7 transmembrane receptor (rhodopsin family)
258753	7 transmembrane receptor (rhodopsin family)
258754	7 transmembrane receptor (rhodopsin family)
258755	7 transmembrane receptor (rhodopsin family)
258755	7 transmembrane receptor (rhodopsin family)
258756	7 transmembrane receptor (rhodopsin family)
258757	7 transmembrane receptor (rhodopsin family)
258758	7 transmembrane receptor (rhodopsin family)
258759	7 transmembrane receptor (rhodopsin family)
258760	7 transmembrane receptor (rhodopsin family)
258762	7 transmembrane receptor (rhodopsin family)
258763	7 transmembrane receptor (rhodopsin family)
258764	7 transmembrane receptor (rhodopsin family)
258765	7 transmembrane receptor (rhodopsin family)
258766	7 transmembrane receptor (rhodopsin family)
258767	7 transmembrane receptor (rhodopsin family)
258768	7 transmembrane receptor (rhodopsin family)
258769	7 transmembrane receptor (rhodopsin family)
258770	7 transmembrane receptor (rhodopsin family)
258771	7 transmembrane receptor (rhodopsin family)
258772	7 transmembrane receptor (rhodopsin family)
258773	7 transmembrane receptor (rhodopsin family)
258774	7 transmembrane receptor (rhodopsin family)
258775	7 transmembrane receptor (rhodopsin family)
258776	7 transmembrane receptor (rhodopsin family)
258777	7 transmembrane receptor (rhodopsin family)
258778	7 transmembrane receptor (rhodopsin family)
258780	7 transmembrane receptor (rhodopsin family)
258781	7 transmembrane receptor (rhodopsin family)
258782	7 transmembrane receptor (rhodopsin family)
258783	7 transmembrane receptor (rhodopsin family)
258784	7 transmembrane receptor (rhodopsin family)
258785	7 transmembrane receptor (rhodopsin family)
258786	7 transmembrane receptor (rhodopsin family)
258787	7 transmembrane receptor (rhodopsin family)
258788	7 transmembrane receptor (rhodopsin family)
258789	7 transmembrane receptor (rhodopsin family)
258790	7 transmembrane receptor (rhodopsin family)
258791	7 transmembrane receptor (rhodopsin family)
258792	7 transmembrane receptor (rhodopsin family)
258793	7 transmembrane receptor (rhodopsin family)
258794	7 transmembrane receptor (rhodopsin family)
258797	7 transmembrane receptor (rhodopsin family)
258798	7 transmembrane receptor (rhodopsin family)
258799	7 transmembrane receptor (rhodopsin family)
258800	7 transmembrane receptor (rhodopsin family)
258801	7 transmembrane receptor (rhodopsin family)
258802	7 transmembrane receptor (rhodopsin family)
258803	7 transmembrane receptor (rhodopsin family)
258804	7 transmembrane receptor (rhodopsin family)
258805	7 transmembrane receptor (rhodopsin family)
258806	7 transmembrane receptor (rhodopsin family)
258807	7 transmembrane receptor (rhodopsin family)
258808	7 transmembrane receptor (rhodopsin family)
258809	7 transmembrane receptor (rhodopsin family)
258810	7 transmembrane receptor (rhodopsin family)
258811	7 transmembrane receptor (rhodopsin family)
258812	7 transmembrane receptor (rhodopsin family)
258814	7 transmembrane receptor (rhodopsin family)
258815	7 transmembrane receptor (rhodopsin family)
258816	7 transmembrane receptor (rhodopsin family)
258817	7 transmembrane receptor (rhodopsin family)
258818	7 transmembrane receptor (rhodopsin family)
258819	7 transmembrane receptor (rhodopsin family)
258820	7 transmembrane receptor (rhodopsin family)
258821	7 transmembrane receptor (rhodopsin family)
258822	7 transmembrane receptor (rhodopsin family)
258823	7 transmembrane receptor (rhodopsin family)
258824	7 transmembrane receptor (rhodopsin family)
258825	7 transmembrane receptor (rhodopsin family)
258826	7 transmembrane receptor (rhodopsin family)
258828	7 transmembrane receptor (rhodopsin family)
258829	7 transmembrane receptor (rhodopsin family)
258830	7 transmembrane receptor (rhodopsin family)
258831	7 transmembrane receptor (rhodopsin family)
258832	7 transmembrane receptor (rhodopsin family)
258833	7 transmembrane receptor (rhodopsin family)
258834	7 transmembrane receptor (rhodopsin family)
258835	7 transmembrane receptor (rhodopsin family)
258836	7 transmembrane receptor (rhodopsin family)
258837	7 transmembrane receptor (rhodopsin family)
258838	7 transmembrane receptor (rhodopsin family)
258840	7 transmembrane receptor (rhodopsin family)
258841	7 transmembrane receptor (rhodopsin family)
258842	7 transmembrane receptor (rhodopsin family)
258843	7 transmembrane receptor (rhodopsin family)
258844	7 transmembrane receptor (rhodopsin family)
258845	7 transmembrane receptor (rhodopsin family)
258846	7 transmembrane receptor (rhodopsin family)
258850	7 transmembrane receptor (rhodopsin family)
258851	7 transmembrane receptor (rhodopsin family)
258852	7 transmembrane receptor (rhodopsin family)
258853	7 transmembrane receptor (rhodopsin family)
258854	7 transmembrane receptor (rhodopsin family)
258855	7 transmembrane receptor (rhodopsin family)
258856	7 transmembrane receptor (rhodopsin family)
258857	7 transmembrane receptor (rhodopsin family)
258859	7 transmembrane receptor (rhodopsin family)
258861	7 transmembrane receptor (rhodopsin family)
258862	7 transmembrane receptor (rhodopsin family)
258863	7 transmembrane receptor (rhodopsin family)
258865	7 transmembrane receptor (rhodopsin family)
258866	7 transmembrane receptor (rhodopsin family)
258867	7 transmembrane receptor (rhodopsin family)
258868	7 transmembrane receptor (rhodopsin family)
258871	7 transmembrane receptor (rhodopsin family)
258872	7 transmembrane receptor (rhodopsin family)
258873	7 transmembrane receptor (rhodopsin family)
258874	7 transmembrane receptor (rhodopsin family)
258875	7 transmembrane receptor (rhodopsin family)
258877	7 transmembrane receptor (rhodopsin family)
258879	7 transmembrane receptor (rhodopsin family)
258880	7 transmembrane receptor (rhodopsin family)
258881	7 transmembrane receptor (rhodopsin family)
258882	7 transmembrane receptor (rhodopsin family)
258883	7 transmembrane receptor (rhodopsin family)
258886	7 transmembrane receptor (rhodopsin family)
258887	7 transmembrane receptor (rhodopsin family)
258888	7 transmembrane receptor (rhodopsin family)
258889	7 transmembrane receptor (rhodopsin family)
258890	7 transmembrane receptor (rhodopsin family)
258891	7 transmembrane receptor (rhodopsin family)
258892	7 transmembrane receptor (rhodopsin family)
258893	7 transmembrane receptor (rhodopsin family)
258894	7 transmembrane receptor (rhodopsin family)
258895	7 transmembrane receptor (rhodopsin family)
258896	7 transmembrane receptor (rhodopsin family)
258897	7 transmembrane receptor (rhodopsin family)
258898	7 transmembrane receptor (rhodopsin family)
258899	7 transmembrane receptor (rhodopsin family)
258900	7 transmembrane receptor (rhodopsin family)
258901	7 transmembrane receptor (rhodopsin family)
258902	7 transmembrane receptor (rhodopsin family)
258903	7 transmembrane receptor (rhodopsin family)
258904	7 transmembrane receptor (rhodopsin family)
258905	7 transmembrane receptor (rhodopsin family)
258907	7 transmembrane receptor (rhodopsin family)
258908	7 transmembrane receptor (rhodopsin family)
258910	7 transmembrane receptor (rhodopsin family)
258912	7 transmembrane receptor (rhodopsin family)
258913	7 transmembrane receptor (rhodopsin family)
258914	7 transmembrane receptor (rhodopsin family)
258915	7 transmembrane receptor (rhodopsin family)
258917	7 transmembrane receptor (rhodopsin family)
258918	7 transmembrane receptor (rhodopsin family)
258919	7 transmembrane receptor (rhodopsin family)
258920	7 transmembrane receptor (rhodopsin family)
258921	7 transmembrane receptor (rhodopsin family)
258922	7 transmembrane receptor (rhodopsin family)
258923	7 transmembrane receptor (rhodopsin family)
258924	7 transmembrane receptor (rhodopsin family)
258925	7 transmembrane receptor (rhodopsin family)
258927	7 transmembrane receptor (rhodopsin family)
258928	7 transmembrane receptor (rhodopsin family)
258929	7 transmembrane receptor (rhodopsin family)
258930	7 transmembrane receptor (rhodopsin family)
258931	7 transmembrane receptor (rhodopsin family)
258932	7 transmembrane receptor (rhodopsin family)
258933	7 transmembrane receptor (rhodopsin family)
258934	7 transmembrane receptor (rhodopsin family)
258935	7 transmembrane receptor (rhodopsin family)
258937	7 transmembrane receptor (rhodopsin family)
258938	7 transmembrane receptor (rhodopsin family)
258939	7 transmembrane receptor (rhodopsin family)
258940	7 transmembrane receptor (rhodopsin family)
258941	7 transmembrane receptor (rhodopsin family)
258942	7 transmembrane receptor (rhodopsin family)
258943	7 transmembrane receptor (rhodopsin family)
258944	7 transmembrane receptor (rhodopsin family)
258945	7 transmembrane receptor (rhodopsin family)
258946	7 transmembrane receptor (rhodopsin family)
258947	7 transmembrane receptor (rhodopsin family)
258949	7 transmembrane receptor (rhodopsin family)
258950	7 transmembrane receptor (rhodopsin family)
258951	7 transmembrane receptor (rhodopsin family)
258952	7 transmembrane receptor (rhodopsin family)
258953	7 transmembrane receptor (rhodopsin family)
258954	7 transmembrane receptor (rhodopsin family)
258955	7 transmembrane receptor (rhodopsin family)
258956	7 transmembrane receptor (rhodopsin family)
258959	7 transmembrane receptor (rhodopsin family)
258960	7 transmembrane receptor (rhodopsin family)
258961	7 transmembrane receptor (rhodopsin family)
258962	7 transmembrane receptor (rhodopsin family)
258963	7 transmembrane receptor (rhodopsin family)
258964	7 transmembrane receptor (rhodopsin family)
258967	7 transmembrane receptor (rhodopsin family)
258968	7 transmembrane receptor (rhodopsin family)
258969	7 transmembrane receptor (rhodopsin family)
258970	7 transmembrane receptor (rhodopsin family)
258971	7 transmembrane receptor (rhodopsin family)
258972	7 transmembrane receptor (rhodopsin family)
258973	7 transmembrane receptor (rhodopsin family)
258974	7 transmembrane receptor (rhodopsin family)
258975	7 transmembrane receptor (rhodopsin family)
258976	7 transmembrane receptor (rhodopsin family)
258977	7 transmembrane receptor (rhodopsin family)
258979	7 transmembrane receptor (rhodopsin family)
258980	7 transmembrane receptor (rhodopsin family)
258981	7 transmembrane receptor (rhodopsin family)
258982	7 transmembrane receptor (rhodopsin family)
258983	7 transmembrane receptor (rhodopsin family)
258985	7 transmembrane receptor (rhodopsin family)
258986	7 transmembrane receptor (rhodopsin family)
258987	7 transmembrane receptor (rhodopsin family)
258988	7 transmembrane receptor (rhodopsin family)
258989	7 transmembrane receptor (rhodopsin family)
258990	7 transmembrane receptor (rhodopsin family)
258991	7 transmembrane receptor (rhodopsin family)
258992	7 transmembrane receptor (rhodopsin family)
258993	7 transmembrane receptor (rhodopsin family)
258995	7 transmembrane receptor (rhodopsin family)
258996	7 transmembrane receptor (rhodopsin family)
258997	7 transmembrane receptor (rhodopsin family)
258998	7 transmembrane receptor (rhodopsin family)
258999	7 transmembrane receptor (rhodopsin family)
259000	7 transmembrane receptor (rhodopsin family)
259001	7 transmembrane receptor (rhodopsin family)
259002	7 transmembrane receptor (rhodopsin family)
259003	7 transmembrane receptor (rhodopsin family)
259004	7 transmembrane receptor (rhodopsin family)
259005	7 transmembrane receptor (rhodopsin family)
259006	7 transmembrane receptor (rhodopsin family)
259007	7 transmembrane receptor (rhodopsin family)
259008	7 transmembrane receptor (rhodopsin family)
259009	7 transmembrane receptor (rhodopsin family)
259010	7 transmembrane receptor (rhodopsin family)
259011	7 transmembrane receptor (rhodopsin family)
259012	7 transmembrane receptor (rhodopsin family)
259013	7 transmembrane receptor (rhodopsin family)
259014	7 transmembrane receptor (rhodopsin family)
259015	7 transmembrane receptor (rhodopsin family)
259016	7 transmembrane receptor (rhodopsin family)
259017	7 transmembrane receptor (rhodopsin family)
259018	7 transmembrane receptor (rhodopsin family)
259019	7 transmembrane receptor (rhodopsin family)
259020	7 transmembrane receptor (rhodopsin family)
259021	7 transmembrane receptor (rhodopsin family)
259022	7 transmembrane receptor (rhodopsin family)
259023	7 transmembrane receptor (rhodopsin family)
259024	7 transmembrane receptor (rhodopsin family)
259025	7 transmembrane receptor (rhodopsin family)
259026	7 transmembrane receptor (rhodopsin family)
259027	7 transmembrane receptor (rhodopsin family)
259030	7 transmembrane receptor (rhodopsin family)
259031	7 transmembrane receptor (rhodopsin family)
259032	7 transmembrane receptor (rhodopsin family)
259033	7 transmembrane receptor (rhodopsin family)
259034	7 transmembrane receptor (rhodopsin family)
259035	7 transmembrane receptor (rhodopsin family)
259036	7 transmembrane receptor (rhodopsin family)
259037	7 transmembrane receptor (rhodopsin family)
259038	7 transmembrane receptor (rhodopsin family)
259039	7 transmembrane receptor (rhodopsin family)
259040	7 transmembrane receptor (rhodopsin family)
259041	7 transmembrane receptor (rhodopsin family)
259042	7 transmembrane receptor (rhodopsin family)
259042	7 transmembrane receptor (rhodopsin family)
259044	7 transmembrane receptor (rhodopsin family)
259045	7 transmembrane receptor (rhodopsin family)
259046	7 transmembrane receptor (rhodopsin family)
259047	7 transmembrane receptor (rhodopsin family)
259048	7 transmembrane receptor (rhodopsin family)
259049	7 transmembrane receptor (rhodopsin family)
259050	7 transmembrane receptor (rhodopsin family)
259051	7 transmembrane receptor (rhodopsin family)
259052	7 transmembrane receptor (rhodopsin family)
259053	7 transmembrane receptor (rhodopsin family)
259054	7 transmembrane receptor (rhodopsin family)
259055	7 transmembrane receptor (rhodopsin family)
259056	7 transmembrane receptor (rhodopsin family)
259057	7 transmembrane receptor (rhodopsin family)
259058	7 transmembrane receptor (rhodopsin family)
259061	7 transmembrane receptor (rhodopsin family)
259062	7 transmembrane receptor (rhodopsin family)
259063	7 transmembrane receptor (rhodopsin family)
259064	7 transmembrane receptor (rhodopsin family)
259066	7 transmembrane receptor (rhodopsin family)
259067	7 transmembrane receptor (rhodopsin family)
259069	7 transmembrane receptor (rhodopsin family)
259070	7 transmembrane receptor (rhodopsin family)
259071	7 transmembrane receptor (rhodopsin family)
259072	7 transmembrane receptor (rhodopsin family)
259073	7 transmembrane receptor (rhodopsin family)
259075	7 transmembrane receptor (rhodopsin family)
259078	7 transmembrane receptor (rhodopsin family)
259080	7 transmembrane receptor (rhodopsin family)
259081	7 transmembrane receptor (rhodopsin family)
259082	7 transmembrane receptor (rhodopsin family)
259083	7 transmembrane receptor (rhodopsin family)
259084	7 transmembrane receptor (rhodopsin family)
259085	7 transmembrane receptor (rhodopsin family)
259086	7 transmembrane receptor (rhodopsin family)
259087	7 transmembrane receptor (rhodopsin family)
259088	7 transmembrane receptor (rhodopsin family)
259089	7 transmembrane receptor (rhodopsin family)
259090	7 transmembrane receptor (rhodopsin family)
259091	7 transmembrane receptor (rhodopsin family)
259092	7 transmembrane receptor (rhodopsin family)
259093	7 transmembrane receptor (rhodopsin family)
259094	7 transmembrane receptor (rhodopsin family)
259095	7 transmembrane receptor (rhodopsin family)
259096	7 transmembrane receptor (rhodopsin family)
259097	7 transmembrane receptor (rhodopsin family)
259098	7 transmembrane receptor (rhodopsin family)
259099	7 transmembrane receptor (rhodopsin family)
259100	7 transmembrane receptor (rhodopsin family)
259101	7 transmembrane receptor (rhodopsin family)
259102	7 transmembrane receptor (rhodopsin family)
259103	7 transmembrane receptor (rhodopsin family)
259104	7 transmembrane receptor (rhodopsin family)
259105	7 transmembrane receptor (rhodopsin family)
259106	7 transmembrane receptor (rhodopsin family)
259107	7 transmembrane receptor (rhodopsin family)
259108	7 transmembrane receptor (rhodopsin family)
259109	7 transmembrane receptor (rhodopsin family)
259110	7 transmembrane receptor (rhodopsin family)
259111	7 transmembrane receptor (rhodopsin family)
259112	7 transmembrane receptor (rhodopsin family)
259113	7 transmembrane receptor (rhodopsin family)
259114	7 transmembrane receptor (rhodopsin family)
259115	7 transmembrane receptor (rhodopsin family)
259116	7 transmembrane receptor (rhodopsin family)
259117	7 transmembrane receptor (rhodopsin family)
259118	7 transmembrane receptor (rhodopsin family)
259119	7 transmembrane receptor (rhodopsin family)
259122	7 transmembrane receptor (rhodopsin family)
259123	7 transmembrane receptor (rhodopsin family)
259124	7 transmembrane receptor (rhodopsin family)
259125	7 transmembrane receptor (rhodopsin family)
259126	7 transmembrane receptor (rhodopsin family)
259172	CUB domain
259172	Low-density lipoprotein receptor domain class A
259173	Vacuolar sorting protein 9 (VPS9) domain
259173	Vacuolar sorting protein 9 (VPS9) domain
259217	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
259221	Oxysterol-binding protein
259229	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
259232	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
259233	Glutathione peroxidase
259235	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
259241	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
259241	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
259244	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
259245	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
259246	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
259247	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
259249	7 transmembrane receptor (rhodopsin family)
259271	HIT family
259273	Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors
259273	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
259279	Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin
259307	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
259307	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. T
260293	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
260298	Ets-domain
260302	VHS domain. Domain present in VPS-27, Hrs and STAM
260302	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
260302	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
260319	7 transmembrane receptor (rhodopsin family)
260321	TPR Domain
260321	FKBP-type peptidyl-prolyl cis-trans isomerase
260323	PX domain. PX domains bind to phosphoinositides
260326	7 transmembrane receptor (Secretin family)
260326	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
260430	7 transmembrane receptor (metabotropic glutamate family)
260434	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
261726	TIP41-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 interacts with TAP42 and negatively regulates the TOR signaling pathway
261737	Tricarboxylate carrier
263803	Hr1 repeat
263803	Protein kinase domain
263803	Protein kinase C terminal domain
266459	Eukaryotic initiation factor 1A
266600	'Cold-shock' DNA-binding domain
266601	MAS20 protein import receptor
266602	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tand
266603	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyd
266604	Lipoxygenase
266604	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
266607	7 transmembrane receptor (rhodopsin family)
266608	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
266610	Death domain
266610	Death effector domain
266612	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
266629	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
266632	Death domain
266632	Protein kinase domain
266632	Protein kinase domain
266632	Protein kinase domain
266668	Uncharacterized protein family UPF0005
266674	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
266675	Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown
266680	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
266682	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
266684	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
266685	UDP-glucoronosyl and UDP-glucosyl transferase
266686	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
266688	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
266689	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
266690	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
266690	Oxidoreductase FAD-binding domain
266690	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
266690	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
266692	C2 domain
266692	C2 domain
266707	Connexin
266707	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
266709	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
266715	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
266717	Intermediate filament protein
266727	MAM domain. An extracellular domain found in many receptors
266729	Phosphotyrosine interaction domain (PTB/PID)
266735	7 transmembrane receptor (Secretin family)
266735	7 transmembrane receptor (Secretin family)
266735	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
266735	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are
266738	Homeobox domain
266740	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
266758	Signal peptidase I
266759	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase dom
266760	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
266762	7 transmembrane receptor (rhodopsin family)
266763	Sodium:dicarboxylate symporter family
266765	Fibronectin type III domain
266765	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
266769	7 transmembrane receptor (rhodopsin family)
266771	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
266772	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
266773	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
266778	7 transmembrane receptor (metabotropic glutamate family)
266780	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
266781	PX domain. PX domains bind to phosphoinositides
266782	Somatotropin hormone family
266804	Somatotropin hormone family
266807	Animal haem peroxidase
266807	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mou
266812	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
266813	Homeobox domain
266815	Class I Histocompatibility antigen, domains alpha 1 and 2
266815	Class I Histocompatibility antigen, domains alpha 1 and 2
266975	tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt
266977	7 transmembrane receptor (Secretin family)
266998	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
267019	Ribosomal protein S8
268278	Ribosomal protein S5, N-terminal domain
268278	Ribosomal protein S5, C-terminal domain
268281	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
268281	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
268281	Zinc finger, C3HC4 type (RING finger)
268281	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
268281	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
268289	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
268294	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
268294	Adenylate kinase
268294	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
268294	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
268297	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
268299	Helix-loop-helix DNA-binding domain
268299	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
268299	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
268301	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
268308	Ribosomal protein L19e
268311	Ribosomal L15
268312	ENV polyprotein (coat polyprotein)
268319	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
268320	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
268321	Nucleoside diphosphate kinase
268326	NB-ARC domain
268326	WD domain, G-beta repeat
268326	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
268329	Laminin N-terminal (Domain VI)
268329	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
268329	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
268330	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
268330	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
268336	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
268337	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
268340	PhoH-like protein. PhoH is a cytoplasmic protein and predicted ATPase that is induced by phosphate starvation
268345	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
268345	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
268347	Ribosomal protein S19
268353	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
268353	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
268358	Ribosomal protein S2
268360	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
268361	7 transmembrane receptor (rhodopsin family)
268363	7 transmembrane receptor (rhodopsin family)
268373	Cyclophilin type peptidyl-prolyl cis-trans isomerase
268379	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
268385	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
268385	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
268385	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
268395	Methylpurine-DNA glycosylase (MPG). Methylpurine-DNA glycosylase is a base excision-repair protein. It is responsible for the hydrolysis of the deoxyribose N-glycosidic bond, excising 3-methyladenine and 3-methylguanine from damaged DNA
268396	PX domain. PX domains bind to phosphoinositides
268396	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
268398	Plant PEC family metallothionein
268398	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
268398	Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up int
268406	Kinesin motor domain
268413	7 transmembrane receptor (rhodopsin family)
268415	Core histone H2A/H2B/H3/H4
268417	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
268424	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
268426	AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt
268428	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
268428	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
268429	Ribosomal protein L13e
268435	SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins
268439	7 transmembrane receptor (rhodopsin family)
268444	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
268448	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
268448	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
268449	Ribosomal protein L23
268449	Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a
268452	Cyclin-dependent kinase 5 activator protein
268460	Animal haem peroxidase
268463	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
268467	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
268470	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lack
268474	PH domain. PH stands for pleckstrin homology
268477	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
268480	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
268481	Intermediate filament protein
268482	Intermediate filament protein
268482	Intermediate filament protein
268484	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
268493	TPR Domain
268495	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
268495	Integrins, beta chain. Sequences cut off at repeats due to overlap with EGF
268497	C2 domain
268497	Protein kinase domain
268497	Protein kinase C terminal domain
268497	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
268504	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
268512	Sulfate transporter family. Mutations may lead to several human diseases
268512	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
268515	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
268520	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
268522	Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction
268524	Ribosomal protein S19
268529	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
268539	Ribosomal protein S5, N-terminal domain
268542	BED zinc finger
268542	pfam02892, zf-BED, BED zinc finger
268543	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
268543	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
268544	Ribosomal protein S26e
268545	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
268545	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
268550	Ribosomal protein L13e
268552	Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
268552	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
268553	Cyclophilin type peptidyl-prolyl cis-trans isomerase
268555	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
268555	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
268557	Vinculin family
268564	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
268564	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
268565	Uncharacterised protein family (UPF0041)
268573	Ribosomal protein L35Ae
268576	Ribosomal protein L21e
268584	NAC domain
268589	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
268589	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
268591	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
268593	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
268602	HMG (high mobility group) box
268604	Protein kinase domain
268604	Protein kinase C terminal domain
268604	PH domain. PH stands for pleckstrin homology
268612	L1 transposable element
268612	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
268614	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
268622	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
268623	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
268625	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
268633	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
268644	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
268650	Ribosomal S17
268656	Aminotransferase class I and II
268657	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
268663	Cadherin domain
268665	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an
268667	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
268676	Ribosomal protein L21e
268677	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
268681	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
268681	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
268686	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
268687	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
268695	Ribosomal protein L15
268700	Ribosomal S3Ae family
268706	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
268706	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
268708	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
268708	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
268713	PH domain. PH stands for pleckstrin homology
268714	Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in
268728	Sugar (and other) transporter
268728	LacY proton/sugar symporter. This family is closely related to the sugar transporter family
268729	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
268741	HMG (high mobility group) box
268742	Amino acid permease
268742	Tryptophan/tyrosine permease family
268746	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
268747	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
268752	WD domain, G-beta repeat
268752	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
268756	FAD binding domain. This family consists of various enzymes that use FAD as a co-factor, most of the enzymes are similar to oxygen oxidoreductase. One of the enzymes Vanillyl-alcohol oxidase (VAO) has a solved structure, the alignment includes the FAD bi
268757	Ribosomal protein S5, C-terminal domain
268762	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
268766	HMG (high mobility group) box
268767	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
268772	Glypican
268772	Uncharacterised protein (DUF314)
268776	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
268777	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
268777	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
268780	Fibronectin type III domain
268780	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
268780	Fibronectin type III domain
268780	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
268780	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
268782	Aminotransferase class-III
268790	PH domain. PH stands for pleckstrin homology
268790	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
268795	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
268798	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
268801	Translation initiation factor SUI1
268802	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
268807	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
268807	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet
268807	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
268807	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
268818	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
268831	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
268838	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
268843	Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin
268848	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
268855	Peptidase family C50
268857	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
268857	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
268859	Cullin family
268859	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
268860	Aminotransferase class-III
268869	7 transmembrane receptor (rhodopsin family)
268877	KOW motif. This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG
268879	Helix-loop-helix DNA-binding domain
268880	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
268882	F-box domain
268882	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
268884	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
268884	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
268885	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this fami
268886	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
268887	Ribosomal protein L6
268890	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
268891	Ribosomal protein S24e
268893	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
268895	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
268898	Ribosomal protein S21e
268899	Actin
268910	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
268921	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
268923	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
268923	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
268923	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
268923	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
268925	7 transmembrane receptor (metabotropic glutamate family)
268925	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
268927	7 transmembrane receptor (metabotropic glutamate family)
268928	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
268932	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
268933	WD domain, G-beta repeat
268935	CUB domain
268939	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
268940	Homeobox domain
268941	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
268941	Notch (DSL) domain. The Notch domain is also called the 'DSL' domain. The notch proteins are transmembrane proteins with extracellular domains of repeated EGF domains and the notch (or DSL) domain N-terminal to that. These proteins are generally involved
268945	Class II histocompatibility antigen, beta domain
268945	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
268955	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
268958	Calpain family cysteine protease
268964	Ribosomal protein L21e
268967	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
268977	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
268977	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
268980	WD domain, G-beta repeat
268981	CBF/Mak21 family
268986	GTP1/OBG family
268986	GTPase of unknown function
268988	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
268992	HMG (high mobility group) box
269002	Ribosomal protein L21e
269010	L1 transposable element
269010	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
269019	Protein kinase domain
269035	Proteasome A-type and B-type
269036	Myosin head (motor domain)
269038	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
269043	Ribosomal protein S8e
269044	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
269045	7 transmembrane receptor (rhodopsin family)
269047	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
269047	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
269049	Calpain family cysteine protease
269049	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by
269053	7 transmembrane receptor (rhodopsin family)
269063	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
269064	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
269064	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
269066	7 transmembrane receptor (rhodopsin family)
269069	7 transmembrane receptor (rhodopsin family)
269070	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
269075	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
269075	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
269079	Occludin/ELL family
269079	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
269079	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
269083	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
269085	Ribosomal L15
269086	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
269088	Homeobox domain
269091	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
269100	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
269103	Inward rectifier potassium channel
269109	Dienelactone hydrolase family
269109	Prolyl oligopeptidase family
269109	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
269109	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
269114	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whic
269116	Fibronectin type III domain
269116	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
269121	Ribosomal protein S12
269132	Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of va
269132	Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of var
269137	Sushi domain (SCR repeat)
269140	SH2 domain
269146	Pentaxin family. Pentaxins are also known as pentraxins
269155	Ribosomal protein L21e
269156	Ribosomal protein L21e
269163	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
269166	Ribosomal protein L15
269173	Thrombospondin N-terminal -like domain
269173	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
269190	Ribosomal L15
269191	Ribosomal protein S24e
269193	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
269193	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
269198	Beige/BEACH domain
269200	Ribosomal L18ae protein family
269204	Tau and MAP protein, tubulin-binding repeat
269207	Ribosomal L15
269211	ENV polyprotein (coat polyprotein)
269211	ENV polyprotein (coat polyprotein)
269213	L1 transposable element
269213	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
269224	Protein kinase domain
269224	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
269227	Phosphatidylinositol transfer protein
269229	Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold
269229	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
269229	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
269230	Intermediate filament protein
269232	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
269242	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
269242	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
269243	Protein kinase domain
269261	Ribosomal protein L11, RNA binding domain
269261	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
269265	Ribosomal S3Ae family
269267	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
269268	ENV polyprotein (coat polyprotein)
269275	Protein kinase domain
269275	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with t
269278	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
269280	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
269282	Protein kinase domain
269283	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
269283	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
269283	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
269285	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
269289	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
269297	7 transmembrane receptor (rhodopsin family)
269300	7 transmembrane receptor (rhodopsin family)
269308	7 transmembrane receptor (rhodopsin family)
269309	7 transmembrane receptor (rhodopsin family)
269310	7 transmembrane receptor (rhodopsin family)
269311	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
269311	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
269312	7 transmembrane receptor (rhodopsin family)
269316	Fibronectin type III domain
269316	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
269319	Enolase, N-terminal domain
269319	Enolase, C-terminal TIM barrel domain
269325	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
269325	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
269338	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
269338	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
269338	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
269346	Na+ dependent nucleoside transporter. This family consists of nucleoside transport proteins, like a purine-specific Na+-nucleoside cotransporter localized to the bile canalicular membrane, or a Na+-dependent nucleoside transporter selective for pyrimidin
269353	Cyclophilin type peptidyl-prolyl cis-trans isomerase
269354	Transglutaminase family
269354	Transglutaminase family, C-terminal ig like domain
269354	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
269356	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
269361	HMG (high mobility group) box
269363	Ribosomal L15
269364	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
269365	Ribosomal protein S19e
269366	Uncharacterized ACR, COG1579
269366	Intermediate filament protein
269366	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
269367	Ribosomal protein L35Ae
269369	Cyclophilin type peptidyl-prolyl cis-trans isomerase
269369	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
269371	Ribosomal protein L21e
269373	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
269378	S-adenosyl-L-homocysteine hydrolase
269378	S-adenosyl-L-homocysteine hydrolase
269378	Alanine dehydrogenase/pyridine nucleotide transhydrogenase. This family now also contains the lysine 2-oxoglutarate reductases
269378	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
269380	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
269380	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
269380	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family
269380	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases
269384	LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
269384	LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
269384	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
269406	Protein kinase domain
269406	ENV polyprotein (coat polyprotein)
269410	Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
269410	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
269413	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
269414	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
269415	Myosin head (motor domain)
269415	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
269415	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
269416	Ribosomal protein L15
269425	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
269427	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
269432	Ribosomal protein S17
269433	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
269434	ENV polyprotein (coat polyprotein)
269435	Ribosomal protein S12
269444	Sodium / potassium ATPase beta chain
269445	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
269452	Extracellular link domain
269452	Lectin C-type domain. This family includes both long and short form C-type
269452	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
269452	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
269466	Thi4 family. This family includes a putative thiamine biosynthetic enzyme
269466	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
269466	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
269466	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
269470	Dip2/Utp12 Family. This domain is found at the C-terminus of proteins containing WD40 repeats. These proteins are part of the U3 ribonucleoprotein the yeast protein is called Utp12 or DIP2
269470	WD domain, G-beta repeat
269473	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
269473	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
269478	Ribosomal protein L13e
269485	Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protein
269487	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str
269497	SH2 domain
269497	PH domain. PH stands for pleckstrin homology
269501	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
269501	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
269504	7 transmembrane receptor (Secretin family)
269504	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
269508	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
269508	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
269511	HMG (high mobility group) box
269512	Glutamine amidotransferase class-I
269514	F-box domain
269523	Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist o
269523	Bacterial dnaA protein
269523	Sigma-54 interaction domain
269523	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
269523	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
269523	Parvovirus non-structural protein NS1. This family also contains the NS2 protein. Parvoviruses encode two non-structural proteins, NS1 and NS2. The mRNA for NS2 contains the coding sequence for the first 87 amino acids of NS1, then by an alternative splic
269523	Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist of
269523	Bacterial dnaA protein
269523	Sigma-54 interaction domain
269523	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
269523	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
269523	Parvovirus non-structural protein NS1. This family also contains the NS2 protein. Parvoviruses encode two non-structural proteins, NS1 and NS2. The mRNA for NS2 contains the coding sequence for the first 87 amino acids of NS1, then by an alternative splic
269523	Cell division protein 48 (CDC48), N-terminal domain. This domain has a double psi-beta barrel fold and includes VCP-like ATPase and N-ethylmaleimide sensitive fusion protein N-terminal domains. Both the VAT and NSF N-terminal functional domains consist of
269527	L1 transposable element
269527	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
269529	F-box domain
269533	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
269534	Ribosomal S17
269538	Ribosomal L29e protein family
269540	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
269541	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
269542	ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER
269548	Ribosomal protein S6e
269552	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
269558	Respiratory-chain NADH dehydrogenase, 49 Kd subunit
269569	Cyclophilin type peptidyl-prolyl cis-trans isomerase
269579	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
269585	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
269587	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
269589	C2 domain
269589	C2 domain
269590	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
269591	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
269593	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
269594	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
269595	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
269597	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
269604	7 transmembrane receptor (Secretin family)
269604	7 transmembrane receptor (Secretin family)
269604	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are
269608	PH domain. PH stands for pleckstrin homology
269608	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
269611	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
269611	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
269611	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
269611	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
269614	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isofor
269615	PH domain. PH stands for pleckstrin homology
269615	PH domain. PH stands for pleckstrin homology
269615	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
269620	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
269622	Ribosomal family S4e
269625	Intermediate filament protein
269628	S-adenosylmethionine synthetase, central domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold
269628	S-adenosylmethionine synthetase, C-terminal domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold
269635	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
269636	Homeobox domain
269642	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
269650	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
269664	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
269666	L1 transposable element
269666	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
269674	Ribosomal protein L6
269691	14-3-3 protein
269697	Homeobox domain
269697	LIM domain. This family represents two copies of the LIM structural domain
269700	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
269709	Protein kinase domain
269713	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
269713	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
269713	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
269713	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
269715	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
269715	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
269715	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
269719	Ribosomal protein S7e
269725	Elongation factor 1 gamma, conserved domain
269725	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
269732	Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5
269732	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
269735	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
269740	Homeobox domain
269742	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
269743	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
269744	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
269745	7 transmembrane receptor (rhodopsin family)
269745	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
269750	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
269751	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
269754	Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018
269754	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
269756	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
269758	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
269758	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
269758	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
269758	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
269758	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
269763	Tubulin binding cofactor A
269765	HMG (high mobility group) box
269769	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
269769	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
269771	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
269771	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
269772	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
269773	Protein kinase domain
269783	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
269784	Fibronectin type III domain
269786	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
269792	Core histone H2A/H2B/H3/H4
269798	Phosphoribulokinase / Uridine kinase family
269799	Lectin C-type domain. This family includes both long and short form C-type
269804	Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
269804	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
269807	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
269813	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
269813	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
269819	L1 transposable element
269819	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
269823	Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity.
269830	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
269831	Tetraspanin family
269836	von Willebrand factor type D domain
269836	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
269841	ENV polyprotein (coat polyprotein)
269845	Fructose-bisphosphate aldolase class-I
269845	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
269855	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
269856	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
269857	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
269858	7 transmembrane receptor (rhodopsin family)
269858	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
269859	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
269862	7 transmembrane receptor (rhodopsin family)
269863	ENV polyprotein (coat polyprotein)
269863	7 transmembrane receptor (rhodopsin family)
269863	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
269863	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
269863	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
269863	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
269863	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
269864	WD domain, G-beta repeat
269865	Translation initiation factor SUI1
269870	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
269870	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
269870	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
269870	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
269870	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
269871	Sulfotransferase protein
269873	Ribosomal S3Ae family
269874	Sodium:dicarboxylate symporter family
269877	Ets-domain
269881	Protein kinase domain
269886	PH domain. PH stands for pleckstrin homology
269886	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
269894	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
269894	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
269902	7 transmembrane receptor (metabotropic glutamate family)
269902	7 transmembrane receptor (metabotropic glutamate family)
269905	Trypsin
269906	Trypsin
269907	Trypsin
269913	Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes
269919	7 transmembrane receptor (rhodopsin family)
269920	ENV polyprotein (coat polyprotein)
269928	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
269932	SH2 domain
269932	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
269939	Guanylate kinase
269939	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
269939	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
269940	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
269941	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
269943	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
269945	Sushi domain (SCR repeat)
269945	Extracellular link domain
269945	Lectin C-type domain. This family includes both long and short form C-type
269948	Translation initiation factor SUI1
269949	Helix-loop-helix DNA-binding domain
269954	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
269957	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b
269959	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
269968	7 transmembrane receptor (rhodopsin family)
269969	7 transmembrane receptor (rhodopsin family)
269971	7 transmembrane receptor (rhodopsin family)
269972	7 transmembrane receptor (rhodopsin family)
269973	7 transmembrane receptor (rhodopsin family)
269974	7 transmembrane receptor (rhodopsin family)
269980	Ubiquitin carboxyl-terminal hydrolase family 2
269981	7 transmembrane receptor (rhodopsin family)
269982	7 transmembrane receptor (rhodopsin family)
269989	KOW motif. This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG
269990	Eukaryotic porin
269994	PMP-22/EMP/MP20/Claudin family
269997	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
270002	metallopeptidase family M24
270004	Fork head domain
270006	Ribosomal protein L24e
270009	7 transmembrane receptor (rhodopsin family)
270014	Protein kinase domain
270017	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
270017	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
270017	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
270017	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
270017	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
270026	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
270026	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
270030	Ribosomal S3Ae family
270033	Ribosomal protein L19e
270038	GTP1/OBG family
270038	GTPase of unknown function
270038	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
270040	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
270046	PMP-22/EMP/MP20/Claudin family
270049	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
270049	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
270055	Sushi domain (SCR repeat)
270055	Extracellular link domain
270055	Lectin C-type domain. This family includes both long and short form C-type
270055	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
270055	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The family is difficult to model due to many similar but different sub-types o
270056	3'5'-cyclic nucleotide phosphodiesterase
270057	Fibronectin type III domain
270082	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
270083	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
270089	Carboxylesterase
270093	Ribosomal protein S24e
270096	SAND family protein. Members of this family are uncharacterised proteins that have been called SAND proteins. These protein do not contain a SAND domain. The members of this family are 500-600 amino acids long and contain several conserved motifs
270099	Ribosomal S3Ae family
270102	Amino acid permease
270104	Translation initiation factor SUI1
270105	Cadherin domain
270105	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
270106	Ribosomal protein L13e
270106	Ribosomal protein L13e
270109	Competence protein. Members of this family are integral membrane proteins with 6 predicted transmembrane helices. Some members of this family have been shown to be essential for bacterial competence in uptake of extracellular DNA. These proteins may tran
270112	CUB domain
270112	MAM domain. An extracellular domain found in many receptors
270112	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
270119	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
270121	Ribosomal protein L21e
270140	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
270144	7 transmembrane receptor (rhodopsin family)
270145	7 transmembrane receptor (rhodopsin family)
270147	ENV polyprotein (coat polyprotein)
270147	7 transmembrane receptor (rhodopsin family)
270147	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
270148	Ribosomal protein S7e
270149	Zona pellucida-like domain
270149	von Willebrand factor type D domain
270149	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
270151	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two doma
270155	NNMT/PNMT/TEMT family
270160	ADP-ribosylation factor family
270160	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
270163	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
270164	ADP-ribosylation factor family
270166	Sigma-54 interaction domain
270166	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
270169	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
270174	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
270174	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
270183	DedA family. This family combines the DedA related proteins and YIAN/YGIK family. Members of this family are not functionally characterised. These proteins contain multiple predicted transmembrane regions
270187	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
270190	Fibronectin type III domain
270190	Giardia variant-specific surface protein
270190	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
270192	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
270192	ADP-ribosylation factor family
270192	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
270192	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
270193	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
270198	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
270201	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
270214	Peptidase family C54
270220	Ribosomal protein L23
270230	Core histone H2A/H2B/H3/H4
270231	Sec61beta family. This family consists of homologues of Sec61beta - a component of the Sec61/SecYEG protein secretory system. The domain is found in eukaryotes and archaea and is possibly homologous to the bacterial SecG
270234	Ribosomal protein L23
270236	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
270243	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
270243	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
270251	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
270263	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
270275	NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5
270277	ENV polyprotein (coat polyprotein)
270289	7 transmembrane receptor (rhodopsin family)
270289	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
270309	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
270309	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
270319	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
270343	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
270344	Homeobox domain
270355	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
270358	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
270362	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
270363	UDP-glucoronosyl and UDP-glucosyl transferase
270363	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
270364	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
270366	Hsp90 protein
270366	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
270372	Ribosomal protein L19e
270379	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
270381	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
270384	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
270392	Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat
270407	Ribosomal protein L21e
270409	Ribosomal protein L15
270417	Intermediate filament protein
270417	Laminin N-terminal (Domain VI)
270417	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
270417	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
270417	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
270417	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
270425	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
270426	7 transmembrane receptor (rhodopsin family)
270426	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
270434	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
270437	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
270448	7 transmembrane receptor (rhodopsin family)
270461	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
270468	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
270470	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
270473	7 transmembrane receptor (rhodopsin family)
270491	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
270498	7 transmembrane receptor (rhodopsin family)
270499	Ribosomal protein L23
270500	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
270502	7 transmembrane receptor (rhodopsin family)
270502	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
270504	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
270504	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
270506	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
270507	7 transmembrane receptor (rhodopsin family)
270529	Protein kinase domain
270535	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
270539	7 transmembrane receptor (rhodopsin family)
270541	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
270566	Ribosomal protein L23
270567	Ribosomal protein L23
270575	Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes
270580	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
270582	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
270582	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
270584	Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a
270588	L1 transposable element
270589	L1 transposable element
270589	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
270596	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
270600	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
270602	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
270612	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
270612	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
270617	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
270617	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
270619	Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase
270621	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
270624	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members
270624	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
270625	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
270627	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
270636	ADP-ribosylation factor family
270636	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
270637	Ribosomal protein L11, RNA binding domain
270639	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
270639	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
270639	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
270639	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
270640	Hsp90 protein
270643	ENV polyprotein (coat polyprotein)
270643	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
270643	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
270658	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
270658	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
270660	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
270665	HMG (high mobility group) box
270667	Zinc finger, C3HC4 type (RING finger)
270669	Peptidase family M50
270670	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
270672	Protein kinase domain
270685	Formate--tetrahydrofolate ligase
270734	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
270742	Ribosomal protein L14p/L23e
270749	Trypsin
270757	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
270757	LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
270757	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
270763	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
270764	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
270784	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
270804	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
270822	Actin
270829	Ribosomal protein S2
270984	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
270984	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
270984	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
270986	Ribosomal protein S5, C-terminal domain
270986	Ribosomal protein S5, N-terminal domain
270988	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
270997	Homeobox domain
270999	Actin
271003	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
271005	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
271021	Core histone H2A/H2B/H3/H4
271022	Eukaryotic initiation factor 1A
271041	Ribosomal L29e protein family
271047	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
271048	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
271068	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
271115	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
271124	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
271127	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
271127	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
271133	HMG (high mobility group) box
271142	Ribosomal protein S8
271147	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
271171	Triosephosphate isomerase
271173	Thymidylate synthase
271202	HMG (high mobility group) box
271215	Phosphatidylethanolamine-binding protein
271228	Ribosomal protein L36e
271264	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
271268	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
271278	Growth-Arrest-Specific Protein 2 Domain
271280	WD domain, G-beta repeat
271294	Ribosomal L29e protein family
271297	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
271299	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
271300	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
271305	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
271350	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
271350	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
271363	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
271364	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
271374	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
271374	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
271375	Cation transport protein. This family consists of various cation transport proteins (Trk) and V-type sodium ATP synthase subunit J or translocating ATPase J EC:3.6.1.34. These proteins are involved in active sodium up-take utilizing ATP in the process. Tr
271395	Kinesin motor domain
271409	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
271415	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
271424	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
271427	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
271427	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
271444	Class I Histocompatibility antigen, domains alpha 1 and 2
271444	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
271451	LIM domain. This family represents two copies of the LIM structural domain
271457	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
271463	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
271489	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
271489	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
271489	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
271490	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
271490	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
271490	Glucuronate isomerase. This is a family of Glucuronate isomerases also known as D-glucuronate isomerase, uronic isomerase, uronate isomerase, or uronic acid isomerase, EC:5.3.1.12. This enzyme catalyses the reactions: D-glucuronate <=> D-fructuronate and
271501	Prp18 domain. The splicing factor Prp18 is required for the second step of pre-mRNA splicing. The structure of a large fragment of the Saccharomyces cerevisiae Prp18 is known. This fragment is fully active in yeast splicing in vitro and includes the seque
271505	'Cold-shock' DNA-binding domain
271508	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
271513	Ribosomal protein S28e
271519	DTW domain. This presumed domain is found in bacterial and eukaryotic proteins. Its function is unknown. The domain contains multiple conserved motifs including a DTXW motif that this domain has been named after
271521	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
271529	Proteasome A-type and B-type
271591	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
271591	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
271634	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
271658	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
271682	Ribosomal protein L6
271697	Protein kinase domain
271707	Protein kinase domain
271708	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
271709	Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain
271786	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
271844	C2 domain
271844	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop
271942	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
271944	C2 domain
271945	Ribosomal protein S5, C-terminal domain
271970	Sulfatase
271976	Eukaryotic ribosomal protein L18
271978	Ribosomal protein S28e
272002	Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars
272009	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
272031	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
272129	Zinc finger, C3HC4 type (RING finger)
272138	ENV polyprotein (coat polyprotein)
272143	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
272144	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
272149	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
272155	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
272158	DNA polymerase family A
272266	Clathrin adaptor complex small chain
272284	Protein kinase domain
272294	Peptidase family C25
272294	Ribosomal protein S8
272311	Lectin C-type domain. This family includes both long and short form C-type
272322	Helix-loop-helix DNA-binding domain
272331	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
272339	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved
272341	Trypsin
272350	Galactoside-binding lectin
272355	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
272355	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
272357	7 transmembrane receptor (rhodopsin family)
272357	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
272378	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
272381	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
272381	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
272382	Ets-domain
272391	HMG (high mobility group) box
272407	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
272411	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
272417	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
272418	7 transmembrane receptor (rhodopsin family)
272422	HMG (high mobility group) box
272428	AMP-binding enzyme
272437	Sulfotransferase protein
272439	Virulence factor MVIN. The MVIN protein is a putative integral membrane protein. The function is unknown
272443	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
272443	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
272443	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
272462	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
272511	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
272518	Ribosomal L39 protein
272551	Domain of unknown function (DUF392). A eukaryotic specific domain of undetermined function.`
272557	Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
272557	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
272603	Ribosomal protein L21e
272633	Mitochondrial carrier protein
272636	C2 domain
272636	C2 domain
272643	Trypsin
272661	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
272667	Protein kinase domain
272667	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
272668	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
272671	Eukaryotic initiation factor 1A
272673	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
272680	Protein kinase domain
272681	Ribosomal protein S17
272683	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
272684	HMG (high mobility group) box
272691	S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea
272692	Eukaryotic initiation factor 1A
272699	Eukaryotic initiation factor 1A
272701	Phosphatidylserine decarboxylase. This is a family of phosphatidylserine decarboxylases, EC:4.1.1.65. These enzymes catalyse the reaction: Phosphatidyl-L-serine <=> phosphatidylethanolamine + CO2. Phosphatidylserine decarboxylase plays a central role in t
272723	7 transmembrane receptor (rhodopsin family)
272740	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
272771	ENV polyprotein (coat polyprotein)
272771	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
272790	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
272811	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
276720	Inward rectifier potassium channel
276731	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
276740	7 transmembrane receptor (rhodopsin family)
276742	7 transmembrane receptor (rhodopsin family)
276743	7 transmembrane receptor (rhodopsin family)
276756	Hsp90 protein
276756	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
276760	Ribosomal protein L14p/L23e
276764	7 transmembrane receptor (rhodopsin family)
276770	Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold
276792	Thymosin beta-4 family
276803	Actin
276804	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
276829	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
276830	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
276834	NAC domain
276837	PH domain. PH stands for pleckstrin homology
276841	Cyclophilin type peptidyl-prolyl cis-trans isomerase
276843	Hsp90 protein
276858	7 transmembrane receptor (rhodopsin family)
276860	7 transmembrane receptor (rhodopsin family)
276864	7 transmembrane receptor (rhodopsin family)
276865	7 transmembrane receptor (rhodopsin family)
276872	Respiratory-chain NADH dehydrogenase, 30 Kd subunit
276877	7 transmembrane receptor (rhodopsin family)
276889	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
276905	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
276905	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
276925	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
276929	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
276929	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
276931	7 transmembrane receptor (rhodopsin family)
276941	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
276995	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
276995	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
276998	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
277001	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
277015	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
277015	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
277016	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
277016	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
277018	GTPase of unknown function
277018	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
277018	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
277027	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
277028	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
277035	Protein kinase domain
277041	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
277046	L1 transposable element
277046	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
277047	L1 transposable element
277047	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
277049	Eukaryotic initiation factor 1A
277051	L1 transposable element
277051	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
277069	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
277089	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
277097	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
277099	Core histone H2A/H2B/H3/H4
277099	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
277103	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
277104	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
277106	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
277108	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
277111	Core histone H2A/H2B/H3/H4
277125	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
277125	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
277128	ENV polyprotein (coat polyprotein)
277131	Domain of unknown function
277131	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
277135	ENV polyprotein (coat polyprotein)
277135	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
277136	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
277137	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
277145	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
277147	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
277148	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
277151	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
277154	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
277160	7 transmembrane receptor (metabotropic glutamate family)
277161	ENV polyprotein (coat polyprotein)
277161	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
277161	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
277161	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
277161	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
277164	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
277170	ENV polyprotein (coat polyprotein)
277183	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
277193	ENV polyprotein (coat polyprotein)
277200	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
277225	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
277225	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
277226	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
277226	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
277226	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
277231	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
277231	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
277231	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
277231	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
277231	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
277234	L1 transposable element
277234	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
277236	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
277236	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
277250	jmjC domain
277258	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
277270	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
277270	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
277274	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
277274	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
277278	L1 transposable element
277278	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
277281	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
277281	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
277283	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
277283	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
277288	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
277313	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
277318	ENV polyprotein (coat polyprotein)
277328	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
277328	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
277328	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
277328	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
277333	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
277333	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
277343	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
277378	ENV polyprotein (coat polyprotein)
277383	7 transmembrane receptor (rhodopsin family)
277389	Cyclophilin type peptidyl-prolyl cis-trans isomerase
277395	Putative zinc finger in N-recognin
277396	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
277396	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
277396	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
277419	7 transmembrane receptor (rhodopsin family)
277481	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
277482	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
277482	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
277489	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
277494	Ribosomal protein L15
277498	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
277511	7 transmembrane receptor (rhodopsin family)
277561	7 transmembrane receptor (rhodopsin family)
277562	7 transmembrane receptor (rhodopsin family)
277566	7 transmembrane receptor (rhodopsin family)
277567	7 transmembrane receptor (rhodopsin family)
277577	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
277603	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
277617	Ribosomal protein L21e
277618	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
277618	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
277618	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
277618	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
277636	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
277637	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
277657	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
277692	Ribosomal protein S19e
277704	Protein kinase domain
277704	Fibronectin type III domain
277704	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
277704	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
277709	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
277709	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
277709	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
277709	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
277752	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
277753	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
277753	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
277754	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
277757	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
277757	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
277768	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
277771	Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
277771	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
277785	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
277785	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
277785	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
277785	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
277786	Respiratory-chain NADH dehydrogenase, 30 Kd subunit
277792	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
277792	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
277792	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
277799	Eukaryotic-type carbonic anhydrase
277807	ADP-ribosylation factor family
277807	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
277819	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
277819	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
277819	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
277819	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
277820	ENV polyprotein (coat polyprotein)
277830	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
277837	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
277837	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
277854	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
277856	Mitochondrial carrier protein
277858	NAD-dependent glycerol-3-phosphate dehydrogenase
277860	Zinc finger, C3HC4 type (RING finger)
277860	YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named
277860	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
277868	Actin
277872	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
277879	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
277879	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
277881	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
277885	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
277885	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
277885	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
277885	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
277898	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
277905	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
277910	Lectin C-type domain. This family includes both long and short form C-type
277916	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
277916	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
277916	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
277916	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
277922	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
277922	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
277923	L1 transposable element
277923	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
277923	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
277923	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
277924	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
277925	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
277925	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
277935	7 transmembrane receptor (rhodopsin family)
277943	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
277949	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
277949	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
277955	Ribosomal protein L21e
277956	Hsp90 protein
277973	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
277979	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
277982	L1 transposable element
277982	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
277982	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
277982	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
277990	7 transmembrane receptor (rhodopsin family)
278002	ENV polyprotein (coat polyprotein)
278002	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
278020	F-actin capping protein alpha subunit
278028	L1 transposable element
278028	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
278037	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
278037	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
278041	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
278041	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
278041	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
278050	SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1 p
278061	ENV polyprotein (coat polyprotein)
278061	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
278062	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
278062	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
278067	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
278078	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
278078	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
278078	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
278078	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
278085	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
278087	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
278102	ENV polyprotein (coat polyprotein)
278103	HMG (high mobility group) box
278105	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
278125	BTG1 family. A novel family of anti-proliferative proteins
278129	Polyprenyl synthetase
278143	Ribosomal protein L19
278143	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
278143	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
278145	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
278148	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
278167	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
278170	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
278170	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
278174	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
278178	Pyruvate kinase, alpha/beta domain
278178	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
278178	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
278179	Eukaryotic porin
278179	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
278188	WD domain, G-beta repeat
278192	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
278194	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
278197	Aldose 1-epimerase
278203	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes
278209	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
278211	ENV polyprotein (coat polyprotein)
278214	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
278215	Zinc finger, C3HC4 type (RING finger)
278216	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
278227	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
278232	C2 domain
278240	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
278244	L1 transposable element
278244	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
278244	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
278244	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
278248	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
278263	L1 transposable element
278263	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
278265	L1 transposable element
278265	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
278265	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
278265	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
278266	L1 transposable element
278267	L1 transposable element
278268	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
278269	Mitosis protein DIM1
278279	TPR Domain
278319	Ribosomal protein S28e
278350	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
278350	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
278365	Ribosomal protein S5, C-terminal domain
278417	Eukaryotic ribosomal protein L18
278420	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
278420	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
278420	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
278473	Uncharacterized ACR, COG1579
278473	Intermediate filament protein
278473	Myosin head (motor domain)
278473	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
278473	Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown
278473	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
278507	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
278507	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
278581	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
278583	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
278603	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
278616	WD domain, G-beta repeat
278672	Homeobox domain
278718	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
278725	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
278748	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
278748	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
278748	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
278748	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
278757	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
278759	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
278759	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
278759	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
278786	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
278791	Fibronectin type III domain
278817	ENV polyprotein (coat polyprotein)
278876	ENV polyprotein (coat polyprotein)
278876	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
278876	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
278876	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
278876	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
278915	ENV polyprotein (coat polyprotein)
278986	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
279013	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
279013	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
279028	Thrombospondin type 1 domain
279028	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
279029	Protein kinase domain
279034	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
279050	7 transmembrane receptor (rhodopsin family)
279056	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
279057	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
279067	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
279071	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
279137	7 transmembrane receptor (metabotropic glutamate family)
279137	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
279138	7 transmembrane receptor (metabotropic glutamate family)
279138	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
279139	Glycosyl hydrolases family 39
279139	7 transmembrane receptor (metabotropic glutamate family)
279140	7 transmembrane receptor (metabotropic glutamate family)
279140	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
279181	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
279181	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
279184	ADP-ribosylation factor family
279184	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
279230	Skp1 family, dimerisation domain
279230	Skp1 family, tetramerisation domain
279260	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
279333	Protein kinase domain
279387	MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000
279414	Ribosomal protein S8e
279421	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. M
279429	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
279430	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
279432	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
279464	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
279464	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
279472	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
279475	HMG (high mobility group) box
279499	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
279499	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
279500	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
279553	Homeobox domain
279554	Homeobox domain
279569	Ribosomal protein L21e
279572	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
279578	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
279578	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
279610	Homeobox domain
279639	Ribosomal protein L6e
279639	Ribosomal protein L6, N-terminal domain
279639	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
279643	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
279653	Cadherin domain
279656	HMG (high mobility group) box
279658	Ribosomal protein S2
279679	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
279684	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
279686	ENV polyprotein (coat polyprotein)
279688	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
279691	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
279691	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f
279730	L1 transposable element
279730	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
279730	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
279730	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
279730	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
279766	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t
279766	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
279796	7 transmembrane receptor (rhodopsin family)
279851	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
279857	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
279872	L1 transposable element
279872	Intermediate filament protein
279872	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
279872	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
279872	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
279872	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
279881	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
279882	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
279887	L1 transposable element
279887	Intermediate filament protein
279887	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
279887	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
279887	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
279896	Protein kinase domain
279919	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
279922	L1 transposable element
279922	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
279922	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
279923	L1 transposable element
279923	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
279923	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
279926	L1 transposable element
279926	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
279934	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
279934	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
279935	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
279939	Class I Histocompatibility antigen, domains alpha 1 and 2
279940	7 transmembrane receptor (rhodopsin family)
279942	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
279947	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
279947	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
279953	7 transmembrane receptor (rhodopsin family)
279963	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
279991	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
280026	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
280043	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
280046	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
280046	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
280047	Ribosomal protein L13e
280078	Smr domain. This family includes the Smr (Small MutS Related) proteins, and the C-terminal region of the MutS2 protein. It has been suggested that this domain interacts with the MutS1 protein in the case of Smr proteins and with the N-terminal MutS relate
280093	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
280093	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
280093	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
280095	L1 transposable element
280096	L1 transposable element
280096	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
280097	L1 transposable element
280099	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
280112	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
280118	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
280121	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
280123	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
280132	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
280132	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
280133	L1 transposable element
280133	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
280156	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
280184	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
280184	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
280188	GH3 auxin-responsive promoter
280188	Repeat in ubiquitin-activating (UBA) protein
280188	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
280205	Ribosomal protein L6
280219	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
280316	ENV polyprotein (coat polyprotein)
280412	Ribosomal protein L21e
280413	von Willebrand factor type A domain
280413	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
280415	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
280420	Ribosomal L18ae protein family
280444	ENV polyprotein (coat polyprotein)
280444	Gag P30 core shell protein. This protein is the viral core shell protein. P30 is essential for viral assembly
280444	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
280444	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
280444	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
280458	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
280458	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
280458	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin u
280487	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
280487	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
280487	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
280487	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
280487	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
280487	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
280487	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
280487	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
280522	L1 transposable element
280522	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
280523	L1 transposable element
280523	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
280553	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
280553	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
280568	L1 transposable element
280568	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
280575	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
280645	Glycosyltransferase family 43
280662	Serum albumin family
280667	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
280667	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
280668	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
280668	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
280671	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
282546	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
282547	7 transmembrane receptor (rhodopsin family)
282581	7 transmembrane receptor (Secretin family)
282581	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are
282582	wnt family
282587	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
282636	Inositol monophosphatase family
282708	Serum albumin family
282712	Sodium:neurotransmitter symporter family
282817	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
282819	Cyclophilin type peptidyl-prolyl cis-trans isomerase
282821	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
282821	Chitin synthase. Members of this family are fungal chitin synthase EC:2.4.1.16 enzymes. They catalyse chitin synthesis as follows: UDP-N-acetyl-D-glucosamine + {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N) <=> UDP + {(1,4)-(N-acetyl-beta-D-glucosaminyl)}(N+1)
282824	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
282825	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
282832	7 transmembrane receptor (rhodopsin family)
282832	7 transmembrane receptor (rhodopsin family)
282833	Zona pellucida-like domain
282834	Eukaryotic protein of unknown function, DUF279
282836	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
282836	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
282839	7 transmembrane receptor (rhodopsin family)
282892	7 transmembrane receptor (rhodopsin family)
282893	7 transmembrane receptor (rhodopsin family)
282894	7 transmembrane receptor (rhodopsin family)
282896	7 transmembrane receptor (rhodopsin family)
282897	7 transmembrane receptor (rhodopsin family)
282898	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
282898	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
282901	Ribosomal S17
282907	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
282910	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
282925	Ribosomal protein S2
282929	7 transmembrane receptor (rhodopsin family)
282930	7 transmembrane receptor (rhodopsin family)
282933	7 transmembrane receptor (rhodopsin family)
282934	7 transmembrane receptor (rhodopsin family)
282935	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
282935	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
282938	Ribosomal S17
282947	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
282959	Class II histocompatibility antigen, beta domain
282959	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
282978	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
282982	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
282985	Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex
282988	Ribosomal protein L13
282998	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
282998	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
283002	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
283002	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
283004	Prenyltransferase and squalene oxidase repeat
283011	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, whic
283011	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
283018	PH domain. PH stands for pleckstrin homology
283018	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
283018	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
283021	PH domain. PH stands for pleckstrin homology
283021	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
283021	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
283023	Prenyltransferase and squalene oxidase repeat
283037	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
283037	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
283039	Homeobox domain
283043	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
283043	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
283044	PH domain. PH stands for pleckstrin homology
283044	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
283054	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
283056	Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised
283082	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
283092	7 transmembrane receptor (rhodopsin family)
283093	7 transmembrane receptor (rhodopsin family)
283097	7 transmembrane receptor (rhodopsin family)
283100	7 transmembrane receptor (rhodopsin family)
283101	7 transmembrane receptor (rhodopsin family)
283102	Intermediate filament protein
283103	Ribosomal protein L23
283109	7 transmembrane receptor (rhodopsin family)
283110	7 transmembrane receptor (rhodopsin family)
283111	7 transmembrane receptor (rhodopsin family)
283114	S25 ribosomal protein
283115	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
283116	Zinc finger, C3HC4 type (RING finger)
283116	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
283117	Zinc finger, C3HC4 type (RING finger)
283117	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
283130	Mitochondrial carrier protein
283130	Mitochondrial carrier protein
283137	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
283150	Fork head domain
283155	Protein kinase domain
283158	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
283159	7 transmembrane receptor (rhodopsin family)
283160	7 transmembrane receptor (rhodopsin family)
283162	7 transmembrane receptor (rhodopsin family)
283176	Glycosyl hydrolases family 35
283180	NADH dehydrogenase
283181	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
283188	7 transmembrane receptor (rhodopsin family)
283189	7 transmembrane receptor (rhodopsin family)
283193	7 transmembrane receptor (rhodopsin family)
283198	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whic
283202	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
283208	2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2
283210	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
283219	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
283222	Ribosomal protein S28e
283226	Protein prenyltransferase alpha subunit repeat
283238	Sugar (and other) transporter
283240	Sugar (and other) transporter
283247	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous
283248	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. E
283256	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
283257	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
283258	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
283259	Zinc finger, C3HC4 type (RING finger)
283259	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
283264	P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function
283265	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anch
283283	Mitochondrial carrier protein
283286	Formate--tetrahydrofolate ligase
283287	7 transmembrane receptor (rhodopsin family)
283288	7 transmembrane receptor (rhodopsin family)
283297	7 transmembrane receptor (rhodopsin family)
283298	Olfactomedin-like domain
283312	Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran
283320	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
283321	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
283326	Lectin C-type domain. This family includes both long and short form C-type
283340	Ribosomal protein L13
283341	Adenylate kinase
283342	Core histone H2A/H2B/H3/H4
283343	Arginosuccinate synthase. This family contains a PP-loop motif
283345	Ribosomal protein L13e
283349	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
283349	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
283358	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
283365	7 transmembrane receptor (rhodopsin family)
283366	7 transmembrane receptor (rhodopsin family)
283367	7 transmembrane receptor (rhodopsin family)
283368	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
283375	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the
283377	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
283383	7 transmembrane receptor (Secretin family)
283398	ATP-grasp domain. This family does not contain all known ATP-grasp domain members. This family includes a diverse set of enzymes that possess ATP-dependent carboxylate-amine ligase activity
283398	CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP
283415	Uncharacterized protein PaaI, COG2050
283415	Fungal specific domain of unknown function (DUF391). A family of uncharacterised fungal proteins
283415	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-Co
283418	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
283419	Lectin C-type domain. This family includes both long and short form C-type
283420	Lectin C-type domain. This family includes both long and short form C-type
283421	Ribosomal protein L13
283425	Skp1 family, dimerisation domain
283425	Skp1 family, tetramerisation domain
283431	Growth-Arrest-Specific Protein 2 Domain
283455	Protein kinase domain
283458	Nucleoside diphosphate kinase
283459	Glu-tRNAGln amidotransferase C subunit. This is a family of Glu-tRNAGln amidotransferase C subunits. The Glu-tRNA Gln amidotransferase enzyme itself is an important translational fidelity mechanism replacing incorrectly charged Glu-tRNAGln with the correc
283464	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
283465	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
283465	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
283466	Ribosomal protein S27
283471	Trypsin
283471	Immunoglobulin A1 protease. This family consists of immunoglobulin A1 protease proteins. The immunoglobulin A1 protease cleaves immunoglobulin IgA and is found in pathogenic bacteria such as Neisseria gonorrhoeae. Not all of the members of this family are
283479	Ribosomal protein S26e
283490	Ribosomal protein L13
283493	7 transmembrane receptor (rhodopsin family)
283494	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
283509	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
283510	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
283510	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
283514	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results
283518	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
283523	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
283529	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
283535	7 transmembrane receptor (rhodopsin family)
283539	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
283548	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f
283549	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
283558	Ribosomal protein L31e
283600	Mitochondrial carrier protein
283615	7 transmembrane receptor (rhodopsin family)
283617	7 transmembrane receptor (rhodopsin family)
283618	7 transmembrane receptor (rhodopsin family)
283620	7 transmembrane receptor (rhodopsin family)
283621	7 transmembrane receptor (rhodopsin family)
283623	7 transmembrane receptor (rhodopsin family)
283632	7 transmembrane receptor (rhodopsin family)
283634	Ribosomal protein L23
283652	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells
283660	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
283675	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
283693	Actin
283694	7 transmembrane receptor (rhodopsin family)
283695	Ribosomal protein S8e
283698	Synaptogyrin. This family of proteins is distantly related to pfam01284
283704	Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi
283706	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
283716	Cyclic nucleotide-binding domain
283716	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
283719	Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi
283727	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
283741	Homeobox domain
283747	Skp1 family, tetramerisation domain
283748	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lyso
283748	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop
283754	Protein kinase domain
283756	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
283801	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
283804	Disintegrin
283806	Protein kinase domain
283819	Ribosomal L15
283848	Carboxylesterase
283853	Uracil DNA glycosylase superfamily
283855	Tropomyosin
283870	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
283870	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
283871	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
283878	Acyl CoA binding protein
283903	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
283912	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
283922	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
283925	SCP-like extracellular protein. This domain is also found in prokaryotes
283926	Chaperonin 10 Kd subunit
283927	NUDIX domain
283940	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
283955	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
283955	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
283957	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
283964	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
283970	Pyridoxal-dependent decarboxylase conserved domain
283985	Fatty acid desaturase
283985	Fatty acid desaturase
284013	Vitelline membrane outer layer protein I (VOMI). VOMI binds tightly to ovomucin fibrils of the egg yolk membrane. The structure that consists of three beta-sheets forming Greek key motifs, which are related by an internal pseudo three-fold symmetry. Furt
284013	Vitelline membrane outer layer protein I (VOMI). VOMI binds tightly to ovomucin fibrils of the egg yolk membrane. The structure that consists of three beta-sheets forming Greek key motifs, which are related by an internal pseudo three-fold symmetry. Furth
284015	ATP synthase subunit C
284024	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
284037	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
284054	Phosphotyrosine interaction domain (PTB/PID)
284057	Phosphotyrosine interaction domain (PTB/PID)
284064	Ribosomal L29e protein family
284076	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
284078	Eukaryotic initiation factor 4E
284085	Intermediate filament protein
284086	Protein kinase domain
284087	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
284089	Intermediate filament protein
284100	14-3-3 protein
284111	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
284121	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
284125	ATP synthase A chain
284125	Cytochrome b(N-terminal)/b6/petB
284125	Cytochrome C oxidase subunit II, periplasmic domain
284144	Ribosomal protein L13e
284161	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has b
284161	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be
284162	Yippee putative zinc-binding protein
284164	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
284165	Clathrin adaptor complex small chain
284165	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
284166	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
284167	DNA-dependent RNA polymerase. This is a family of single chain RNA polymerases
284195	Intermediate filament protein
284196	Intermediate filament protein
284200	Protein kinase domain
284201	Zinc finger, C3HC4 type (RING finger)
284202	Fatty acid desaturase
284203	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
284208	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
284210	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
284210	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
284212	BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems
284217	Laminin G domain
284217	Laminin B (Domain IV)
284217	Laminin N-terminal (Domain VI)
284217	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
284218	Mitochondrial carrier protein
284228	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
284229	Cyclophilin type peptidyl-prolyl cis-trans isomerase
284230	Ribosomal protein L44
284235	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
284249	L1 transposable element
284250	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
284250	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
284252	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
284265	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
284268	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling.
284270	Microtubule associated protein (MAP65/ASE1 family)
284287	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
284288	Ribosomal protein L24e
284293	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
284295	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
284299	Protein kinase domain
284301	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
284302	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
284302	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
284307	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
284312	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
284315	Ribosomal protein L23
284318	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
284319	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl
284320	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
284323	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
284335	PH domain. PH stands for pleckstrin homology
284341	A20-like zinc finger. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation
284345	PH domain. PH stands for pleckstrin homology
284347	Phospholipase A2 inhibitor
284355	Homeobox domain
284357	Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes
284358	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
284358	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
284363	Dihydrodipicolinate reductase
284363	Domain of unknown function DUF108. This family has no known function. It is found to compose the complete protein in archaebacteria and a single domain in a large C. elegans protein
284383	7 transmembrane receptor (rhodopsin family)
284387	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
284390	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
284391	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
284393	Ribosomal L10
284393	Ribosomal protein L16
284427	Mitochondrial carrier protein
284431	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
284432	Ribosomal protein L23
284434	WD domain, G-beta repeat
284436	AMP-binding enzyme
284438	Citrate synthase
284439	Mitochondrial carrier protein
284457	Ribosomal protein S27
284458	Ribosomal protein S2
284459	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
284479	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
284486	Fungal specific domain of unknown function (DUF391). A family of uncharacterised fungal proteins
284486	Fungal specific domain of unknown function (DUF391). A family of uncharacterised fungal proteins
284500	Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwee
284509	Helix-loop-helix DNA-binding domain
284515	Adenylate kinase
284521	7 transmembrane receptor (rhodopsin family)
284522	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
284525	Cyclic nucleotide-binding domain
284525	Cyclic nucleotide-binding domain
284525	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
284532	7 transmembrane receptor (rhodopsin family)
284533	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
284535	7 transmembrane receptor (rhodopsin family)
284541	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
284545	HMG (high mobility group) box
284553	Triosephosphate isomerase
284556	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
284556	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
284556	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
284563	Cyclophilin type peptidyl-prolyl cis-trans isomerase
284597	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
284613	Cytochrome b561
284614	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
284615	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
284642	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
284651	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
284652	CUB domain
284652	Sushi domain (SCR repeat)
284656	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
284659	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
284659	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
284674	Occludin/ELL family
284675	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
284676	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
284676	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
284679	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
284683	HMG (high mobility group) box
284684	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
284685	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
284689	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
284689	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
284690	Ubiquitin carboxyl-terminal hydrolase, family 1
284693	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
284696	XPG N-terminal domain
284697	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
284710	Ribosomal L29 protein
284711	Ribosomal protein S7e
284725	Ubiquinol-cytochrome C reductase hinge protein. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 'hinge' protein of the complex which is thought to mediate formatio
284759	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
284766	Ribosomal family S4e
284766	S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea
284767	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
284774	Ribosomal protein S17
284779	Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration
284810	Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids
284813	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling.
284819	Mitochondrial glycoprotein. This mitochondrial matrix protein family contains members of the MAM33 family which bind to the globular 'heads' of C1Q. It is thought to be involved in mitochondrial oxidative phosphorylation and in nucleus-mitochondrion inter
284820	Polyprenyl synthetase
284821	Ribosomal protein L13
284821	ssDNA binding protein. This protein is found in herpesviruses and is needed for replication
284832	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
284833	Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown
284843	Ribosomal protein L23
284849	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
284866	Proline dehydrogenase
284890	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
284897	Ribosomal L15
284904	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
284904	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
284905	Ribosomal S17
284918	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
284918	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
284922	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
284942	Ribosomal protein L23
284948	SH2 domain
284964	pfam02878, PGM_PMM_I, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I
284967	Trypsin
284988	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
284996	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
285003	Ribosomal protein L36
285004	Ribosomal L39 protein
285005	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
285010	Dihydrofolate reductase
285011	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
285031	HMG (high mobility group) box
285035	Gamma-glutamyltranspeptidase
285044	Adenylate kinase
285046	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
285053	Ribosomal L18ae protein family
285055	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
285062	SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily
285089	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
285099	HesB-like domain. This family includes HesB which may be involved in nitrogen fixation
285107	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
285109	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
285112	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
285117	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Membe
285133	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
285148	Lipase/Acylhydrolase with GDSL-like motif
285164	Cytochrome c oxidase subunit III
285176	Ribosomal L10
285189	PAN domain. The PAN domain contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some t
285190	TPR Domain
285190	RanBP1 domain
285190	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
285192	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
285193	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
285195	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
285207	Ribosomal protein L36e
285208	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
285208	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
285214	Ribosomal protein L23
285216	pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
285218	Ribosomal L18ae protein family
285227	HMG (high mobility group) box
285233	Intermediate filament protein
285234	Intermediate filament protein
285241	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
285242	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
285242	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
285242	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
285242	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
285244	Hsp90 protein
285245	Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars
285246	Ribosomal protein S3, C-terminal domain. This family contains a central domain pfam00013, hence the amino and carboxyl terminal domains are stored separately. This is a minimal carboxyl-terminal domain. Some are much longer
285260	Ribosomal protein L31e
285268	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
285269	Ribosomal protein S27
285280	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
285289	Hsp90 protein
285289	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
285292	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
285301	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
285313	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
285320	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
285321	Tropomyosin
285329	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
285329	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
285335	Cyclic nucleotide-binding domain
285335	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
285335	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
285350	Ribosomal protein S24e
285358	Homeobox domain
285358	PBX domain. The PBX domain is a bipartite acidic domain
285361	Ribosomal protein S2
285362	Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown
285362	Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown
285373	Protein kinase domain
285384	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
285407	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
285410	Ribosomal protein L21e
285412	HMG (high mobility group) box
285426	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
285433	PMP-22/EMP/MP20/Claudin family
285434	Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway
285440	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
285442	PCRF domain. This domain is found in peptide chain release factors
285442	Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons
285449	Ribosomal protein S12
285453	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
285458	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
285466	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
285484	HMG (high mobility group) box
285507	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
285519	UDP-glucoronosyl and UDP-glucosyl transferase
285519	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
285536	Ribosomal protein L21e
285537	Ribosomal protein S27
285544	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
285554	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
285554	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
285554	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
285562	Core histone H2A/H2B/H3/H4
285590	PX domain. PX domains bind to phosphoinositides
285590	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
285598	ADP-ribosylation factor family
285605	DTW domain. This presumed domain is found in bacterial and eukaryotic proteins. Its function is unknown. The domain contains multiple conserved motifs including a DTXW motif that this domain has been named after
285615	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
285625	S25 ribosomal protein
285641	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
285641	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
285643	Kinesin motor domain
285643	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
285655	Ribosomal L10
285658	Ribosomal protein L19e
285659	7 transmembrane receptor (rhodopsin family)
285669	HesB-like domain. This family includes HesB which may be involved in nitrogen fixation
285671	Zinc finger, C3HC4 type (RING finger)
285676	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
285702	Actin
285703	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
285720	Ribosomal S3Ae family
285728	HMG (high mobility group) box
285732	14-3-3 protein
285737	7 transmembrane receptor (rhodopsin family)
285741	Translationally controlled tumor protein
285748	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
285752	Ribosomal protein L3
285755	Cyclophilin type peptidyl-prolyl cis-trans isomerase
285761	LCCL domain
285761	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
285770	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
285783	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
285785	Ribosomal L39 protein
285787	Dopey, N-terminal domain. DopA is the founding member of the Dopey family and is required for correct cell morphology and spatiotemporal organization of multicellular structures in the filamentous fungus Aspergillus nidulans. DopA homologues are found in
285788	Protein kinase domain
285802	Zinc-binding dehydrogenase
285809	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
285829	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
285840	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
285845	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
285846	Ribosomal protein S8
285848	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity o
285849	Cytochrome c oxidase subunit VIa
285850	Ribosomal protein L14p/L23e
285855	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
285865	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
285895	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
285897	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
285900	Ribosomal protein L6, N-terminal domain
285910	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
285911	Ribosomal protein L31e
285918	Kinesin motor domain
285926	DNA mismatch repair protein, C-terminal domain. This family represents the C-terminal domain of the mutL/hexB/PMS1 family. This domain has a ribosomal S5 domain 2-like fold
285929	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
285932	Ribosomal protein L44
285934	Helix-loop-helix DNA-binding domain
285938	Ribosomal protein S26e
285940	Protein kinase domain
285967	Ribosomal protein L44
285969	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
285984	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
285985	Ribosomal protein S27
286000	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
286016	Triosephosphate isomerase
286017	Protein kinase domain
286020	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
286061	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
286066	Asparagine synthase. This family is always found associated with pfam00310. Members of this family catalyse the conversion of aspartate to asparagine
286088	sic protein. Serotype M1 group A Streptococcus strains cause epidemic waves of human infections. This family includes the sic protein an extracellular protein (streptococcal inhibitor of complement) that inhibits human complement
286091	Ribosomal protein S26e
286102	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
286106	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
286136	Protein kinase domain
286145	Ribosomal family S4e
286150	Ribosomal protein S5, C-terminal domain
286150	Ribosomal protein S5, N-terminal domain
286151	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
286153	Ribosomal protein S26e
286157	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
286172	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
286181	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
286192	Hsp90 protein
286220	Helix-loop-helix DNA-binding domain
286222	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
286222	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
286228	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
286239	Malic enzyme, N-terminal domain
286240	Intermediate filament protein
286246	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
286246	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
286255	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
286256	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
286256	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
286310	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
286346	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
286346	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
286348	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
286360	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
286362	7 transmembrane receptor (rhodopsin family)
286364	7 transmembrane receptor (rhodopsin family)
286365	7 transmembrane receptor (rhodopsin family)
286380	Fork head domain
286397	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
286398	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
286400	Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA
286400	Proliferating cell nuclear antigen, N-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA
286402	Putative undecaprenyl diphosphate synthase. Previously known as uncharacterized protein family UPF0015, a single member of this family has been identified as an undecaprenyl diphosphate synthase
286409	Eukaryotic porin
286420	ADP-ribosylation factor family
286423	Ribosome recycling factor. The ribosome recycling factor (RRF / ribosome release factor) dissociates the ribosome from the mRNA after termination of translation, and is essential bacterial growth. Thus ribosomes are "recycled" and ready for another round
286425	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF00349. Some members of the family have two copies of each of these domains
286425	Hexokinase. Hexokinase contains two structurally similar domains represented by this family and PFAM:PF03727. Some members of the family have two copies of each of these domains
286426	Eukaryotic porin
286428	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
286429	Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in
286436	Core histone H2A/H2B/H3/H4
286436	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
286444	Ribosomal protein S5, C-terminal domain
286444	Ribosomal protein S5, N-terminal domain
286454	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
286462	Cyclin-dependent kinase regulatory subunit
286468	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Membe
286469	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
286472	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
286474	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
286474	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
286479	Glutathione peroxidase
286480	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
286482	Hsp90 protein
286486	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
286487	14-3-3 protein
286488	Protein kinase domain
286489	AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the d
286494	Protein kinase domain
286500	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
286502	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
286503	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
286512	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
286513	Ribosomal protein S26e
286514	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of
286516	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
286517	FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain
286517	FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain
286526	ADP-ribosylation factor family
286526	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
286530	7 transmembrane receptor (rhodopsin family)
286530	7 transmembrane receptor (rhodopsin family)
286532	MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases
286535	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
286539	Ribosomal protein S26e
286544	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
286546	Sulfatase
286555	WD domain, G-beta repeat
286590	Ribosomal protein L23
286653	F-box domain
286662	Eukaryotic initiation factor 4E
286722	Protein kinase domain
286749	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
286749	Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are rep
286758	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
286759	Casein
286887	Intermediate filament protein
286888	WAP-type (Whey Acidic Protein) 'four-disulfide core'
286889	Somatotropin hormone family
286890	Tropomyosin
286892	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
286892	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
286893	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
286896	Aminotransferase class-V
286896	Aminotransferase class I and II
286896	Pyridoxal-dependent decarboxylase conserved domain
286898	ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen r
286898	ML domain. ML domain - MD-2-related lipid recognition domain. This family consists of proteins from plants, animals and fungi, including dust mite allergen Der P 2. It has been implicate in lipid recognition, particularly in the recognition of pathogen re
286901	Giardia variant-specific surface protein
286901	Giardia variant-specific surface protein
286901	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl
286904	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
286904	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
286905	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
286905	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
286908	LIM domain. This family represents two copies of the LIM structural domain
286908	LIM domain. This family represents two copies of the LIM structural domain
286908	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
286911	Trypsin
286912	Intermediate filament protein
286913	7 transmembrane receptor (rhodopsin family)
286913	7 transmembrane receptor (rhodopsin family)
286914	7 transmembrane receptor (metabotropic glutamate family)
286914	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286914	7 transmembrane receptor (metabotropic glutamate family)
286914	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286915	7 transmembrane receptor (metabotropic glutamate family)
286915	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286916	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
286918	Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169
286920	Amiloride-sensitive sodium channel
286920	Amiloride-sensitive sodium channel
286921	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
286921	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
286922	Fatty acid desaturase
286922	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
286922	Fatty acid desaturase
286922	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
286923	Guanylate-kinase-associated protein (GKAP) protein
286923	Guanylate-kinase-associated protein (GKAP) protein
286924	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
286924	ADP-ribosylation factor family
286924	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
286925	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
286926	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
286927	Protein kinase domain
286927	Protein kinase domain
286929	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
286930	Guanylate-kinase-associated protein (GKAP) protein
286930	Guanylate-kinase-associated protein (GKAP) protein
286931	C2 domain
286931	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
286933	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
286933	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
286936	Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase
286938	GTPase of unknown function
286940	Filamin/ABP280 repeat
286953	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
286954	UDP-glucoronosyl and UDP-glucosyl transferase
286954	UDP-glucoronosyl and UDP-glucosyl transferase
286954	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
286955	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
286956	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
286956	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
286957	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
286957	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
286958	Mammalian defensin
286958	Defensin propeptide
286958	Mammalian defensin
286958	Defensin propeptide
286959	7 transmembrane receptor (rhodopsin family)
286959	7 transmembrane receptor (rhodopsin family)
286960	Trypsin
286960	Trypsin
286962	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
286963	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
286963	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
286964	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
286964	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
286966	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
286974	Sulfotransferase protein
286976	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryo
286981	7 transmembrane receptor (metabotropic glutamate family)
286981	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286981	7 transmembrane receptor (metabotropic glutamate family)
286981	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286982	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286982	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286983	7 transmembrane receptor (metabotropic glutamate family)
286983	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286983	7 transmembrane receptor (metabotropic glutamate family)
286983	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286984	7 transmembrane receptor (metabotropic glutamate family)
286984	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286984	7 transmembrane receptor (metabotropic glutamate family)
286984	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286985	7 transmembrane receptor (metabotropic glutamate family)
286985	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286986	7 transmembrane receptor (metabotropic glutamate family)
286986	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286986	7 transmembrane receptor (metabotropic glutamate family)
286986	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
286989	UDP-glucoronosyl and UDP-glucosyl transferase
286990	HMG (high mobility group) box
286990	HMG (high mobility group) box
286991	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated
286994	7 transmembrane receptor (rhodopsin family)
286995	Defensin propeptide
286995	Mammalian defensin
286995	Defensin propeptide
286996	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
286997	Protein kinase domain
286998	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
286998	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
287000	7 transmembrane receptor (rhodopsin family)
287000	7 transmembrane receptor (rhodopsin family)
287001	7 transmembrane receptor (rhodopsin family)
287001	7 transmembrane receptor (rhodopsin family)
287002	7 transmembrane receptor (rhodopsin family)
287002	7 transmembrane receptor (rhodopsin family)
287006	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
287006	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
287006	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
287006	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
287029	Ribosomal protein L31e
287031	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
287042	ParA family ATPase
287042	4Fe-4S iron sulfur cluster binding proteins, NifH/frxC family
287046	14-3-3 protein
287049	PMP-22/EMP/MP20/Claudin family
287051	Ribosomal protein L6
287055	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
287056	ribosomal L5P family C-terminus. This region is found associated with pfam00281
287060	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
287060	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
287065	Uncharacterised protein family (UPF0108)
287068	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
287069	Hsp90 protein
287070	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
287074	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
287075	Thymosin beta-4 family
287077	7 transmembrane receptor (rhodopsin family)
287079	7 transmembrane receptor (rhodopsin family)
287080	7 transmembrane receptor (rhodopsin family)
287081	7 transmembrane receptor (rhodopsin family)
287082	7 transmembrane receptor (rhodopsin family)
287083	7 transmembrane receptor (rhodopsin family)
287084	7 transmembrane receptor (rhodopsin family)
287085	7 transmembrane receptor (rhodopsin family)
287086	7 transmembrane receptor (rhodopsin family)
287087	Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However
287088	7 transmembrane receptor (rhodopsin family)
287089	7 transmembrane receptor (rhodopsin family)
287090	7 transmembrane receptor (rhodopsin family)
287090	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
287091	7 transmembrane receptor (rhodopsin family)
287092	Ribosomal family S4e
287093	7 transmembrane receptor (rhodopsin family)
287095	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
287095	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
287096	Ribosomal protein L44
287098	PMP-22/EMP/MP20/Claudin family
287099	PMP-22/EMP/MP20/Claudin family
287102	CUB domain
287102	WSC domain. This domain may be involved in carbohydrate binding
287102	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
287104	Trypsin
287105	S25 ribosomal protein
287106	Trypsin
287107	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
287108	Trypsin
287109	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
287110	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
287114	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
287115	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
287120	WD domain, G-beta repeat
287123	SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four decad
287125	ParA family ATPase
287125	4Fe-4S iron sulfur cluster binding proteins, NifH/frxC family
287125	ArgK protein. The ArgK protein acts as an ATPase enzyme and as a kinase, and phosphorylates periplasmic binding proteins involved in the LAO (lysine, arginine, ornithine)/AO transport systems
287125	Anion-transporting ATPase. This Pfam family represents a conserved domain, which is sometimes repeated, in an anion-transporting ATPase. The ATPase is involved in the removal of arsenate, antimonite, and arsenate from the cell
287130	Pentaxin family. Pentaxins are also known as pentraxins
287136	Trypsin
287137	Trypsin
287138	Trypsin
287140	Trypsin
287141	Trypsin
287142	Trypsin
287143	Trypsin
287145	RNA pseudouridylate synthase. Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes RluD, a pseudouridylate synthase that converts specific uracils to
287148	RNA pseudouridylate synthase. Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes RluD, a pseudouridylate synthase that converts specific uracils to
287155	TPR Domain
287160	Calpain family cysteine protease
287161	Ribosomal protein L34e
287162	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
287164	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
287165	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
287167	Globin
287168	Globin
287169	Uncharacterised protein family (UPF0171)
287172	Ribosomal protein L35Ae
287174	Ribosomal Proteins L2, C-terminal domain
287178	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
287180	Protein kinase domain
287182	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
287185	Fork head domain
287188	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
287189	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
287191	tRNA synthetases class I (R). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only arginyl tRNA synthetase
287195	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
287199	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
287200	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
287203	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
287203	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
287204	S25 ribosomal protein
287206	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
287207	Ribosomal protein S5, C-terminal domain
287207	Ribosomal protein S5, N-terminal domain
287208	TPR Domain
287209	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
287215	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
287223	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
287227	Mitosis protein DIM1
287227	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
287228	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
287229	CUB domain
287230	Thymosin beta-4 family
287232	7 transmembrane receptor (rhodopsin family)
287234	7 transmembrane receptor (rhodopsin family)
287235	Ribosomal protein L31e
287236	7 transmembrane receptor (rhodopsin family)
287237	7 transmembrane receptor (rhodopsin family)
287239	7 transmembrane receptor (rhodopsin family)
287240	7 transmembrane receptor (rhodopsin family)
287241	7 transmembrane receptor (rhodopsin family)
287242	7 transmembrane receptor (rhodopsin family)
287243	7 transmembrane receptor (rhodopsin family)
287244	7 transmembrane receptor (rhodopsin family)
287245	7 transmembrane receptor (rhodopsin family)
287245	1,3-beta-glucan synthase component. This family consists of various 1,3-beta-glucan synthase components including Gls1, Gls2 and Gls3 from yeast. 1,3-beta-glucan synthase EC:2.4.1.34 also known as callose synthase catalyses the formation of a beta-1,3-glu
287246	7 transmembrane receptor (rhodopsin family)
287247	7 transmembrane receptor (rhodopsin family)
287248	Ribosomal protein S28e
287249	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
287250	Ribosomal L29e protein family
287251	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
287252	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
287254	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
287255	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
287258	7 transmembrane receptor (rhodopsin family)
287259	7 transmembrane receptor (rhodopsin family)
287260	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
287260	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
287261	7 transmembrane receptor (rhodopsin family)
287262	7 transmembrane receptor (rhodopsin family)
287263	7 transmembrane receptor (rhodopsin family)
287264	7 transmembrane receptor (rhodopsin family)
287265	7 transmembrane receptor (rhodopsin family)
287266	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
287268	Protein kinase domain
287269	Protein kinase domain
287270	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
287273	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
287274	Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses
287276	ADP-ribosylation factor family
287276	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
287276	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
287277	Proteasome A-type and B-type
287280	Skp1 family, dimerisation domain
287280	Skp1 family, tetramerisation domain
287281	Eukaryotic porin
287282	Ribosomal protein S5, N-terminal domain
287283	Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14
287284	Ribosomal L10
287287	Interleukin 4
287291	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
287293	HMG (high mobility group) box
287300	BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems
287302	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
287305	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
287307	'Cold-shock' DNA-binding domain
287309	7 transmembrane receptor (rhodopsin family)
287310	7 transmembrane receptor (rhodopsin family)
287311	7 transmembrane receptor (rhodopsin family)
287312	7 transmembrane receptor (rhodopsin family)
287313	7 transmembrane receptor (rhodopsin family)
287314	7 transmembrane receptor (rhodopsin family)
287315	7 transmembrane receptor (rhodopsin family)
287316	7 transmembrane receptor (rhodopsin family)
287317	7 transmembrane receptor (rhodopsin family)
287318	7 transmembrane receptor (rhodopsin family)
287319	7 transmembrane receptor (rhodopsin family)
287320	7 transmembrane receptor (rhodopsin family)
287321	7 transmembrane receptor (rhodopsin family)
287322	7 transmembrane receptor (rhodopsin family)
287323	7 transmembrane receptor (rhodopsin family)
287324	7 transmembrane receptor (rhodopsin family)
287325	7 transmembrane receptor (rhodopsin family)
287326	7 transmembrane receptor (rhodopsin family)
287328	7 transmembrane receptor (rhodopsin family)
287329	7 transmembrane receptor (rhodopsin family)
287330	7 transmembrane receptor (rhodopsin family)
287331	7 transmembrane receptor (rhodopsin family)
287333	7 transmembrane receptor (rhodopsin family)
287334	7 transmembrane receptor (rhodopsin family)
287335	7 transmembrane receptor (rhodopsin family)
287336	7 transmembrane receptor (rhodopsin family)
287337	7 transmembrane receptor (rhodopsin family)
287338	Enolase, N-terminal domain
287338	Enolase, C-terminal TIM barrel domain
287339	7 transmembrane receptor (rhodopsin family)
287340	7 transmembrane receptor (rhodopsin family)
287341	7 transmembrane receptor (rhodopsin family)
287342	7 transmembrane receptor (rhodopsin family)
287343	7 transmembrane receptor (rhodopsin family)
287344	7 transmembrane receptor (rhodopsin family)
287345	7 transmembrane receptor (rhodopsin family)
287347	7 transmembrane receptor (rhodopsin family)
287348	7 transmembrane receptor (rhodopsin family)
287349	7 transmembrane receptor (rhodopsin family)
287352	Core histone H2A/H2B/H3/H4
287353	Protein kinase domain
287354	Guanylate kinase
287354	Rad17 cell cycle checkpoint protein
287354	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
287354	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
287357	wnt family
287358	Trypsin
287359	jmjC domain
287360	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
287361	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
287361	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
287361	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
287362	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
287363	7 transmembrane receptor (rhodopsin family)
287364	Uncharacterized protein family UPF0023
287365	7 transmembrane receptor (rhodopsin family)
287367	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
287373	GTP1/OBG family
287373	TGS domain. The TGS domain is named after ThrRS, GTPase, and SpoT. Interestingly, TGS domain was detected also at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochae
287379	Serine hydroxymethyltransferase
287380	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
287382	Fibrinogen beta and gamma chains, C-terminal globular domain
287384	SH2 domain
287384	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
287385	Sodium:solute symporter family
287392	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
287393	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
287394	Sulfotransferase protein
287395	Sulfotransferase protein
287398	Protein kinase domain
287399	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
287400	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
287400	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
287401	Ribosomal L15
287402	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
287403	HIT family
287408	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
287409	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
287411	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
287416	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
287418	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
287423	Helix-loop-helix DNA-binding domain
287424	Lipoxygenase
287425	Lipoxygenase
287425	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
287426	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
287427	Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses
287429	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
287435	Lysosome-associated membrane glycoprotein (Lamp)
287436	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
287436	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
287437	TNF(Tumor Necrosis Factor) family
287438	Ribosomal protein L13e
287441	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
287442	Protein of unknown function (DUF423). Potential integral membrane protein
287444	Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold
287447	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
287448	SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop
287449	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
287454	Lipoxygenase
287459	Ribosomal protein L21e
287465	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
287466	Sulfotransferase protein
287466	WSC domain. This domain may be involved in carbohydrate binding
287467	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera
287468	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
287469	Fork head domain
287470	MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function
287473	Protein kinase domain
287478	PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. Howe
287479	FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain
287480	7 transmembrane receptor (rhodopsin family)
287481	7 transmembrane receptor (rhodopsin family)
287482	7 transmembrane receptor (rhodopsin family)
287483	7 transmembrane receptor (rhodopsin family)
287484	7 transmembrane receptor (rhodopsin family)
287486	7 transmembrane receptor (rhodopsin family)
287487	7 transmembrane receptor (rhodopsin family)
287488	7 transmembrane receptor (rhodopsin family)
287489	7 transmembrane receptor (rhodopsin family)
287490	7 transmembrane receptor (rhodopsin family)
287491	7 transmembrane receptor (rhodopsin family)
287492	7 transmembrane receptor (rhodopsin family)
287493	7 transmembrane receptor (rhodopsin family)
287494	7 transmembrane receptor (rhodopsin family)
287495	7 transmembrane receptor (rhodopsin family)
287496	7 transmembrane receptor (rhodopsin family)
287497	7 transmembrane receptor (rhodopsin family)
287498	7 transmembrane receptor (rhodopsin family)
287500	7 transmembrane receptor (rhodopsin family)
287501	7 transmembrane receptor (rhodopsin family)
287502	7 transmembrane receptor (rhodopsin family)
287503	7 transmembrane receptor (rhodopsin family)
287504	7 transmembrane receptor (rhodopsin family)
287505	7 transmembrane receptor (rhodopsin family)
287506	7 transmembrane receptor (rhodopsin family)
287507	7 transmembrane receptor (rhodopsin family)
287508	7 transmembrane receptor (rhodopsin family)
287510	7 transmembrane receptor (rhodopsin family)
287511	7 transmembrane receptor (rhodopsin family)
287512	Ribosomal protein S6e
287513	7 transmembrane receptor (rhodopsin family)
287514	7 transmembrane receptor (rhodopsin family)
287515	7 transmembrane receptor (rhodopsin family)
287516	7 transmembrane receptor (rhodopsin family)
287517	7 transmembrane receptor (rhodopsin family)
287519	7 transmembrane receptor (rhodopsin family)
287520	7 transmembrane receptor (rhodopsin family)
287521	Helix-loop-helix DNA-binding domain
287522	GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins
287523	Domain of unknown function (DUF341)
287525	MYND finger
287526	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
287527	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
287530	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
287533	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
287537	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
287539	Ribosomal protein S2
287543	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
287546	Flavodoxin
287546	pfam02898, NO_synthase, Nitric oxide synthase, oxygenase domain
287546	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
287546	FAD binding domain. This domain is found in sulfite reductase, NADPH cytochrome P450 reductase, Nitric oxide synthase and methionine synthase reductase
287547	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
287551	Ribosomal protein L6
287552	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
287553	Ribosomal protein S19
287556	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa
287557	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes
287558	Ribosomal protein L35Ae
287559	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
287561	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
287562	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
287563	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
287565	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
287566	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
287567	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
287570	Growth-Arrest-Specific Protein 2 Domain
287570	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
287574	WAP-type (Whey Acidic Protein) 'four-disulfide core'
287575	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
287576	Ribosomal protein S8
287582	Homeobox domain
287583	Homeobox domain
287593	Domain of unknown function UPF0099. This domain has no known function
287601	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
287604	Ribosomal protein L21e
287606	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
287607	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
287609	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
287609	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
287609	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
287609	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
287610	Animal haem peroxidase
287611	Ribosomal protein S6e
287613	7 transmembrane receptor (rhodopsin family)
287614	7 transmembrane receptor (rhodopsin family)
287618	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
287619	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
287620	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
287622	VHS domain. Domain present in VPS-27, Hrs and STAM
287622	GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilizes membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins
287623	Ribosomal L29 protein
287624	Kinesin motor domain
287625	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
287625	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
287627	Eukaryotic-type carbonic anhydrase
287631	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
287631	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
287635	Ribosomal L27 protein
287636	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
287636	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
287638	Homeobox domain
287639	Ribosomal L28e protein family
287643	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
287644	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
287645	EAP30. EAP30 is a subunit of the ELL complex. The ELL is an 80-kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of tr
287646	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
287647	Homeobox domain
287648	PH domain. PH stands for pleckstrin homology
287650	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
287651	Ribosomal protein S26e
287658	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
287659	Hyaluronidase
287667	ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER
287670	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
287673	7 transmembrane receptor (rhodopsin family)
287675	Intermediate filament protein
287677	Intermediate filament protein
287678	Intermediate filament protein
287679	Intermediate filament protein
287685	Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up int
287686	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
287691	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
287697	Intermediate filament protein
287697	Male specific sperm protein. This family of drosophila proteins are typified by the repetitive motif C-G-P
287697	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
287698	Intermediate filament protein
287699	Intermediate filament protein
287700	Intermediate filament protein
287701	Intermediate filament protein
287702	Intermediate filament protein
287703	Translation initiation factor SUI1
287704	FKBP-type peptidyl-prolyl cis-trans isomerase
287706	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
287707	ADP-ribosylation factor family
287707	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
287708	ADP-ribosylation factor family
287708	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
287709	ADP-ribosylation factor family
287709	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
287711	Dephospho-CoA kinase. This family catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction
287712	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
287716	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi
287716	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
287717	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
287720	Zinc-binding dehydrogenase
287721	Zinc-binding dehydrogenase
287723	Apoptosis regulator proteins, Bcl-2 family
287726	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
287729	Protein-tyrosine phosphatase
287729	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
287730	Pancreatic hormone peptide
287732	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
287735	Mitochondrial carrier protein
287737	Ribosomal protein L36e
287739	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
287740	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
287742	Dephospho-CoA kinase. This family catalyzes the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction
287745	C2 domain
287745	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
287745	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
287746	Formin Homology 2 Domain
287748	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
287748	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
287751	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes
287753	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
287753	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
287758	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
287760	Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid
287763	Protein kinase domain
287764	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
287765	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
287765	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
287769	Actin
287770	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
287772	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
287774	Sushi domain (SCR repeat)
287775	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
287778	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
287782	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
287784	Ribosomal protein L23
287786	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
287792	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
287801	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
287801	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
287805	7 transmembrane receptor (metabotropic glutamate family)
287807	ENV polyprotein (coat polyprotein)
287808	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
287809	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
287809	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
287810	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
287811	Mitochondrial carrier protein
287814	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
287817	Ribosomal protein L6e
287819	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
287820	Protein of unknown function, DUF270
287821	Protein of unknown function, DUF270
287824	B-box zinc finger
287824	Zinc finger, C3HC4 type (RING finger)
287824	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
287825	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
287827	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
287829	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
287831	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
287833	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
287835	GHMP kinases putative ATP-binding protein
287837	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
287838	Protein kinase domain
287844	Ribosomal protein L11, RNA binding domain
287846	Eukaryotic ribosomal protein L18
287848	Myosin head (motor domain)
287849	Myosin head (motor domain)
287853	Myosin head (motor domain)
287859	Myosin head (motor domain)
287862	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
287863	Myosin head (motor domain)
287868	Sulfate transporter family. Mutations may lead to several human diseases
287868	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
287876	Actin
287877	Delta 1-pyrroline-5-carboxylate reductase
287877	NAD-dependent glycerol-3-phosphate dehydrogenase
287877	pfam02871, Octopine_DH_N, NAD/NADP octopine/nopaline dehydrogenase, NAD binding domain. This group of enzymes act on the CH-NH substrate bond using NAD(+) or NADP(+) as an acceptor. The Pfam family consists mainly of octopine and nopaline dehydrogenases f
287881	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
287888	7 transmembrane receptor (rhodopsin family)
287893	Fructosamine kinase. This family includes eukaryotic fructosamine-3-kinase enzymes. The family also includes bacterial members that have not been characterised but probably have a similar or identical function
287895	14-3-3 protein
287898	7 transmembrane receptor (metabotropic glutamate family)
287900	Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses
287902	Ribosomal L15
287903	HIT family
287905	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p
287908	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
287909	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
287910	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
287911	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
287912	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
287913	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
287915	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
287917	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
287919	Thymosin beta-4 family
287920	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
287923	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
287923	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
287928	FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal
287931	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
287933	Cyclophilin type peptidyl-prolyl cis-trans isomerase
287939	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
287940	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
287940	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
287941	SH2 domain
287942	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
287943	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
287945	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
287947	Ribosomal protein L31e
287950	Proline dehydrogenase
287953	Conserved hypothetical protein 95
287953	ubiE/COQ5 methyltransferase family
287953	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
287953	DNA methylase. Members of this family are DNA methylases. The family contains both N-4 cytosine-specific DNA methylases and N-6 Adenine-specific DNA methylases
287953	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
287953	Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity
287953	Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat
287955	NeuB family. NeuB is the prokaryotic N-acetylneuraminic acid (Neu5Ac) synthase. It catalyses the direct formation of Neu5Ac (the most common sialic acid) by condensation of phosphoenolpyruvate (PEP) and N-acetylmannosamine (ManNAc). This reaction has only
287960	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
287961	CDC45-like protein. CDC45 is an essential gene required for initiation of DNA replication in S. cerevisiae, forming a complex with MCM5/CDC46. Homologs of CDC45 have been identified in human, mouse and smut fungus among others
287964	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
287965	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
287966	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
287968	7 transmembrane receptor (rhodopsin family)
287969	7 transmembrane receptor (rhodopsin family)
287970	7 transmembrane receptor (rhodopsin family)
287971	7 transmembrane receptor (rhodopsin family)
287972	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
287973	7 transmembrane receptor (rhodopsin family)
287974	Zn-finger in Ran binding protein and others
287974	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
287975	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
287979	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa
287980	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
287981	Ribosomal protein S8
287982	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
287986	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
287988	RNA polymerase Rpb8. Rpb8 is a subunit common to the three yeast RNA polymerases, pol I, II and III. Rpb8 interacts with the largest subunit Rpb1, and with Rpb3 and Rpb11, two smaller subunits
287989	Protein kinase domain
287989	Fibronectin type III domain
287989	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
287989	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
287993	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
287994	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
287995	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
287996	Ribosomal protein L21e
287997	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
287999	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
288000	Core histone H2A/H2B/H3/H4
288001	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
288002	Core histone H2A/H2B/H3/H4
288003	Rad17 cell cycle checkpoint protein
288003	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
288004	S-adenosyl-L-homocysteine hydrolase
288005	Ribosomal protein L21e
288006	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
288006	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
288007	Trypsin
288007	CUB domain
288007	Sushi domain (SCR repeat)
288018	Poly-adenylate binding protein, unique domain
288018	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
288019	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
288020	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
288020	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
288021	ENV polyprotein (coat polyprotein)
288024	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
288024	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
288026	E1-E2 ATPase
288026	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
288030	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
288036	Ribosomal protein L31e
288037	Cyclophilin type peptidyl-prolyl cis-trans isomerase
288039	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
288039	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
288040	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
288042	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
288044	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
288046	Ribosomal protein L10
288047	Leishmanolysin
288051	Orotidine 5'-phosphate decarboxylase / HUMPS family. This family includes Orotidine 5'-phosphate decarboxylase enzymes EC:4.1.1.23 that are involved in the final step of pyrimidine biosynthesis. The family also includes enzymes such as hexulose-6-phosphat
288052	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
288052	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
288057	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
288059	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
288060	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
288063	ICE-like protease (caspase) p20 domain
288064	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
288066	Ribosomal protein S5, N-terminal domain
288067	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
288068	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
288070	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
288071	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
288072	Orbivirus VP3 (T2) protein. The orbivirus VP3 protein is part of the virus core and makes a 'subcore' shell made up of 120 copies of the 100K protein. VP3 particles can also bind RNA and are fundamental in the early stages of viral core formation. Also fo
288072	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
288073	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
288074	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
288075	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
288077	Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018
288077	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
288079	DNA polymerase family A
288079	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
288082	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
288083	homogentisate 1,2-dioxygenase. Homogentisate dioxygenase cleaves the aromatic ring during the metabolic degradation of Phe and Tyr. Homogentisate dioxygenase deficiency causes alkaptonuria. The structure of homogentisate dioxygenase shows that the enzyme
288084	TFIIE alpha subunit. The general transcription factor TFIIE has an essential role in eukaryotic transcription initiation together with RNA polymerase II and other general factors. Human TFIIE consists of two subunits TFIIE-alpha and TFIIE-beta and joins t
288085	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
288086	homogentisate 1,2-dioxygenase. Homogentisate dioxygenase cleaves the aromatic ring during the metabolic degradation of Phe and Tyr. Homogentisate dioxygenase deficiency causes alkaptonuria. The structure of homogentisate dioxygenase shows that the enzyme
288087	Intermediate filament protein
288087	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
288089	Ribosomal protein S17
288095	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
288096	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
288097	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
288097	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
288098	Cyclophilin type peptidyl-prolyl cis-trans isomerase
288099	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
288100	Ribosomal protein L21e
288103	Protein kinase domain
288106	7 transmembrane receptor (rhodopsin family)
288108	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
288113	GTPase of unknown function
288114	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
288115	AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this
288117	Cyclic nucleotide-binding domain
288117	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
288118	Trypsin
288122	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
288126	6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each
288127	NAD binding domain of 6-phosphogluconate dehydrogenase. The NAD binding domain of 6-phosphogluconate dehydrogenase adopts a Rossman fold
288127	6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each
288129	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
288130	Ribosomal protein L10
288132	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
288134	Ribosomal protein L6
288136	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
288137	Myosin head (motor domain)
288138	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
288140	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
288140	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
288144	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
288145	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
288147	Eukaryotic porin
288149	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
288151	Homeobox domain
288152	Homeobox domain
288153	Homeobox domain
288154	Homeobox domain
288155	Homeobox domain
288156	Ribosomal protein S26e
288158	Ribosomal protein S7e
288159	HMG (high mobility group) box
288160	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
288161	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
288162	Ribosomal protein L31e
288163	DNA directed RNA polymerase, 7 kDa subunit
288167	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
288171	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
288172	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
288173	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
288174	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
288175	LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo
288176	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
288178	Protein kinase domain
288179	Translationally controlled tumor protein
288179	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
288180	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
288181	7 transmembrane receptor (rhodopsin family)
288182	PMP-22/EMP/MP20/Claudin family
288183	7 transmembrane receptor (rhodopsin family)
288184	7 transmembrane receptor (rhodopsin family)
288185	7 transmembrane receptor (rhodopsin family)
288186	7 transmembrane receptor (rhodopsin family)
288187	7 transmembrane receptor (rhodopsin family)
288187	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
288188	7 transmembrane receptor (rhodopsin family)
288189	7 transmembrane receptor (rhodopsin family)
288190	7 transmembrane receptor (rhodopsin family)
288191	7 transmembrane receptor (rhodopsin family)
288192	7 transmembrane receptor (rhodopsin family)
288193	7 transmembrane receptor (rhodopsin family)
288194	7 transmembrane receptor (rhodopsin family)
288195	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
288196	7 transmembrane receptor (rhodopsin family)
288197	7 transmembrane receptor (rhodopsin family)
288198	7 transmembrane receptor (rhodopsin family)
288199	7 transmembrane receptor (rhodopsin family)
288200	7 transmembrane receptor (rhodopsin family)
288201	7 transmembrane receptor (rhodopsin family)
288202	7 transmembrane receptor (rhodopsin family)
288203	7 transmembrane receptor (rhodopsin family)
288204	QXW lectin repeat
288205	Ribosomal protein L36e
288207	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
288207	Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
288207	Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains.
288208	Fibronectin type III domain
288209	Fibronectin type III domain
288212	Ribosomal protein L34e
288215	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
288215	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
288216	Phosphoglycerate kinase
288217	Eukaryotic initiation factor 1A
288218	Eukaryotic initiation factor 1A
288219	Ribosomal protein L19e
288220	Ribosomal protein L19e
288221	NOL1/NOP2/sun family
288226	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
288230	Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen
288235	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
288241	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
288243	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
288246	Actinobacillus constitutively-expressed outer membrane lipoprotein A
288246	Protein of unknown function (DUF390). This family of long proteins are currently only found in the rice genome. They have no known function. However they may be some kind of transposable element
288246	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
288251	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
288256	Zinc-binding dehydrogenase
288258	Phosphoribosylglycinamide synthetase, C domain. Phosphoribosylglycinamide synthetase catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by Phosphoribosylglycinamide synthetase is the ATP- dependent addition of 5-phosph
288259	Formyl transferase. This family includes the following members. Glycinamide ribonucleotide transformylase catalyses the third step in de novo purine biosynthesis, the transfer of a formyl group to 5'-phosphoribosylglycinamide. Formyltetrahydrofolate defor
288260	Ribosomal protein S7e
288265	Ribosomal protein S7e
288267	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
288276	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
288280	Fibronectin type III domain
288280	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
288281	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
288282	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
288283	Mitochondrial carrier protein
288285	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
288286	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
288287	CUB domain
288287	MAM domain. An extracellular domain found in many receptors
288287	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
288288	CUB domain
288288	MAM domain. An extracellular domain found in many receptors
288288	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
288289	Lectin C-type domain. This family includes both long and short form C-type
288290	Trypsin
288291	Trypsin
288293	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
288294	Enolase, N-terminal domain
288294	Enolase, C-terminal TIM barrel domain
288301	Protein-tyrosine phosphatase
288301	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
288304	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
288305	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
288306	Ubiquitin carboxyl-terminal hydrolase family 2
288307	Ubiquitin carboxyl-terminal hydrolases family 2
288309	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
288310	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
288311	Mitochondrial carrier protein
288312	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
288312	AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this
288313	Ribosomal L29e protein family
288314	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
288314	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
288316	Zinc-binding dehydrogenase
288317	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
288318	Ribosomal L29e protein family
288321	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
288322	Lipase
288323	HMG (high mobility group) box
288328	Eukaryotic initiation factor 4E
288330	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
288332	pfam02922, isoamylase_N, Isoamylase N-terminal domain. This domain is found in a range of enzymes that act on branched substrates - isoamylase, pullulanase and branching enzyme. This family also contains the beta subunit of 5' AMP activated kinase
288333	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str
288340	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
288340	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
288344	Eukaryotic porin
288345	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
288346	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
288347	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
288348	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
288349	Eukaryotic protein of unknown function, DUF279
288352	RNA polymerase Rpb5, C-terminal domain. The assembly domain of Rpb5. The archaeal equivalent to this domain is subunit H. Subunit H lacks the N-terminal domain
288355	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
288356	Homeobox domain
288358	Protein kinase domain
288360	Eukaryotic-type carbonic anhydrase
288362	Myosin head (motor domain)
288364	Queuine tRNA-ribosyltransferase. This is a family of queuine tRNA-ribosyltransferases EC:2.4.2.29, also known as tRNA-guanine transglycosylase and guanine insertion enzyme. Queuine tRNA-ribosyltransferase modifies tRNAs for asparagine, aspartic acid, hist
288366	Ribosomal protein L21e
288367	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
288375	Lectin C-type domain. This family includes both long and short form C-type
288376	Lectin C-type domain. This family includes both long and short form C-type
288377	Lectin C-type domain. This family includes both long and short form C-type
288378	Lectin C-type domain. This family includes both long and short form C-type
288379	Lectin C-type domain. This family includes both long and short form C-type
288381	Lectin C-type domain. This family includes both long and short form C-type
288383	Lectin C-type domain. This family includes both long and short form C-type
288384	Lectin C-type domain. This family includes both long and short form C-type
288385	Lectin C-type domain. This family includes both long and short form C-type
288386	Lectin C-type domain. This family includes both long and short form C-type
288387	Transcription initiation factor TFIID subunit A
288387	Lectin C-type domain. This family includes both long and short form C-type
288388	Lectin C-type domain. This family includes both long and short form C-type
288389	Glutamyl-tRNAGlu reductase. This family use NADPH as a cofactor
288389	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
288391	Zona pellucida-like domain
288394	7 transmembrane receptor (metabotropic glutamate family)
288395	7 transmembrane receptor (metabotropic glutamate family)
288395	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288396	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288397	7 transmembrane receptor (metabotropic glutamate family)
288398	Poly-adenylate binding protein, unique domain
288399	7 transmembrane receptor (metabotropic glutamate family)
288401	7 transmembrane receptor (metabotropic glutamate family)
288402	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288403	7 transmembrane receptor (metabotropic glutamate family)
288404	'chromo' (CHRromatin Organization MOdifier) domain
288405	'chromo' (CHRromatin Organization MOdifier) domain
288406	'chromo' (CHRromatin Organization MOdifier) domain
288407	'chromo' (CHRromatin Organization MOdifier) domain
288408	'chromo' (CHRromatin Organization MOdifier) domain
288409	'chromo' (CHRromatin Organization MOdifier) domain
288410	'chromo' (CHRromatin Organization MOdifier) domain
288412	'chromo' (CHRromatin Organization MOdifier) domain
288413	'chromo' (CHRromatin Organization MOdifier) domain
288415	BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems
288419	'chromo' (CHRromatin Organization MOdifier) domain
288420	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
288421	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
288421	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
288422	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
288422	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
288425	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
288427	ENV polyprotein (coat polyprotein)
288427	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
288428	Proteasome A-type and B-type
288429	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
288430	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288432	'chromo' (CHRromatin Organization MOdifier) domain
288434	'chromo' (CHRromatin Organization MOdifier) domain
288435	'chromo' (CHRromatin Organization MOdifier) domain
288436	'chromo' (CHRromatin Organization MOdifier) domain
288437	'chromo' (CHRromatin Organization MOdifier) domain
288438	'chromo' (CHRromatin Organization MOdifier) domain
288439	'chromo' (CHRromatin Organization MOdifier) domain
288442	Translationally controlled tumor protein
288446	Ribosomal L28e protein family
288448	Eukaryotic ribosomal protein L18
288449	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
288452	Cyclophilin type peptidyl-prolyl cis-trans isomerase
288456	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
288456	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
288456	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
288457	Homeobox domain
288458	RNA polymerases L / 13 to 16 kDa subunit
288459	Ubiquitin carboxyl-terminal hydrolase family 2
288460	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
288462	Hsp90 protein
288463	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
288465	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
288469	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
288469	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
288470	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
288471	Phosphatidylinositol 3- and 4-kinase
288471	FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal
288472	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
288474	ENV polyprotein (coat polyprotein)
288475	Pentaxin family. Pentaxins are also known as pentraxins
288480	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
288481	Fructose-bisphosphate aldolase class-I
288483	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
288485	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
288486	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
288489	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
288490	7 transmembrane receptor (rhodopsin family)
288491	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
288492	Fibronectin type III domain
288493	Fibronectin type III domain
288494	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
288494	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
288496	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
288499	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their o
288504	PX domain. PX domains bind to phosphoinositides
288512	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
288515	Kinesin motor domain
288515	LIM domain. This family represents two copies of the LIM structural domain
288515	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
288516	Kinesin motor domain
288517	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
288518	Protein of unknown function (DUF410). Family of conserved eukaryotic proteins with undetermined function
288527	Maf-like protein. Maf is a putative inhibitor of septum formation in eukaryotes, bacteria, and archaea
288528	7 transmembrane receptor (rhodopsin family)
288529	Connexin
288531	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
288532	MCM2/3/5 family
288537	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288538	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288539	7 transmembrane receptor (metabotropic glutamate family)
288541	7 transmembrane receptor (metabotropic glutamate family)
288541	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288543	7 transmembrane receptor (metabotropic glutamate family)
288544	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
288550	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
288551	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
288556	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
288557	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
288558	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
288560	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
288562	Peptidase family M28D
288562	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
288562	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
288567	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
288569	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
288571	7 transmembrane receptor (metabotropic glutamate family)
288571	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288572	7 transmembrane receptor (metabotropic glutamate family)
288573	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288574	7 transmembrane receptor (metabotropic glutamate family)
288574	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288575	7 transmembrane receptor (metabotropic glutamate family)
288576	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288578	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288579	7 transmembrane receptor (metabotropic glutamate family)
288580	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288581	7 transmembrane receptor (metabotropic glutamate family)
288581	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
288582	PMP-22/EMP/MP20/Claudin family
288583	2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2
288585	ADP-ribosylation factor family
288585	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
288586	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
288588	RNA polymerases L / 13 to 16 kDa subunit
288593	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
288594	Ribosomal protein S15
288595	NOL1/NOP2/sun family
288596	B-box zinc finger
288597	FKBP-type peptidyl-prolyl cis-trans isomerase
288598	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
288598	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
288599	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
288602	ENV polyprotein (coat polyprotein)
288605	PMP-22/EMP/MP20/Claudin family
288606	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
288608	Phosphorylase family 2
288613	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
288614	Ribosomal protein S8
288615	Uncharacterized protein family UPF0023
288618	Ribosomal protein S5, C-terminal domain
288619	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
288620	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
288621	Ribosomal protein S17
288622	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
288623	S25 ribosomal protein
288624	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
288625	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
288632	7 transmembrane receptor (rhodopsin family)
288633	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
288636	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
288637	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
288640	Ribosomal protein L19e
288640	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
288641	Glutaredoxin
288642	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
288646	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
288649	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
288651	Transcription factor Tfb4
288653	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
288655	Phosphatidylinositol transfer protein
288660	Caldesmon
288660	Intermediate filament protein
288660	DNA gyrase/topoisomerase IV, subunit A
288660	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
288660	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
288660	Protein of unknown function (DUF390). This family of long proteins are currently only found in the rice genome. They have no known function. However they may be some kind of transposable element
288660	I/LWEQ domain. I/LWEQ domains bind to actin. It has been shown that the I/LWEQ domains from mouse talin and yeast Sla2p interact with F-actin. I/LWEQ domains can be placed into four major groups based on sequence similarity: (1) Metazoan talin
288660	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
288660	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
288662	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
288664	PH domain. PH stands for pleckstrin homology
288665	Homeobox domain
288665	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
288668	Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity
288669	P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function
288674	Protein kinase domain
288675	Disintegrin
288675	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
288675	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
288677	Cyclophilin type peptidyl-prolyl cis-trans isomerase
288682	PH domain. PH stands for pleckstrin homology
288682	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
288692	WD domain, G-beta repeat
288694	Ribosomal S17
288698	Protein kinase domain
288700	ADP-ribosylation factor family
288700	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
288700	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
288701	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
288723	TPR Domain
288729	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
288730	CBF/Mak21 family
288731	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
288736	Fork head domain
288738	ATP synthase subunit C
288740	NifU-like N terminal domain. This domain is found in NifU in combination with pfam01106. This domain is found on isolated in several bacterial species. The nif genes are responsible for nitrogen fixation. However this domain is found in bacteria that do n
288741	Fic protein family. This family consists of the Fic (filamentation induced by cAMP) protein and its relatives. The Fic protein is involved in cell division and is suggested to be involved in the synthesis of PAB or folate, indicating that the Fic protein
288742	Sulfotransferase protein
288743	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
288743	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
288746	Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans
288747	7 transmembrane receptor (metabotropic glutamate family)
288749	START domain
288752	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
288755	P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function
288758	Eukaryotic-type carbonic anhydrase
288764	Mitosis protein DIM1
288764	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
288768	Zinc finger, C3HC4 type (RING finger)
288770	NAC domain
288772	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
288774	SH2 domain
288774	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. Th
288774	STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STAT
288774	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fac
288777	Ribosomal protein S19
288780	ADP-ribosylation factor family
288780	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
288783	ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER
288786	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
288787	Tellurite resistance protein TehB
288787	ubiE/COQ5 methyltransferase family
288788	7 transmembrane receptor (rhodopsin family)
288789	7 transmembrane receptor (rhodopsin family)
288790	7 transmembrane receptor (rhodopsin family)
288791	ENV polyprotein (coat polyprotein)
288791	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
288792	7 transmembrane receptor (rhodopsin family)
288794	7 transmembrane receptor (rhodopsin family)
288795	7 transmembrane receptor (rhodopsin family)
288796	7 transmembrane receptor (rhodopsin family)
288797	7 transmembrane receptor (rhodopsin family)
288798	7 transmembrane receptor (rhodopsin family)
288799	7 transmembrane receptor (rhodopsin family)
288800	7 transmembrane receptor (rhodopsin family)
288801	7 transmembrane receptor (rhodopsin family)
288802	7 transmembrane receptor (rhodopsin family)
288803	7 transmembrane receptor (rhodopsin family)
288805	7 transmembrane receptor (rhodopsin family)
288806	7 transmembrane receptor (rhodopsin family)
288807	7 transmembrane receptor (rhodopsin family)
288808	7 transmembrane receptor (rhodopsin family)
288808	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
288809	7 transmembrane receptor (rhodopsin family)
288810	ENV polyprotein (coat polyprotein)
288810	7 transmembrane receptor (rhodopsin family)
288811	7 transmembrane receptor (rhodopsin family)
288813	7 transmembrane receptor (rhodopsin family)
288814	7 transmembrane receptor (rhodopsin family)
288815	7 transmembrane receptor (rhodopsin family)
288816	7 transmembrane receptor (rhodopsin family)
288817	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
288817	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
288818	7 transmembrane receptor (rhodopsin family)
288819	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
288819	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
288821	Helix-loop-helix DNA-binding domain
288822	7 transmembrane receptor (rhodopsin family)
288823	7 transmembrane receptor (rhodopsin family)
288824	7 transmembrane receptor (rhodopsin family)
288825	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
288825	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
288826	7 transmembrane receptor (rhodopsin family)
288827	7 transmembrane receptor (rhodopsin family)
288828	7 transmembrane receptor (rhodopsin family)
288829	7 transmembrane receptor (rhodopsin family)
288830	7 transmembrane receptor (rhodopsin family)
288831	7 transmembrane receptor (rhodopsin family)
288832	7 transmembrane receptor (rhodopsin family)
288833	7 transmembrane receptor (rhodopsin family)
288834	7 transmembrane receptor (rhodopsin family)
288835	7 transmembrane receptor (rhodopsin family)
288836	7 transmembrane receptor (rhodopsin family)
288837	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
288838	7 transmembrane receptor (rhodopsin family)
288839	7 transmembrane receptor (rhodopsin family)
288840	7 transmembrane receptor (rhodopsin family)
288840	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
288840	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
288841	7 transmembrane receptor (rhodopsin family)
288841	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
288843	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
288844	7 transmembrane receptor (rhodopsin family)
288845	7 transmembrane receptor (rhodopsin family)
288846	7 transmembrane receptor (rhodopsin family)
288847	7 transmembrane receptor (rhodopsin family)
288848	7 transmembrane receptor (rhodopsin family)
288850	7 transmembrane receptor (rhodopsin family)
288851	7 transmembrane receptor (rhodopsin family)
288852	7 transmembrane receptor (rhodopsin family)
288853	7 transmembrane receptor (rhodopsin family)
288854	7 transmembrane receptor (rhodopsin family)
288855	7 transmembrane receptor (rhodopsin family)
288855	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
288858	7 transmembrane receptor (rhodopsin family)
288859	7 transmembrane receptor (rhodopsin family)
288861	7 transmembrane receptor (rhodopsin family)
288862	7 transmembrane receptor (rhodopsin family)
288864	7 transmembrane receptor (rhodopsin family)
288865	7 transmembrane receptor (rhodopsin family)
288866	7 transmembrane receptor (rhodopsin family)
288867	7 transmembrane receptor (rhodopsin family)
288868	7 transmembrane receptor (rhodopsin family)
288869	7 transmembrane receptor (rhodopsin family)
288870	7 transmembrane receptor (rhodopsin family)
288871	7 transmembrane receptor (rhodopsin family)
288872	7 transmembrane receptor (rhodopsin family)
288873	7 transmembrane receptor (rhodopsin family)
288873	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
288873	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
288874	7 transmembrane receptor (rhodopsin family)
288875	7 transmembrane receptor (rhodopsin family)
288876	7 transmembrane receptor (rhodopsin family)
288876	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
288877	7 transmembrane receptor (rhodopsin family)
288878	7 transmembrane receptor (rhodopsin family)
288879	7 transmembrane receptor (rhodopsin family)
288880	7 transmembrane receptor (rhodopsin family)
288881	7 transmembrane receptor (rhodopsin family)
288882	Ribosomal protein S8e
288883	Ribosomal protein S8e
288884	7 transmembrane receptor (rhodopsin family)
288885	7 transmembrane receptor (rhodopsin family)
288886	7 transmembrane receptor (rhodopsin family)
288887	7 transmembrane receptor (rhodopsin family)
288889	7 transmembrane receptor (rhodopsin family)
288891	7 transmembrane receptor (rhodopsin family)
288892	7 transmembrane receptor (rhodopsin family)
288894	7 transmembrane receptor (rhodopsin family)
288895	7 transmembrane receptor (rhodopsin family)
288897	7 transmembrane receptor (rhodopsin family)
288898	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
288898	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
288899	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
288906	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
288911	Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fact
288912	Initiation factor 2 subunit family. This family includes initiation factor 2B alpha, beta and delta subunits from eukaryotes, initiation factor 2B subunits 1 and 2 from archaebacteria and some proteins of unknown function from prokaryotes. Initiation fact
288915	CTF/NF-I family
288917	tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain
288918	Ribonuclease HII
288919	Anion-transporting ATPase. This Pfam family represents a conserved domain, which is sometimes repeated, in an anion-transporting ATPase. The ATPase is involved in the removal of arsenate, antimonite, and arsenate from the cell
288922	Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine s
288923	Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine s
288925	Uncharacterised protein family (UPF0139)
288929	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
288930	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
288930	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
288932	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
288933	S-adenosylmethionine synthetase, central domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold
288933	S-adenosylmethionine synthetase, C-terminal domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold
288933	S-adenosylmethionine synthetase, N-terminal domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold
288936	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
288937	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
288941	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
288946	Peptidase family M49
288948	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
288949	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
288949	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
288950	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b
288953	Ribosomal S3Ae family
288956	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
288959	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
288963	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
288965	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
288966	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
288967	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
288967	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
288969	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
288974	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
288975	ENV polyprotein (coat polyprotein)
288979	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
288980	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
288982	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
288983	Translationally controlled tumor protein
288986	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
288988	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
288989	Ribosomal protein L36e
288990	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
288990	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
288993	Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region
288995	7 transmembrane receptor (rhodopsin family)
288999	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
289001	Protein kinase domain
289003	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
289005	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289005	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289006	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
289007	Thrombospondin type 1 domain
289010	Sushi domain (SCR repeat)
289010	Sushi domain (SCR repeat)
289013	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
289014	Protein kinase domain
289016	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
289020	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
289021	PX domain. PX domains bind to phosphoinositides
289022	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
289027	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
289032	Chitin binding Peritrophin-A domain. This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruse
289040	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
289041	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
289043	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
289047	LIM domain. This family represents two copies of the LIM structural domain
289048	Homeobox domain
289048	LIM domain. This family represents two copies of the LIM structural domain
289048	LIM domain. This family represents two copies of the LIM structural domain
289049	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
289050	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289050	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289051	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
289051	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
289052	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
289055	Sushi domain (SCR repeat)
289056	Sushi domain (SCR repeat)
289057	Sushi domain (SCR repeat)
289058	Sushi domain (SCR repeat)
289060	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
289064	Ribosomal protein S5, C-terminal domain
289067	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
289068	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
289069	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
289070	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
289071	ENV polyprotein (coat polyprotein)
289074	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
289075	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
289076	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
289077	Ribosomal L29e protein family
289078	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
289080	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
289081	Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of var
289085	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
289085	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
289089	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
289090	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
289091	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289097	Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates
289098	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
289099	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
289100	Ribosomal L15
289103	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
289107	Ribosomal L15
289109	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
289111	Ribosomal protein L23
289114	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289117	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289117	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289118	Ribosomal protein S27
289120	Class I Histocompatibility antigen, domains alpha 1 and 2
289120	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
289121	EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be m
289122	Ribosomal S17
289123	Acyl CoA binding protein
289124	Acyl CoA binding protein
289125	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
289128	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
289128	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
289129	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
289134	Enolase, N-terminal domain
289134	Enolase, C-terminal TIM barrel domain
289137	TSC-22/dip/bun family
289138	Zinc-binding dehydrogenase
289143	Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes
289145	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
289147	7 transmembrane receptor (rhodopsin family)
289151	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
289152	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
289154	GDP dissociation inhibitor
289155	Actin
289156	Amidase
289157	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289157	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289158	Homeobox domain
289160	Synaptobrevin
289165	Ribosomal protein S8e
289166	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
289172	Cyclophilin type peptidyl-prolyl cis-trans isomerase
289173	Adenylate kinase
289175	Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analogu
289180	7 transmembrane receptor (rhodopsin family)
289181	WD domain, G-beta repeat
289182	Uncharacterised protein family (UPF0041)
289187	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
289189	'Cold-shock' DNA-binding domain
289190	HMG (high mobility group) box
289191	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
289191	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
289192	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
289192	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
289193	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
289193	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
289195	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
289197	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
289198	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
289199	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
289200	LIM domain. This family represents two copies of the LIM structural domain
289201	Homeobox domain
289201	LIM domain. This family represents two copies of the LIM structural domain
289203	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
289205	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
289208	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
289210	Ribosomal protein S24e
289212	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
289213	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
289217	Succinate dehydrogenase cytochrome b subunit
289218	Respiratory-chain NADH dehydrogenase, 49 Kd subunit
289219	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
289222	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
289222	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
289223	KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif
289225	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289225	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289234	Death effector domain
289238	7 transmembrane receptor (rhodopsin family)
289239	7 transmembrane receptor (rhodopsin family)
289240	7 transmembrane receptor (rhodopsin family)
289241	7 transmembrane receptor (rhodopsin family)
289242	7 transmembrane receptor (rhodopsin family)
289243	Pentaxin family. Pentaxins are also known as pentraxins
289245	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
289245	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
289246	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
289246	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
289247	7 transmembrane receptor (rhodopsin family)
289248	7 transmembrane receptor (rhodopsin family)
289249	7 transmembrane receptor (rhodopsin family)
289250	7 transmembrane receptor (rhodopsin family)
289251	7 transmembrane receptor (rhodopsin family)
289252	7 transmembrane receptor (rhodopsin family)
289253	7 transmembrane receptor (rhodopsin family)
289254	7 transmembrane receptor (rhodopsin family)
289255	7 transmembrane receptor (rhodopsin family)
289256	7 transmembrane receptor (rhodopsin family)
289257	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
289258	7 transmembrane receptor (rhodopsin family)
289259	7 transmembrane receptor (rhodopsin family)
289260	7 transmembrane receptor (rhodopsin family)
289261	7 transmembrane receptor (rhodopsin family)
289264	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
289264	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerbe
289265	Ribosomal S3Ae family
289266	GDP dissociation inhibitor
289267	7 transmembrane receptor (rhodopsin family)
289269	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
289276	Adenylosuccinate synthetase
289279	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
289282	Elongation factor 1 gamma, conserved domain
289284	Ribosomal protein L14p/L23e
289288	Protein kinase domain
289290	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
289291	Protein kinase domain
289292	7 transmembrane receptor (metabotropic glutamate family)
289292	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
289293	7 transmembrane receptor (metabotropic glutamate family)
289294	7 transmembrane receptor (metabotropic glutamate family)
289295	7 transmembrane receptor (metabotropic glutamate family)
289295	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
289296	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289296	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289297	Low molecular weight phosphotyrosine protein phosphatase
289298	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289299	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
289300	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289301	7 transmembrane receptor (metabotropic glutamate family)
289301	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
289302	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
289302	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289302	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289303	7 transmembrane receptor (metabotropic glutamate family)
289303	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
289304	7 transmembrane receptor (metabotropic glutamate family)
289305	7 transmembrane receptor (metabotropic glutamate family)
289305	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
289311	Homeobox domain
289312	Acyl CoA binding protein
289314	Core histone H2A/H2B/H3/H4
289316	Transforming growth factor beta like domain
289316	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
289317	Delta 1-pyrroline-5-carboxylate reductase
289318	Transforming growth factor beta like domain
289318	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
289319	Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length
289320	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
289322	Fatty acid desaturase
289323	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
289324	Cornichon protein
289325	WD domain, G-beta repeat
289326	Ribosomal protein L36e
289330	Ribosomal protein S11
289333	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
289335	Sushi domain (SCR repeat)
289336	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
289337	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
289337	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
289339	Ribosomal protein L6
289343	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
289344	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
289349	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289349	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289353	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
289354	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
289355	Ribosomal protein L21e
289357	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
289358	Ribosomal protein L21e
289364	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
289365	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
289367	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
289368	Thrombospondin N-terminal -like domain
289369	Fibronectin type III domain
289370	Fibronectin type III domain
289374	MYND finger
289378	TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme
289380	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
289381	Ribosomal L29e protein family
289384	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
289390	Ribosomal protein S5, C-terminal domain
289391	Ribosomal protein L21e
289392	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
289394	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
289395	Sushi domain (SCR repeat)
289396	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
289397	WD domain, G-beta repeat
289401	Ribosomal protein L31e
289403	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
289404	Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA
289405	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
289407	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
289408	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
289411	7 transmembrane receptor (rhodopsin family)
289414	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
289415	Actin
289416	Prolyl oligopeptidase family
289417	Sushi domain (SCR repeat)
289419	Protein kinase domain
289420	Ribosomal L15
289421	Cyclophilin type peptidyl-prolyl cis-trans isomerase
289422	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289424	EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be m
289425	Fibronectin type III domain
289426	Ribosomal protein L31e
289427	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
289428	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
289430	MOSC N-terminal beta barrel domain. This domain is found to the N-terminus of pfam03473. The function of this domain is unknown, however it is predicted to adopt a beta barrel fold
289431	7 transmembrane receptor (metabotropic glutamate family)
289432	7 transmembrane receptor (metabotropic glutamate family)
289432	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
289432	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
289433	7 transmembrane receptor (metabotropic glutamate family)
289433	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
289434	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289434	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289435	7 transmembrane receptor (metabotropic glutamate family)
289436	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
289438	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
289439	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
289439	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
289440	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
289445	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
289446	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
289448	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
289448	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
289450	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
289451	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
289452	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
289456	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
289457	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
289459	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa
289460	Protein-tyrosine phosphatase
289460	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
289462	PH domain. PH stands for pleckstrin homology
289469	Ribosomal protein S18
289473	NAC domain
289475	Ribosomal protein S2
289481	FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain
289481	FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain
289482	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
289485	Protein kinase domain
289488	Giardia variant-specific surface protein
289490	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
289491	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
289498	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
289501	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
289503	'Cold-shock' DNA-binding domain
289505	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
289505	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
289509	Ribosomal protein L21e
289510	Ribosomal protein S6e
289516	pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
289517	Enolase, N-terminal domain
289517	Enolase, C-terminal TIM barrel domain
289518	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
289522	60Kd inner membrane protein
289531	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
289532	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289533	UDP-glucoronosyl and UDP-glucosyl transferase
289533	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
289534	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
289534	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
289535	UDP-glucoronosyl and UDP-glucosyl transferase
289535	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
289536	UDP-glucoronosyl and UDP-glucosyl transferase
289536	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
289537	UDP-glucoronosyl and UDP-glucosyl transferase
289537	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
289538	UDP-glucoronosyl and UDP-glucosyl transferase
289538	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
289539	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
289540	UDP-glucoronosyl and UDP-glucosyl transferase
289541	UDP-glucoronosyl and UDP-glucosyl transferase
289543	UDP-glucoronosyl and UDP-glucosyl transferase
289543	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
289544	Trypsin
289545	ATP synthase subunit C
289546	Trypsin
289547	Trypsin
289547	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
289549	Protein kinase domain
289551	Fibronectin type III domain
289557	Ribosomal protein L36e
289560	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
289561	RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase comp
289565	ADP-ribosylation factor family
289565	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
289569	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
289570	Phosducin
289571	Uncharacterized protein family UPF0016. This family contains integral membrane proteins of unknown function. Most members of the family contain two copies of a region that contains an EXGD motif. Each of these regions contains three predicted transmembran
289574	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
289575	Protein kinase domain
289581	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
289588	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
289596	Low-density lipoprotein receptor domain class A
289596	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
289596	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
289596	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
289600	Trypsin
289606	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
289606	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
289607	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
289608	Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase
289610	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
289611	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
289612	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
289612	Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains.
289614	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
289618	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289619	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289619	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289623	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
289623	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
289625	Glycosyl hydrolase family 1
289627	Glycosyl hydrolase family 1
289628	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
289630	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
289638	Cyclophilin type peptidyl-prolyl cis-trans isomerase
289639	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289639	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289641	WD domain, G-beta repeat
289642	7 transmembrane receptor (metabotropic glutamate family)
289643	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
289647	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
289648	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
289651	C-5 cytosine-specific DNA methylase
289654	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
289656	Ribosomal protein L23
289662	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
289663	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
289664	LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located in
289665	LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located in
289666	LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located in
289668	Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase
289671	Intermediate filament protein
289671	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
289672	Hsp90 protein
289673	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289676	Ribosomal protein L13e
289677	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
289679	Ribosomal protein S6e
289680	Ribosomal S3Ae family
289686	Galactoside-binding lectin
289687	Glycosyl hydrolase family 1
289691	Ribosomal L10
289692	Ribosomal protein L36e
289693	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
289699	Eukaryotic initiation factor 4E
289701	Ribosomal protein S17
289706	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
289710	ENV polyprotein (coat polyprotein)
289715	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
289720	Saposin A-type domain
289721	LIM domain. This family represents two copies of the LIM structural domain
289722	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
289724	Nop14-like family. Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production
289727	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
289734	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
289738	PH domain. PH stands for pleckstrin homology
289741	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
289743	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
289743	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
289744	Ribosomal protein L6
289747	LIF / OSM family
289749	Intermediate filament protein
289749	Transposase. Transposase proteins are necessary for efficient DNA transposition. Contains transposases for IS204, IS1001, IS1096 and IS1165
289750	Growth-Arrest-Specific Protein 2 Domain
289751	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
289753	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
289755	Eukaryotic ribosomal protein L18
289756	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
289757	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to
289757	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
289761	P53
289764	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289764	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289767	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289767	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289770	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
289771	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
289777	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
289779	Ribosomal protein L13
289784	Core histone H2A/H2B/H3/H4
289786	NAC domain
289793	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
289794	Protein-tyrosine phosphatase
289795	Ribosomal protein L36e
289797	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
289798	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289798	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289799	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289799	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289801	Phosphorylase family. Members of this family include: purine nucleoside phosphorylase (PNP) Uridine phosphorylase (UdRPase) 5'-methylthioadenosine phosphorylase (MTA phosphorylase)
289804	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
289806	'Cold-shock' DNA-binding domain
289808	Cytochrome oxidase c subunit VIII. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIII
289814	Ribosomal protein L21e
289818	Ribosomal protein S5, C-terminal domain
289818	Ribosomal protein S5, N-terminal domain
289821	Ribosomal protein S27
289824	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
289825	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
289827	UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate urid
289828	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289829	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
289830	Ribosomal L28e protein family
289835	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
289843	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289843	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289844	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289844	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289848	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
289849	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
289851	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
289852	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
289853	Ribosomal protein S26e
289858	S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure
289862	Ribosomal protein L19e
289867	WD domain, G-beta repeat
289869	Acylphosphatase
289871	Macrophage migration inhibitory factor (MIF)
289872	Ribosomal protein L21e
289873	7 transmembrane receptor (rhodopsin family)
289877	Ku70/Ku80 C-terminal arm. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the C terminal arm. Thi
289878	3'5'-cyclic nucleotide phosphodiesterase
289879	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
289879	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
289881	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
289883	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
289884	Ribosomal S3Ae family
289887	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
289888	UDP-glucoronosyl and UDP-glucosyl transferase
289889	7 transmembrane receptor (metabotropic glutamate family)
289890	7 transmembrane receptor (metabotropic glutamate family)
289890	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
289891	7 transmembrane receptor (metabotropic glutamate family)
289891	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
289891	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
289897	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
289897	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
289899	7 transmembrane receptor (rhodopsin family)
289902	Ribosomal protein S16
289910	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
289914	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isoform
289915	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
289919	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
289920	ENV polyprotein (coat polyprotein)
289921	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
289923	7 transmembrane receptor (rhodopsin family)
289924	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
289927	Cadherin domain
289929	Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15
289930	Translationally controlled tumor protein
289932	Ribosomal protein S7e
289933	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
289934	Beta-ketoacyl synthase, C-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains
289936	HMG (high mobility group) box
289939	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
289943	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
289944	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
289950	Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site
289950	Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases
289951	Filamin/ABP280 repeat
289957	7 transmembrane receptor (rhodopsin family)
289961	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
289961	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
289965	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
289968	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
289971	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
289973	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
289974	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
289975	Glutaredoxin
289976	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
289988	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
289989	Core histone H2A/H2B/H3/H4
289990	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
289990	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
289993	Protein-tyrosine phosphatase
289993	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
289994	Cornichon protein
289996	Translationally controlled tumor protein
290000	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
290001	7 transmembrane receptor (rhodopsin family)
290002	7 transmembrane receptor (rhodopsin family)
290003	7 transmembrane receptor (rhodopsin family)
290004	7 transmembrane receptor (rhodopsin family)
290005	7 transmembrane receptor (rhodopsin family)
290006	7 transmembrane receptor (rhodopsin family)
290007	7 transmembrane receptor (rhodopsin family)
290008	7 transmembrane receptor (rhodopsin family)
290009	7 transmembrane receptor (rhodopsin family)
290010	7 transmembrane receptor (rhodopsin family)
290011	7 transmembrane receptor (rhodopsin family)
290012	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
290013	7 transmembrane receptor (rhodopsin family)
290014	7 transmembrane receptor (rhodopsin family)
290015	7 transmembrane receptor (rhodopsin family)
290017	7 transmembrane receptor (rhodopsin family)
290018	7 transmembrane receptor (rhodopsin family)
290018	Riboflavin kinase / FAD synthetase. This family consists part of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activitys. They catalyse the 5'-phos
290019	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
290020	7 transmembrane receptor (rhodopsin family)
290021	7 transmembrane receptor (rhodopsin family)
290022	7 transmembrane receptor (rhodopsin family)
290023	7 transmembrane receptor (rhodopsin family)
290024	7 transmembrane receptor (rhodopsin family)
290025	7 transmembrane receptor (rhodopsin family)
290026	Ribosomal protein L35Ae
290027	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
290028	Glycoprotease family
290029	Phosphorylase family 2
290030	S-adenosyl-L-homocysteine hydrolase
290031	Proteasome A-type and B-type
290033	Core histone H2A/H2B/H3/H4
290035	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
290036	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
290039	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
290040	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
290043	Ribosomal protein L31e
290044	7 transmembrane receptor (rhodopsin family)
290045	Ribosomal protein L19e
290046	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
290049	7 transmembrane receptor (rhodopsin family)
290050	7 transmembrane receptor (rhodopsin family)
290051	7 transmembrane receptor (rhodopsin family)
290055	7 transmembrane receptor (rhodopsin family)
290056	7 transmembrane receptor (rhodopsin family)
290057	Enolase, N-terminal domain
290057	Enolase, C-terminal TIM barrel domain
290058	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290059	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290061	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290062	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290064	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290066	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290067	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290069	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290070	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290071	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290072	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290073	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290074	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290075	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290076	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290077	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290078	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290079	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290080	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290081	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290084	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290085	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290086	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290087	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290088	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290089	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290090	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290091	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290092	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290093	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290094	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290095	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290097	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290098	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290099	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290101	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290104	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290105	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290106	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290107	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290108	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290110	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290111	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290112	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290113	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290114	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290115	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290117	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290118	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290119	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290120	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290121	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290122	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290123	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290124	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290125	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290130	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290131	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290132	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290133	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290134	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290135	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290136	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290137	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290138	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290139	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290140	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290141	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290145	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290147	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290148	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290149	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290150	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290151	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290152	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290154	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290155	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290156	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290157	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290161	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290162	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290163	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290164	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290165	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290166	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290167	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290168	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290169	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290170	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290172	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290173	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290174	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290175	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290176	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290178	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290179	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290181	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290182	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290185	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290186	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290187	Hsp90 protein
290188	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290190	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290191	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290192	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290196	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290197	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290199	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290202	7 transmembrane receptor (rhodopsin family)
290203	Ribosomal protein L34e
290204	60Kd inner membrane protein
290206	Proteasome A-type and B-type
290207	Cadherin domain
290207	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
290210	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
290211	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
290215	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
290218	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
290219	Ribosomal L29e protein family
290221	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
290222	Phosphoenolpyruvate carboxykinase. Catalyses the formation of phosphoenolpyruvate by decarboxylation of oxaloacetate
290224	Uncharacterised protein family (UPF0172)
290226	Ribosomal protein S19
290229	Protein kinase domain
290231	Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest
290233	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
290239	Ribosomal protein L21e
290240	Trypsin
290241	Trypsin
290242	Trypsin
290243	Trypsin
290244	Trypsin
290245	Trypsin
290247	Trypsin
290248	Trypsin
290249	Trypsin
290250	Trypsin
290251	Trypsin
290252	Trypsin
290253	Trypsin
290255	Trypsin
290256	Trypsin
290257	Trypsin
290259	Trypsin
290260	Trypsin
290261	Trypsin
290262	Trypsin
290263	Trypsin
290264	Trypsin
290265	Trypsin
290266	Trypsin
290267	Trypsin
290271	Ribosomal L39 protein
290272	KOW motif. This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG
290276	Ribosomal protein L23
290277	3-hydroxyacyl-CoA dehydrogenase, C-terminal domain. This family also includes lambda crystallin. Some proteins include two copies of this domain
290281	Ribosomal protein L35Ae
290283	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
290288	ENV polyprotein (coat polyprotein)
290291	Mo25 protein family
290292	Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However
290295	Ets-domain
290295	E1-E2 ATPase
290298	Cyclophilin type peptidyl-prolyl cis-trans isomerase
290299	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
290301	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
290303	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
290306	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
290310	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290310	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290311	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
290312	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
290317	HMG (high mobility group) box
290319	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
290323	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
290324	Ribosomal S17
290326	Protein kinase domain
290332	Ribosomal protein L6, N-terminal domain
290338	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
290340	'chromo' (CHRromatin Organization MOdifier) domain
290341	WD domain, G-beta repeat
290342	WD domain, G-beta repeat
290344	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
290344	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
290344	Disintegrin
290344	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
290344	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
290345	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
290346	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
290347	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
290348	ENV polyprotein (coat polyprotein)
290350	Lysyl oxidase
290352	Death domain
290354	LIM domain. This family represents two copies of the LIM structural domain
290354	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
290359	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
290360	Importin-beta N-terminal domain
290361	PTB domain (IRS-1 type)
290363	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
290364	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
290365	NUDIX domain
290367	CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP
290368	ENV polyprotein (coat polyprotein)
290370	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
290376	Ribosomal protein S5, C-terminal domain
290377	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
290378	PH domain. PH stands for pleckstrin homology
290379	Protein kinase domain
290384	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
290386	Hsp90 protein
290387	Zinc-binding dehydrogenase
290388	Ribosomal S17
290390	Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in
290392	ENV polyprotein (coat polyprotein)
290400	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
290401	Dienelactone hydrolase family
290401	Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family
290402	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
290406	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results
290409	Olfactomedin-like domain
290410	NOL1/NOP2/sun family
290412	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
290412	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
290414	Ribosomal L15
290415	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
290415	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
290422	Eukaryotic porin
290426	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
290426	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
290428	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
290429	Pou domain - N-terminal to homeobox domain
290432	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
290433	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
290434	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
290434	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
290435	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
290438	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290441	Phosphotyrosine interaction domain (PTB/PID)
290442	LIM domain. This family represents two copies of the LIM structural domain
290443	Aminotransferase class I and II
290445	Nucleoside diphosphate kinase
290449	Ribosomal L29e protein family
290452	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290452	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290458	Zinc finger, C3HC4 type (RING finger)
290459	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
290459	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
290461	ENV polyprotein (coat polyprotein)
290463	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290463	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290465	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290465	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290466	Actin
290467	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
290470	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
290471	Ribosomal protein S17
290472	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
290474	Glypican
290476	Glypican
290477	Ribosomal protein L21e
290479	Ribosomal protein S6e
290480	Glypican
290481	Glypican
290483	Glypican
290484	Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372
290485	PMP-22/EMP/MP20/Claudin family
290492	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290492	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290493	7 transmembrane receptor (rhodopsin family)
290496	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
290497	Tim10/DDP family zinc finger. Putative zinc binding domain with four conserved cysteine residues. This domain is found in X-linked deafness dystonia protein. Members of this family such as Tim9 and Tim10 are involved in mitochondrial protein import. Membe
290498	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
290499	FKBP-type peptidyl-prolyl cis-trans isomerase
290500	Uncharacterised protein family (UPF0131). This family of proteins are uncharacterised, however BtrG is part of a butirosin-biosynthetic gene cluster from Bacillus circulans
290501	TPR Domain
290503	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
290503	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
290507	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
290510	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
290511	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290512	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290513	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290514	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290515	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290516	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290521	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
290522	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
290524	Uncharacterised protein family (UPF0131). This family of proteins are uncharacterised, however BtrG is part of a butirosin-biosynthetic gene cluster from Bacillus circulans
290528	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
290530	Homeobox domain
290530	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
290531	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
290534	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
290537	Phosphotyrosine interaction domain (PTB/PID)
290538	Homeobox domain
290541	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
290542	Ribosomal protein L21e
290544	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
290547	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
290548	Ribosomal protein L21e
290548	Ribosomal protein S7e
290551	GMC oxidoreductase. This family of proteins bind FAD as a cofactor
290556	GTPase of unknown function
290557	HMG (high mobility group) box
290562	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
290566	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
290567	Choline/Carnitine o-acyltransferase
290570	PH domain. PH stands for pleckstrin homology
290574	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
290575	LIM domain. This family represents two copies of the LIM structural domain
290592	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
290594	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
290596	Uncharacterized protein family UPF0005
290597	Acyl CoA binding protein
290597	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
290601	ENV polyprotein (coat polyprotein)
290604	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290605	Ribosomal protein S2
290606	ENV polyprotein (coat polyprotein)
290607	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
290608	CAP protein
290610	Putative snoRNA binding domain. This family consists of various Pre RNA processing ribonucleoproteins. The function of the aligned region is unknown however it may be a common RNA or snoRNA or Nop1p binding domain. Nop5p (Nop58p) from yeast is the protein
290611	SH2 domain
290613	Myosin head (motor domain)
290613	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
290616	Calreticulin family
290620	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
290623	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
290624	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
290625	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
290632	Ribosomal protein L34
290633	GTPase of unknown function
290634	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290636	Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase
290637	Glycosyltransferase family 25 (LPS biosynthesis protein). Members of this family belong to Glycosyltransferase family 25 This is a family of glycosyltransferases involved in lipopolysaccharide (LPS) biosynthesis. These enzymes catalyse the transfer of var
290638	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
290641	Ribosomal L18ae protein family
290644	Gamma interferon inducible lysosomal thiol reductase (GILT). This family includes the two characterized human gamma-interferon-inducible lysosomal thiol reductase (GILT) sequences. It also contains several other eukaryotic putative proteins with similarit
290645	Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is
290646	3'5'-cyclic nucleotide phosphodiesterase
290647	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
290648	Pyroglutamyl peptidase
290651	Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play
290656	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
290657	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
290657	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
290658	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
290660	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
290661	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
290662	Mitochondrial carrier protein
290666	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
290670	Protein kinase domain
290674	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
290676	Ribosomal protein L17
290677	Skp1 family, tetramerisation domain
290678	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
290680	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
290681	N-acetyltransferase. Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30kDa. It facilitates the transfer of an acetyl group from Acetyl Coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation o
290682	Enolase, C-terminal TIM barrel domain
290682	Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain
290683	Ribosomal protein L35Ae
290685	Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site
290685	Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases
290685	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
290685	D-xylulose 5-phosphate/D-fructose 6-phosphate phosphoketolase. Bacterial enzyme splits fructose-6-P and/or xylulose-5-P with the aid of inorganic phosphate into either acetyl-P and erythrose-4-P and/or acetyl-P and glyeraldehyde-3-P EC:4.1.2.9, EC:4.1.2.2
290685	Transketolase, thiamine diphosphate binding domain. This family includes transketolase enzymes EC:2.2.1.1. and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit EC:1.2.4.4. Both these enzymes utilise thiamine pyrophosphate as a cofacto
290687	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290687	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290690	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
290690	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
290691	Actin
290692	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
290694	CUB domain
290694	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family
290695	CUB domain
290696	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290696	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290697	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
290698	Ribosomal L10
290700	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
290700	Gag polyprotein, inner coat protein p12. The retroviral p12 is a virion structural protein. p12 is proline rich. The function carried out by p12 in assembly and replication is unknown. p12 is associated with pathogenicity of the virus
290704	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290707	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290708	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
290711	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
290713	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
290716	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
290716	Gag polyprotein, inner coat protein p12. The retroviral p12 is a virion structural protein. p12 is proline rich. The function carried out by p12 in assembly and replication is unknown. p12 is associated with pathogenicity of the virus
290719	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290719	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290720	Phosphatidylethanolamine-binding protein
290721	Enolase, C-terminal TIM barrel domain
290724	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290726	Disintegrin
290726	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
290726	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
290727	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
290728	Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase
290729	Formamidopyrimidine-DNA glycosylase
290733	DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117
290734	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
290738	Ribosomal protein S27
290741	Cytidine and deoxycytidylate deaminase zinc-binding region
290744	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
290752	Mitochondrial carrier protein
290756	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290756	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290757	Trypsin
290757	PAN domain. The PAN domain contains a conserved core of three disulphide bridges. In some members of the family there is an additional fourth disulphide bridge the links the N and C termini of the domain. The domain is found in diverse proteins, in some t
290758	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
290760	Ribosomal protein L21e
290761	Fructose-bisphosphate aldolase class-I
290763	Disintegrin
290763	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
290763	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
290766	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
290767	Zinc finger, C3HC4 type (RING finger)
290767	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
290768	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
290768	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
290772	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
290773	Disintegrin
290773	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
290773	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
290774	Disintegrin
290774	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
290774	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
290775	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
290778	WD domain, G-beta repeat
290781	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
290781	MH1 domain. This is the MH1 (MAD homology 1) domain found at the amino terminus of MAD related proteins. This domain can bind to DNA. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain sh
290784	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290784	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290787	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
290789	PMP-22/EMP/MP20/Claudin family
290792	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
290794	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
290795	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
290797	Vacuolar protein sorting 55. Vps55 is involved in the secretion of the Golgi form of the soluble vacuolar carboxypeptidase Y, but not the trafficking of the membrane-bound vacuolar alkaline phosphatase. Both Vps55 and obesity receptor gene-related protein
290799	Ribosomal protein L21e
290802	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
290804	Ribosomal protein S8
290805	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
290805	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
290805	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termin
290807	Ribosomal protein S8
290810	Hsp90 protein
290812	Zinc finger, C3HC4 type (RING finger)
290817	Ribosomal protein S7e
290818	Protein kinase domain
290819	Protein kinase domain
290820	Ribosomal protein L13
290825	Ribosomal protein L21e
290826	Calcium-activated BK potassium channel alpha subunit
290827	Ribosomal protein S28e
290829	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
290834	Disintegrin
290834	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
290837	Ribosomal protein S8e
290840	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
290843	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
290844	Homeobox domain
290846	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
290849	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290849	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290851	Vacuolar protein sorting 36. Vps36 is involved in Golgi to endosome trafficking
290852	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
290856	Defensin propeptide
290857	Defensin propeptide
290858	Mammalian defensin
290858	Defensin propeptide
290859	Defensin propeptide
290860	Mammalian defensin
290860	Defensin propeptide
290861	Beta defensin. The beta defensins are antimicrobial peptides implicated in the resistance of epithelial surfaces to microbial colonization
290862	ENV polyprotein (coat polyprotein)
290864	Ubiquitin carboxyl-terminal hydrolase family 2
290866	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
290867	Ubiquitin carboxyl-terminal hydrolase family 2
290868	tRNA synthetases class I (E and Q), anti-codon binding domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase a
290869	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
290869	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
290875	TPR Domain
290878	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
290879	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
290880	ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisin
290881	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
290885	CUB domain
290885	Sushi domain (SCR repeat)
290887	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
290888	CUB domain
290888	Sushi domain (SCR repeat)
290890	CUB domain
290890	Sushi domain (SCR repeat)
290891	Sushi domain (SCR repeat)
290892	Sushi domain (SCR repeat)
290894	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
290897	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
290898	tRNA synthetases class I (C). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only cysteinyl tRNA synthetases
290901	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290901	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290902	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
290902	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
290905	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
290907	ATP dependent DNA ligase domain. This domain belongs to a more diverse superfamily, including pfam01331 and pfam01653
290907	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
290908	Ribosomal protein S12
290909	Mitochondrial carrier protein
290916	Ribosomal protein L23
290917	HMG (high mobility group) box
290918	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
290921	Transforming growth factor beta like domain
290921	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
290922	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
290923	Asparaginase
290926	PBX domain. The PBX domain is a bipartite acidic domain
290928	Ribosomal protein L6
290931	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
290935	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
290936	Ribosomal protein S8e
290936	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
290937	e3 binding domain. This family represents a small domain of the E2 subunit of 2-oxo-acid dehydrogenases responsible for the binding of the E3 subunit
290937	2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain
290937	Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins
290944	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
290944	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
290947	Flavodoxin
290947	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
290947	FAD binding domain. This domain is found in sulfite reductase, NADPH cytochrome P450 reductase, Nitric oxide synthase and methionine synthase reductase
290948	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
290949	Ribosomal protein L21e
290950	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
290951	Protein kinase domain
290953	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
290954	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
290954	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
290956	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
290956	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
290956	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
290957	7 transmembrane receptor (rhodopsin family)
290960	Ribosomal protein L15
290962	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
290963	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
290964	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
290965	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
290966	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
290968	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
290969	Papain family cysteine protease
290969	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
290971	Papain family cysteine protease
290977	Papain family cysteine protease
290977	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
290980	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
290981	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
290982	SAM dependent carboxyl methyltransferase. This family of plant methyltransferases contains enzymes that act on a variety of substrates including salicylic acid, jasmonic acid and 7-Methylxanthine. Caffeine is synthesized through sequential three-step meth
290983	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
290983	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
290984	PAP/25A associated domain
290984	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
290985	HesB-like domain. This family includes HesB which may be involved in nitrogen fixation
290987	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
290989	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
290990	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
290991	Actin
290994	Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-2
290996	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
291000	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
291003	Eukaryotic initiation factor 4E
291004	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
291006	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
291007	Cyclophilin type peptidyl-prolyl cis-trans isomerase
291009	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291014	Protein kinase domain
291016	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
291017	Ribosomal protein L6e
291023	Helix-loop-helix DNA-binding domain
291028	Ribosomal protein L21e
291029	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
291030	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
291031	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
291035	Ribosomal L29e protein family
291036	Ribosomal protein L36e
291038	Ribosomal protein L21e
291044	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
291047	GCM motif protein
291050	Ribosomal protein L6
291051	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
291053	Ribosomal protein L13
291055	Microtubule associated protein (MAP65/ASE1 family)
291057	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
291059	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291059	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291062	Translocon-associated protein (TRAP), alpha subunit. The alpha-subunit of the TRAP complex (TRAP alpha) is a single-spanning membrane protein of the endoplasmic reticulum (ER) which is found in proximity of nascent polypeptide chains translocating across
291063	Triosephosphate isomerase
291064	Ribosomal protein L6
291064	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
291065	Enolase, C-terminal TIM barrel domain
291067	tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain
291069	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
291073	Ribosomal protein L19e
291075	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
291076	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
291078	Protein kinase domain
291079	HMG (high mobility group) box
291082	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
291083	Ribosomal protein L13e
291084	Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones
291085	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
291086	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
291087	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
291089	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
291090	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
291091	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
291094	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
291095	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
291097	Fork head domain
291100	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved trypt
291103	Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines
291103	pfam02921, UCR_TM, Ubiquinol cytochrome reductase transmembrane region. Each subunit of the cytochrome bc1 complex provides a single helix (this family) to make up the transmembrane region of the complex
291104	Ribosomal protein L21e
291106	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
291113	Core histone H2A/H2B/H3/H4
291114	Somatotropin hormone family
291115	Somatotropin hormone family
291116	Somatotropin hormone family
291117	Somatotropin hormone family
291119	Somatotropin hormone family
291121	Somatotropin hormone family
291122	Ribosomal protein L19e
291122	Somatotropin hormone family
291123	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
291124	Somatotropin hormone family
291126	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
291126	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
291130	Doublecortin
291131	Doublecortin
291132	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
291133	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
291134	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
291135	Uncharacterized protein PaaI, COG2050
291138	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
291138	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
291141	Core histone H2A/H2B/H3/H4
291142	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
291143	Core histone H2A/H2B/H3/H4
291144	Core histone H2A/H2B/H3/H4
291145	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
291146	Zinc finger, C3HC4 type (RING finger)
291146	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
291147	Protein kinase domain
291147	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
291148	Core histone H2A/H2B/H3/H4
291148	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
291149	Core histone H2A/H2B/H3/H4
291152	Core histone H2A/H2B/H3/H4
291153	Core histone H2A/H2B/H3/H4
291153	Uracil DNA glycosylase superfamily
291154	Core histone H2A/H2B/H3/H4
291154	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
291155	Core histone H2A/H2B/H3/H4
291156	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
291157	Core histone H2A/H2B/H3/H4
291158	Core histone H2A/H2B/H3/H4
291158	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
291159	Core histone H2A/H2B/H3/H4
291160	Core histone H2A/H2B/H3/H4
291161	7 transmembrane receptor (rhodopsin family)
291162	7 transmembrane receptor (rhodopsin family)
291167	HMG (high mobility group) box
291169	Glutathione peroxidase
291171	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
291172	7 transmembrane receptor (rhodopsin family)
291173	7 transmembrane receptor (rhodopsin family)
291175	7 transmembrane receptor (rhodopsin family)
291179	7 transmembrane receptor (rhodopsin family)
291179	7 transmembrane receptor (rhodopsin family)
291180	Ependymin
291183	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
291188	Homeobox domain
291189	CAIB/BAIF family. This is a family of enzymes with diverse function, including fatty-acid CoA racemase enzymes such as arylpropionyl-CoA epimerase a key enzyme in the inversion metabolism of ibuprofen, carnitine dehydratase (CAIB) (EC 4.2.1.89) and bile a
291190	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291190	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291192	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291193	Macrophage migration inhibitory factor (MIF)
291198	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi
291198	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
291200	Cyclophilin type peptidyl-prolyl cis-trans isomerase
291201	CBF/Mak21 family
291204	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
291205	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
291208	C2 domain
291211	Polyprenyl synthetase
291217	Ribosomal protein S24e
291220	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
291221	ribosomal L5P family C-terminus. This region is found associated with pfam00281
291222	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
291222	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
291223	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
291224	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
291224	Gag polyprotein, inner coat protein p12. The retroviral p12 is a virion structural protein. p12 is proline rich. The function carried out by p12 in assembly and replication is unknown. p12 is associated with pathogenicity of the virus
291226	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
291227	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
291229	Ribosomal protein S11
291233	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
291236	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291238	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
291239	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes
291240	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
291244	Ribosomal protein S7e
291248	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
291249	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
291252	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
291256	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
291258	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
291260	ARD/ARD' family. The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+. ARD and ARD' can be e
291263	NUDIX domain
291266	NUDIX domain
291268	Ribosomal protein S5, N-terminal domain
291269	NUDIX domain
291270	NUDIX domain
291271	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
291272	NUDIX domain
291273	Elongation factor 1 gamma, conserved domain
291273	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
291275	NUDIX domain
291276	NUDIX domain
291277	Elongation factor 1 gamma, conserved domain
291280	MCM2/3/5 family
291281	Ribosomal protein L21e
291283	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
291284	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
291285	Eukaryotic porin
291287	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
291287	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
291291	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
291292	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
291293	UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near t
291296	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
291296	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
291297	Ribosomal protein S6e
291299	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
291301	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
291306	Actin
291307	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
291308	Ribosomal protein L21e
291314	AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt
291314	AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the d
291315	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
291315	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
291317	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
291318	Myristoyl-CoA:protein N-myristoyltransferase, C-terminal domain. The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold
291324	C-5 cytosine-specific DNA methylase
291325	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
291327	Lectin C-type domain. This family includes both long and short form C-type
291328	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
291329	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291329	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291330	Ribosomal protein S2
291332	Low-density lipoprotein receptor domain class A
291332	MAM domain. An extracellular domain found in many receptors
291333	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
291334	MAM domain. An extracellular domain found in many receptors
291335	MAM domain. An extracellular domain found in many receptors
291336	MAM domain. An extracellular domain found in many receptors
291340	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
291344	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
291345	Protein kinase domain
291346	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
291347	Ribosomal protein L23
291349	Protein kinase domain
291350	Protein kinase domain
291350	Myosin head (motor domain)
291352	7 transmembrane receptor (metabotropic glutamate family)
291352	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
291353	Ribosomal protein S8e
291354	Ribosomal protein L23
291355	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
291358	Cyclophilin type peptidyl-prolyl cis-trans isomerase
291359	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
291359	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
291360	Somatotropin hormone family
291361	7 transmembrane receptor (rhodopsin family)
291367	Ribosomal protein L21e
291369	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
291372	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal
291375	AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this
291376	PTB domain (IRS-1 type)
291379	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
291379	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
291382	Replication factor-A protein 1, N-terminal domain
291382	OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl -tRNA synthetases cata
291386	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
291388	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
291394	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
291403	Zinc-binding dehydrogenase
291407	Ribosomal protein L6
291413	Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase
291414	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
291414	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
291415	Mitosis protein DIM1
291415	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
291416	Ribosomal protein L10
291417	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291417	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291426	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
291427	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
291428	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
291432	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
291434	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
291435	Mitochondrial carrier protein
291436	Lipase
291437	Lipase
291437	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
291442	Cadherin domain
291442	TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin
291443	Ribosomal L10
291444	Protein of unknown function DUF84. The function of this prokaryotic protein family is unknown
291449	Ribosomal protein L35Ae
291450	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
291451	Eukaryotic porin
291452	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
291454	Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However
291455	metallopeptidase family M24
291456	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291456	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291457	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291458	HMG (high mobility group) box
291461	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
291461	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
291463	Cyclophilin type peptidyl-prolyl cis-trans isomerase
291465	PX domain. PX domains bind to phosphoinositides
291465	Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2
291466	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
291468	FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain
291468	FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain
291470	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
291472	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291474	Ribosomal protein L19e
291474	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291477	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291478	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291480	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291483	Ribosomal protein L19e
291483	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291484	Ribosomal protein L19e
291485	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291486	7 transmembrane receptor (rhodopsin family)
291487	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291488	Ribosomal protein L19e
291489	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291490	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291491	Ribosomal protein L19e
291494	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291495	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291496	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291499	Ribosomal protein L19e
291501	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291503	Glutaredoxin
291503	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
291504	Ribosomal protein L19e
291505	Ribosomal protein L19e
291506	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291507	7 transmembrane receptor (rhodopsin family)
291508	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291509	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291510	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291511	Ribosomal protein L19e
291514	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291516	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291519	Ribosomal protein L19e
291520	Ribosomal protein L19e
291521	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291522	7 transmembrane receptor (rhodopsin family)
291523	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291523	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291524	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291524	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291526	impB/mucB/samB family. These proteins are involved in UV protection
291527	START domain
291532	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291532	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291536	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291536	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291537	Ribosomal L29e protein family
291538	Thymosin beta-4 family
291541	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
291542	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291542	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291543	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291543	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291545	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291545	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291547	Eukaryotic ribosomal protein L18
291548	Ribosomal protein S28e
291551	S25 ribosomal protein
291553	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
291556	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
291559	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
291560	Ribosomal protein S7e
291561	Ribosomal protein L31e
291562	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
291564	PH domain. PH stands for pleckstrin homology
291568	Protein kinase domain
291568	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
291569	Homeobox domain
291581	Ribosomal protein L21e
291582	AMP-binding enzyme
291585	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
291586	DTW domain. This presumed domain is found in bacterial and eukaryotic proteins. Its function is unknown. The domain contains multiple conserved motifs including a DTXW motif that this domain has been named after
291588	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291589	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291590	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
291591	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
291593	Ribosomal protein S2
291594	Ribosomal protein S2
291597	B-box zinc finger
291597	Fibronectin type III domain
291598	Ribosomal protein L6
291599	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
291603	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291603	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291604	NUDIX domain
291605	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
291606	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
291607	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
291611	Ribosomal protein S5, C-terminal domain
291613	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
291615	Poly-adenylate binding protein, unique domain
291615	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
291617	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291617	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291618	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
291619	Ribosomal protein L36e
291621	Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site
291621	Transketolase, pyridine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases
291621	Transketolase, thiamine diphosphate binding domain. This family includes transketolase enzymes EC:2.2.1.1. and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit EC:1.2.4.4. Both these enzymes utilise thiamine pyrophosphate as a cofacto
291622	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
291623	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291623	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
291626	Ribosomal protein S5, C-terminal domain
291635	Cadherin domain
291636	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
291636	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
291637	Cadherin domain
291638	Cadherin domain
291639	Cadherin domain
291640	Mitochondrial carrier protein
291641	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
291642	Cadherin domain
291643	Cadherin domain
291644	Cadherin domain
291645	Cadherin domain
291646	Cadherin domain
291647	Cadherin domain
291648	Cadherin domain
291648	Ribosomal protein L15 amino terminal region. This family is always associated with pfam00256. This family is diagnostic of ribosomal L15 proteins
291649	Cadherin domain
291650	Fructose-bisphosphate aldolase class-I
291651	Cadherin domain
291652	Cadherin domain
291653	Cadherin domain
291654	Cadherin domain
291655	Cadherin domain
291656	Cadherin domain
291666	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
291668	Neuregulin family
291669	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
291670	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
291671	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
291678	wnt family
291679	Nucleoside diphosphate kinase
291680	Nucleoside diphosphate kinase
291681	wnt family
291682	wnt family
291685	SRP19 protein. The signal recognition particle (SRP) binds to the signal peptide of proteins as they are being translated. The binding of the SRP halts translation and the complex is then transported to the endoplasmic reticulum's cytoplasmic surface. The
291687	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
291688	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
291690	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
291691	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
291692	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
291696	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
291697	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
291698	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
291699	START domain
291701	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
291701	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
291704	Protein kinase domain
291706	Actin
291707	LIM domain. This family represents two copies of the LIM structural domain
291709	Mitochondrial carrier protein
291711	Actin
291712	Eukaryotic ribosomal protein L18
291713	ADP-ribosylation factor family
291713	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
291715	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
291715	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
291717	Chaperonin 10 Kd subunit
291718	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291719	Mitochondrial carrier protein
291721	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
291728	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291729	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
291729	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
291733	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
291734	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
291738	Ribosomal protein S2
291740	Ribosomal L10
291740	Ribosomal protein L16
291741	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
291742	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
291745	Cyclophilin type peptidyl-prolyl cis-trans isomerase
291749	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
291752	Cadherin domain
291753	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
291753	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
291754	Cadherin domain
291754	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
291755	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
291757	Ribosomal protein L3
291758	Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase
291759	Cadherin domain
291760	Cadherin domain
291761	Elongation factor 1 gamma, conserved domain
291761	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
291761	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
291762	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
291764	7 transmembrane receptor (metabotropic glutamate family)
291766	Cyclophilin type peptidyl-prolyl cis-trans isomerase
291767	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291768	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
291770	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
291771	Fibrillarin
291772	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
291774	Proteasome A-type and B-type
291776	Proteasome A-type and B-type
291782	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
291786	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291786	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291791	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
291794	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
291795	RNA polymerase Rpb8. Rpb8 is a subunit common to the three yeast RNA polymerases, pol I, II and III. Rpb8 interacts with the largest subunit Rpb1, and with Rpb3 and Rpb11, two smaller subunits
291796	Ubiquitin carboxyl-terminal hydrolase family 2
291797	Death domain
291798	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
291799	ADP-ribosylation factor family
291800	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
291800	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
291800	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
291801	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
291802	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
291805	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
291807	Ribosomal protein L19e
291808	Ribosomal protein L19e
291814	Carboxylesterase
291817	Carboxylesterase
291818	Domain of unknown function DUF59. This family includes prokaryotic proteins of unknown function. The family also includes PhaH from Pseudomonas putida. PhaH forms a complex with PhaF, PhaG and PhaI, which hydroxylates phenylacetic acid to 2-hydroxyphenyla
291819	Eukaryotic-type carbonic anhydrase
291822	Microtubule associated protein (MAP65/ASE1 family)
291823	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
291824	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
291826	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
291827	NAC domain
291828	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
291837	'Cold-shock' DNA-binding domain
291838	RNA polymerases N / 8 kDa subunit
291840	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
291846	Trypsin
291854	7 transmembrane receptor (Secretin family)
291854	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
291860	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
291862	Carboxylesterase
291863	Carboxylesterase
291864	Carboxylesterase
291871	Hsp90 protein
291871	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
291872	Hsp90 protein
291875	LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
291879	Ribosomal protein L13e
291881	Protein of unknown function, DUF259
291881	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
291884	Ribosomal protein S5, C-terminal domain
291884	Ribosomal protein S5, N-terminal domain
291885	MCM2/3/5 family
291886	Core histone H2A/H2B/H3/H4
291889	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
291890	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
291894	Myosin head (motor domain)
291902	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
291906	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
291909	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291911	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
291919	'Cold-shock' DNA-binding domain
291920	Ribosomal L29e protein family
291923	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
291923	SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily
291923	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
291926	Protein kinase domain
291929	7 transmembrane receptor (rhodopsin family)
291930	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
291931	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
291933	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
291933	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
291935	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
291937	Anaphase-promoting complex, subunit 10 (APC10)
291938	Possible metal-binding domain in RNase L inhibitor, RLI. Possible metal-binding domain in endoribonuclease RNase L inhibitor. Found at the N-terminal end of RNase L inhibitor proteins, adjacent to the 4Fe-4S binding domain, fer4, pfam00037. Also often fou
291939	ArgK protein. The ArgK protein acts as an ATPase enzyme and as a kinase, and phosphorylates periplasmic binding proteins involved in the LAO (lysine, arginine, ornithine)/AO transport systems
291941	Ribosomal L29e protein family
291943	Ribosomal protein L21e
291946	Domain of unknown function
291948	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
291949	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
291950	Core histone H2A/H2B/H3/H4
291951	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
291952	Carboxylesterase
291953	Carboxylesterase
291954	Carboxylesterase
291955	Carboxylesterase
291957	Ribosomal protein S8
291958	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
291965	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
291969	ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae
291972	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
291980	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
291981	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
291983	Proteasome A-type and B-type
291984	Renal dipeptidase
291985	Cadherin domain
291986	Cadherin domain
291986	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
291988	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
291989	Polypeptide deformylase
291991	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
291993	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
291994	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
291997	Ribosomal protein L11, RNA binding domain
291999	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
292002	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
292006	ENV polyprotein (coat polyprotein)
292008	Mitosis protein DIM1
292008	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
292013	Ribosomal protein S24e
292015	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
292016	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in comple
292017	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
292018	Ribosomal L29e protein family
292019	CPSF A subunit region. This family includes a region that lies towards the C-terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pr
292022	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
292022	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
292023	DHHA1 domain. This domain is often found adjacent to the DHH domain pfam01368 and is called DHHA1 for DHH associated domain. This domain is diagnostic of DHH subfamily 1 members. This domains is also found in alanyl tRNA synthetase, suggesting that this d
292025	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
292026	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292026	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
292028	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
292030	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
292031	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
292033	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
292036	Skp1 family, dimerisation domain
292036	Skp1 family, tetramerisation domain
292037	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
292038	NUDIX domain
292039	Zinc-binding dehydrogenase
292041	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
292044	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
292045	'chromo' (CHRromatin Organization MOdifier) domain
292046	Ribosomal protein L21e
292048	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
292048	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
292048	REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six co
292049	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
292054	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
292057	Adenosine-deaminase (editase) domain
292057	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea
292058	Domain of unknown function (DUF392). A eukaryotic specific domain of undetermined function.`
292060	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved trypt
292072	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
292073	Sulfatase
292073	Protein of unknown function (DUF229). Members of this family are uncharacterised. They are 500-1200 amino acids in length and share a long region conservation that probably corresponds to several domains
292074	Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses
292077	Sulfotransferase protein
292079	Protein kinase domain
292079	Uncharacterized protein family (ORF7) DUF. Several members of this family are Borrelia burgdorferi plasmid proteins of uncharacterized function
292083	7 transmembrane receptor (rhodopsin family)
292084	Ribosomal protein L35Ae
292085	Nup133 nucleoporin. RNA undergoing nuclear export first encounters the basket of the nuclear pore. Nup133 is a nucleoporin accessible on the basket side of the pore
292089	PMP-22/EMP/MP20/Claudin family
292096	Ribosomal L29e protein family
292097	Arv1-like family. Arv1 is a transmembrane protein with potential zinc-binding motifs. ARV1 is a novel mediator of eukaryotic sterol homeostasis
292104	Ribosomal L29e protein family
292106	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
292107	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
292109	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
292115	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
292117	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
292119	Ribosomal protein L11, RNA binding domain
292123	S25 ribosomal protein
292124	Zona pellucida-like domain
292126	Trypsin
292126	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
292129	MYND finger
292129	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
292130	PH domain. PH stands for pleckstrin homology
292133	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
292134	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
292135	Ribosomal protein S19e
292137	Lipase
292138	Enolase, C-terminal TIM barrel domain
292141	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
292143	Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However
292147	Ribosomal protein L35Ae
292149	Tricarboxylate carrier
292150	Tricarboxylate carrier
292151	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
292152	Carboxylesterase
292153	Carboxylesterase
292154	Carboxylesterase
292156	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
292170	IMP4 family. This family includes IMP4, which is involved ribosomal RNA processing
292172	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
292173	Transcription factor TFIID (or TATA-binding protein, TBP)
292174	Ribosomal protein S5, C-terminal domain
292176	Peptidase C16 family
292176	Ribosomal protein L21e
292177	Ribosomal L29e protein family
292178	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
292180	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
292181	Sugar (and other) transporter
292182	pfam02892, zf-BED, BED zinc finger
292186	ENV polyprotein (coat polyprotein)
292192	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
292193	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
292195	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
292196	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
292201	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
292203	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292206	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
292211	Ribosomal protein L24e
292212	Ribosomal protein L24e
292213	Ribosomal protein L24e
292214	Protein kinase domain
292215	Protein kinase domain
292217	Protein kinase domain
292218	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
292219	Ribosomal protein L24e
292220	Protein kinase domain
292221	Protein kinase domain
292222	Protein kinase domain
292223	Protein kinase domain
292224	Protein kinase domain
292225	Protein kinase domain
292226	Protein kinase domain
292227	Protein kinase domain
292231	7 transmembrane receptor (rhodopsin family)
292231	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292232	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292233	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292234	Homeobox domain
292236	C2 domain
292238	Fibronectin type III domain
292241	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
292241	Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
292242	Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
292243	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
292244	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
292248	7 transmembrane receptor (metabotropic glutamate family)
292249	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292251	Sulfotransferase protein
292252	Phosphoglycerate kinase
292253	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
292255	Protein kinase domain
292256	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
292257	PH domain. PH stands for pleckstrin homology
292258	Ribosomal protein L13
292259	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
292261	Ribosomal protein L6
292262	Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons
292271	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
292279	Oxidoreductase FAD-binding domain
292279	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous
292280	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
292284	Protein-tyrosine phosphatase
292284	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
292287	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
292289	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
292289	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
292294	Intermediate filament protein
292295	Poly-adenylate binding protein, unique domain
292295	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
292296	Actin
292297	Core histone H2A/H2B/H3/H4
292301	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
292302	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
292304	Ribosomal protein L19e
292306	Ribonuclease T2 family
292308	Protein kinase domain
292310	7 transmembrane receptor (rhodopsin family)
292311	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
292313	Protein kinase domain
292314	Protein kinase domain
292316	Protein kinase domain
292316	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292317	Protein kinase domain
292319	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
292324	7 transmembrane receptor (rhodopsin family)
292325	Calpain family cysteine protease
292325	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
292327	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predict
292327	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
292330	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
292331	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
292332	Protein kinase domain
292334	Protein kinase domain
292334	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292335	Protein kinase domain
292335	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292336	Protein kinase domain
292336	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292338	Protein kinase domain
292340	Protein kinase domain
292340	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292342	Protein kinase domain
292342	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292343	Protein kinase domain
292346	Protein kinase domain
292346	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292347	Protein kinase domain
292348	Protein kinase domain
292348	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292350	Protein kinase domain
292351	Protein kinase domain
292352	Protein kinase domain
292353	Protein kinase domain
292354	Protein kinase domain
292354	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292355	Protein kinase domain
292357	Protein kinase domain
292358	Protein kinase domain
292358	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292360	Protein kinase domain
292360	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292362	Protein kinase domain
292363	Protein kinase domain
292364	Protein kinase domain
292365	Protein kinase domain
292366	Protein kinase domain
292367	Protein kinase domain
292368	Protein kinase domain
292368	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292371	Protein kinase domain
292371	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292372	Protein kinase domain
292376	Protein kinase domain
292376	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292377	Protein kinase domain
292379	Triosephosphate isomerase
292380	Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.- . This family may need divides a the full alignment contains a significantly shorter mouse sequence
292383	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
292384	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292385	Ribosomal protein L23
292386	Protein kinase domain
292386	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292387	Protein kinase domain
292388	Protein kinase domain
292388	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292389	Protein kinase domain
292389	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292391	Protein kinase domain
292391	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292393	Protein kinase domain
292394	Protein kinase domain
292394	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292395	Protein kinase domain
292395	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292396	Protein kinase domain
292396	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292397	Hsp90 protein
292399	Protein kinase domain
292404	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
292405	Thrombospondin N-terminal -like domain
292407	Ribosomal L22e protein family
292409	7 transmembrane receptor (rhodopsin family)
292410	7 transmembrane receptor (rhodopsin family)
292411	7 transmembrane receptor (rhodopsin family)
292413	7 transmembrane receptor (rhodopsin family)
292414	7 transmembrane receptor (metabotropic glutamate family)
292414	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292415	7 transmembrane receptor (rhodopsin family)
292416	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292417	7 transmembrane receptor (metabotropic glutamate family)
292417	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292418	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292419	7 transmembrane receptor (rhodopsin family)
292420	7 transmembrane receptor (metabotropic glutamate family)
292420	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292421	7 transmembrane receptor (rhodopsin family)
292421	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292422	ENV polyprotein (coat polyprotein)
292422	7 transmembrane receptor (rhodopsin family)
292422	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292423	Protein kinase domain
292423	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
292425	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292426	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292428	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292430	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292431	7 transmembrane receptor (metabotropic glutamate family)
292432	7 transmembrane receptor (metabotropic glutamate family)
292432	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292433	7 transmembrane receptor (metabotropic glutamate family)
292433	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292436	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292437	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292438	7 transmembrane receptor (metabotropic glutamate family)
292438	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292440	7 transmembrane receptor (metabotropic glutamate family)
292440	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292441	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292442	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292443	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292444	7 transmembrane receptor (metabotropic glutamate family)
292445	7 transmembrane receptor (metabotropic glutamate family)
292445	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292447	7 transmembrane receptor (metabotropic glutamate family)
292447	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292452	7 transmembrane receptor (metabotropic glutamate family)
292464	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
292466	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
292467	Ets-domain
292471	ATP synthase alpha/beta chain, C terminal domain
292471	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
292473	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
292474	Ribosomal protein L15
292476	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
292478	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
292479	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
292482	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
292482	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
292483	Syntaxin
292485	Alpha-L-fucosidase
292488	7 transmembrane receptor (metabotropic glutamate family)
292488	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292489	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292490	7 transmembrane receptor (metabotropic glutamate family)
292493	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
292494	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292495	7 transmembrane receptor (metabotropic glutamate family)
292497	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292498	7 transmembrane receptor (metabotropic glutamate family)
292499	7 transmembrane receptor (metabotropic glutamate family)
292500	7 transmembrane receptor (metabotropic glutamate family)
292500	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
292503	7 transmembrane receptor (metabotropic glutamate family)
292504	7 transmembrane receptor (metabotropic glutamate family)
292505	7 transmembrane receptor (metabotropic glutamate family)
292505	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292506	7 transmembrane receptor (metabotropic glutamate family)
292506	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292508	7 transmembrane receptor (metabotropic glutamate family)
292508	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292509	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
292510	7 transmembrane receptor (metabotropic glutamate family)
292510	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292512	7 transmembrane receptor (metabotropic glutamate family)
292513	7 transmembrane receptor (metabotropic glutamate family)
292518	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
292519	ENV polyprotein (coat polyprotein)
292519	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
292521	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292523	Ribosomal protein S8
292524	Dihydroorotase-like. This family are predicted to adopt a TIM barrel fold
292524	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292526	7 transmembrane receptor (rhodopsin family)
292526	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292528	7 transmembrane receptor (rhodopsin family)
292528	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292529	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292530	7 transmembrane receptor (metabotropic glutamate family)
292530	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292531	7 transmembrane receptor (metabotropic glutamate family)
292536	Putative snoRNA binding domain. This family consists of various Pre RNA processing ribonucleoproteins. The function of the aligned region is unknown however it may be a common RNA or snoRNA or Nop1p binding domain. Nop5p (Nop58p) from yeast is the protein
292539	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
292540	7 transmembrane receptor (rhodopsin family)
292542	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
292543	Amino acid permease
292544	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
292545	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
292546	Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone
292547	7 transmembrane receptor (metabotropic glutamate family)
292547	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292548	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292551	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292552	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292553	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
292554	Protein kinase domain
292555	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
292558	7 transmembrane receptor (rhodopsin family)
292559	7 transmembrane receptor (rhodopsin family)
292560	7 transmembrane receptor (rhodopsin family)
292561	7 transmembrane receptor (rhodopsin family)
292562	7 transmembrane receptor (rhodopsin family)
292563	7 transmembrane receptor (rhodopsin family)
292564	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292566	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
292566	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
292567	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
292571	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
292572	Protein kinase domain
292573	Homeobox domain
292574	Homeobox domain
292575	Homeobox domain
292576	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292578	Sulfotransferase protein
292578	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
292579	Sulfotransferase protein
292579	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
292580	7 transmembrane receptor (rhodopsin family)
292580	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292581	ICE-like protease (caspase) p10 domain
292581	ICE-like protease (caspase) p20 domain
292582	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
292583	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
292585	Protein kinase domain
292587	Isochorismatase family. This family are hydrolase enzymes
292588	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
292590	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
292593	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
292600	7 transmembrane receptor (rhodopsin family)
292600	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292601	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
292602	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
292602	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
292603	Protein kinase domain
292604	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292607	7 transmembrane receptor (rhodopsin family)
292607	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292608	7 transmembrane receptor (rhodopsin family)
292608	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292609	7 transmembrane receptor (metabotropic glutamate family)
292610	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292612	7 transmembrane receptor (metabotropic glutamate family)
292612	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292615	7 transmembrane receptor (metabotropic glutamate family)
292615	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292617	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
292620	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292626	Ribosomal protein L23
292628	Homeobox domain
292629	Homeobox domain
292630	Homeobox domain
292631	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
292632	Sulfotransferase protein
292633	Sulfotransferase protein
292634	Sulfotransferase protein
292635	Sulfotransferase protein
292637	Sulfotransferase protein
292639	CUB domain
292641	ENV polyprotein (coat polyprotein)
292641	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
292641	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
292642	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292643	Histidine acid phosphatase
292644	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
292645	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop
292646	Fibronectin type II domain
292647	Homeobox domain
292648	Sulfotransferase protein
292651	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
292655	Anion-transporting ATPase. This Pfam family represents a conserved domain, which is sometimes repeated, in an anion-transporting ATPase. The ATPase is involved in the removal of arsenate, antimonite, and arsenate from the cell
292656	Macrophage migration inhibitory factor (MIF)
292657	Sodium:dicarboxylate symporter family
292657	Sodium:dicarboxylate symporter family
292658	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
292659	Protein kinase domain
292661	7 transmembrane receptor (rhodopsin family)
292662	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
292664	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
292665	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
292667	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
292670	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
292671	Eukaryotic porin
292671	Class I Histocompatibility antigen, domains alpha 1 and 2
292672	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
292672	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
292673	Helix-hairpin-helix motif
292673	DNA repair protein rad10
292677	FKBP-type peptidyl-prolyl cis-trans isomerase
292681	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
292686	F-box domain
292687	Glutaminyl cyclase. Glutaminyl cyclase catalyses the formation of the pyroglutamyl residue present at the amino terminus of numerous secretory peptides and proteins. Glutaminyl cyclase posses a zinc aminopeptidase domain in which the four functionally imp
292688	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
292693	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
292701	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
292702	Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15
292703	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292704	HMG (high mobility group) box
292705	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292710	Metallo-beta-lactamase superfamily
292719	DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai
292720	Ets-domain
292721	Ets-domain
292726	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
292727	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
292728	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
292729	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
292740	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292741	'Cold-shock' DNA-binding domain
292742	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292743	Ribosomal protein L23
292744	Low molecular weight phosphotyrosine protein phosphatase
292745	von Willebrand factor type D domain
292746	von Willebrand factor type D domain
292746	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
292747	Fibrillarin
292749	Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates
292755	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated
292756	Protein kinase domain
292756	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
292757	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated
292758	Ribosomal protein S12
292760	Vacuolar sorting protein 9 (VPS9) domain
292761	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
292763	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
292768	Hrf1 family. This family includes a number of eukaryotic proteins. It is an integral membrane protein, conserved in at least 1 copy in all sequenced eukaryotes. The gene name in S. pombe is hrf1+ for Heavy metal Resistance Factor 1
292770	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
292773	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292774	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292775	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292777	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
292778	RNA polymerases M/15 Kd subunit
292782	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
292783	Actin
292784	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
292784	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
292785	Proline dehydrogenase
292786	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
292798	Glycosyltransferase family 6
292799	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292802	Hsp90 protein
292803	Programmed cell death protein 2, C-terminal domain
292804	Phosphoglucose isomerase. Phosphoglucose isomerase catalyses the interconversion of glucose-6-phosphate and fructose-6-phosphate
292807	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
292808	metallopeptidase family M24
292809	WD domain, G-beta repeat
292812	Ribosomal protein S5, C-terminal domain
292812	Ribosomal protein S5, N-terminal domain
292814	Protein of unknown function DUF122. This protein family has no known function
292815	ENV polyprotein (coat polyprotein)
292818	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
292819	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
292821	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292822	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292825	Ribosomal protein L13
292826	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292827	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292828	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
292828	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
292830	Domain of unknown function UPF0086. This small conserved archaeal protein has no known function
292831	Domain of unknown function UPF0086. This small conserved archaeal protein has no known function
292833	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
292833	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
292836	eRF1 domain 1. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
292836	eRF1 domain 2. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
292836	eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
292837	7 transmembrane receptor (metabotropic glutamate family)
292837	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
292839	7 transmembrane receptor (metabotropic glutamate family)
292840	ENV polyprotein (coat polyprotein)
292841	Core histone H2A/H2B/H3/H4
292841	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
292841	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292842	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292845	Electron transfer flavoprotein beta subunit. This protein is distantly related to and forms a heterodimer with pfam00766
292847	Uncharacterized protein family UPF0021
292848	Trypsin
292849	Trypsin
292851	Trypsin
292852	Trypsin
292853	Trypsin
292854	Translation initiation factor SUI1
292855	Trypsin
292856	Trypsin
292857	Trypsin
292858	Trypsin
292859	Trypsin
292861	Trypsin
292862	Trypsin
292863	Trypsin
292864	Trypsin
292865	Trypsin
292866	Trypsin
292867	Trypsin
292868	Trypsin
292870	Trypsin
292870	ENV polyprotein (coat polyprotein)
292870	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
292870	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
292871	Trypsin
292872	Trypsin
292873	Trypsin
292876	Josephin
292879	Fibronectin type III domain
292879	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
292881	Ets-domain
292882	Ribosomal protein L19e
292884	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
292895	Ribosomal protein L13
292896	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
292902	Sodium:neurotransmitter symporter family
292903	Sodium:neurotransmitter symporter family
292906	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
292909	Tub family
292910	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
292912	DIL domain. The DIL domain has no known function
292915	Sulfotransferase protein
292916	Synaptogyrin. This family of proteins is distantly related to pfam01284
292917	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
292918	von Willebrand factor type D domain
292918	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
292919	Helix-loop-helix DNA-binding domain
292919	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates s
292922	Serum amyloid A protein
292924	Serum amyloid A protein
292925	Uncharacterized ACR, COG1579
292926	7 transmembrane receptor (rhodopsin family)
292927	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
292928	7 transmembrane receptor (rhodopsin family)
292929	7 transmembrane receptor (rhodopsin family)
292932	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
292934	Homeobox domain
292936	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
292936	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
292948	Protein kinase domain
292949	PMP-22/EMP/MP20/Claudin family
292955	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
292956	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
292958	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
292962	Enolase, N-terminal domain
292962	Enolase, C-terminal TIM barrel domain
292964	Ribosomal protein L44
292965	Ribosomal protein L15
292968	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucle
292969	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucle
292977	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
292979	Core histone H2A/H2B/H3/H4
292981	Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates
292981	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
292983	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
292992	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
292993	WD domain, G-beta repeat
292993	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
292994	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
292998	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
292999	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
293001	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
293002	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
293003	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
293004	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
293006	Ribosomal L15
293009	Cyclophilin type peptidyl-prolyl cis-trans isomerase
293010	Ribosomal protein S6e
293011	Ribosomal protein S5, N-terminal domain
293011	Ribosomal protein S5, C-terminal domain
293012	Helix-loop-helix DNA-binding domain
293014	Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15
293015	Protein kinase domain
293022	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293022	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
293023	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
293023	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
293025	Ribosomal protein L13e
293026	Ribosomal protein S27
293026	Enolase, N-terminal domain
293026	Enolase, C-terminal TIM barrel domain
293027	Core histone H2A/H2B/H3/H4
293028	Eukaryotic DNA topoisomerase I, catalytic core. Topoisomerase I promotes the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replication, transcription, and recombina
293028	pfam02919, Topoisomer_I_N, Eukaryotic DNA topoisomerase I, DNA binding fragment. Topoisomerase I promotes the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replicat
293029	Macrophage migration inhibitory factor (MIF)
293031	Ribosomal protein S7e
293032	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
293033	Ribosomal protein L21e
293035	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
293037	7 transmembrane receptor (rhodopsin family)
293039	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
293041	Protein kinase domain
293043	Isocitrate/isopropylmalate dehydrogenase
293044	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
293046	Helix-loop-helix DNA-binding domain
293049	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
293050	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
293051	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
293052	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
293053	Ribosomal protein S11
293059	Uncharacterized ACR, YggU family COG1872
293062	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
293063	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
293064	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
293066	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
293067	A20-like zinc finger. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation
293067	AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets in
293068	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
293069	7 transmembrane receptor (metabotropic glutamate family)
293069	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
293070	7 transmembrane receptor (rhodopsin family)
293072	7 transmembrane receptor (rhodopsin family)
293073	7 transmembrane receptor (metabotropic glutamate family)
293073	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
293075	7 transmembrane receptor (rhodopsin family)
293076	7 transmembrane receptor (rhodopsin family)
293077	7 transmembrane receptor (rhodopsin family)
293078	7 transmembrane receptor (rhodopsin family)
293080	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
293081	7 transmembrane receptor (rhodopsin family)
293082	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
293083	7 transmembrane receptor (rhodopsin family)
293084	7 transmembrane receptor (rhodopsin family)
293085	7 transmembrane receptor (rhodopsin family)
293086	7 transmembrane receptor (rhodopsin family)
293087	7 transmembrane receptor (rhodopsin family)
293088	7 transmembrane receptor (rhodopsin family)
293089	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
293090	7 transmembrane receptor (rhodopsin family)
293091	7 transmembrane receptor (rhodopsin family)
293092	7 transmembrane receptor (rhodopsin family)
293093	7 transmembrane receptor (metabotropic glutamate family)
293094	7 transmembrane receptor (rhodopsin family)
293094	7 transmembrane receptor (metabotropic glutamate family)
293095	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
293096	7 transmembrane receptor (metabotropic glutamate family)
293097	Nucleoside diphosphate kinase
293100	Trypsin
293101	Malic enzyme, N-terminal domain
293102	Malic enzyme, NAD binding domain
293106	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
293115	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
293119	Ribosomal protein L3
293120	ENV polyprotein (coat polyprotein)
293121	14-3-3 protein
293125	Ribosomal protein L23
293128	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
293128	OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl -tRNA synthetases cata
293129	ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum m
293131	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
293132	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
293133	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
293133	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
293133	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be
293134	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
293136	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
293140	7 transmembrane receptor (rhodopsin family)
293141	7 transmembrane receptor (rhodopsin family)
293145	Phosphoglucomutase/phosphomannomutase, C-terminal domain
293149	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
293150	START domain
293152	NAD:arginine ADP-ribosyltransferase
293153	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
293153	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
293154	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anch
293162	PH domain. PH stands for pleckstrin homology
293165	HMG (high mobility group) box
293168	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
293169	Ribosomal protein S11
293171	metallopeptidase family M24
293173	Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included
293174	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
293176	Enolase, N-terminal domain
293177	Enolase, N-terminal domain
293177	Enolase, C-terminal TIM barrel domain
293179	TEA/ATTS domain family
293182	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
293186	Extracellular link domain
293187	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
293188	7 transmembrane receptor (rhodopsin family)
293189	7 transmembrane receptor (rhodopsin family)
293190	Bcl-2 homology region 4
293190	Apoptosis regulator proteins, Bcl-2 family
293191	7 transmembrane receptor (rhodopsin family)
293192	7 transmembrane receptor (rhodopsin family)
293193	7 transmembrane receptor (rhodopsin family)
293194	7 transmembrane receptor (rhodopsin family)
293195	7 transmembrane receptor (rhodopsin family)
293196	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293196	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
293197	7 transmembrane receptor (rhodopsin family)
293198	7 transmembrane receptor (rhodopsin family)
293199	7 transmembrane receptor (rhodopsin family)
293201	7 transmembrane receptor (rhodopsin family)
293202	7 transmembrane receptor (rhodopsin family)
293203	7 transmembrane receptor (rhodopsin family)
293204	7 transmembrane receptor (rhodopsin family)
293205	7 transmembrane receptor (rhodopsin family)
293206	7 transmembrane receptor (rhodopsin family)
293207	7 transmembrane receptor (rhodopsin family)
293209	7 transmembrane receptor (rhodopsin family)
293210	7 transmembrane receptor (rhodopsin family)
293211	HMG (high mobility group) box
293211	7 transmembrane receptor (rhodopsin family)
293212	7 transmembrane receptor (rhodopsin family)
293214	7 transmembrane receptor (rhodopsin family)
293215	7 transmembrane receptor (rhodopsin family)
293216	7 transmembrane receptor (rhodopsin family)
293218	7 transmembrane receptor (rhodopsin family)
293220	7 transmembrane receptor (rhodopsin family)
293221	7 transmembrane receptor (rhodopsin family)
293222	7 transmembrane receptor (rhodopsin family)
293223	7 transmembrane receptor (rhodopsin family)
293224	7 transmembrane receptor (rhodopsin family)
293225	7 transmembrane receptor (rhodopsin family)
293226	7 transmembrane receptor (rhodopsin family)
293227	7 transmembrane receptor (rhodopsin family)
293228	7 transmembrane receptor (rhodopsin family)
293229	7 transmembrane receptor (rhodopsin family)
293230	7 transmembrane receptor (rhodopsin family)
293231	7 transmembrane receptor (rhodopsin family)
293232	7 transmembrane receptor (rhodopsin family)
293233	7 transmembrane receptor (rhodopsin family)
293234	7 transmembrane receptor (rhodopsin family)
293235	7 transmembrane receptor (rhodopsin family)
293236	7 transmembrane receptor (rhodopsin family)
293237	7 transmembrane receptor (rhodopsin family)
293238	7 transmembrane receptor (rhodopsin family)
293239	7 transmembrane receptor (rhodopsin family)
293240	7 transmembrane receptor (rhodopsin family)
293242	7 transmembrane receptor (rhodopsin family)
293243	7 transmembrane receptor (rhodopsin family)
293244	7 transmembrane receptor (rhodopsin family)
293245	7 transmembrane receptor (rhodopsin family)
293246	7 transmembrane receptor (rhodopsin family)
293247	7 transmembrane receptor (rhodopsin family)
293248	7 transmembrane receptor (rhodopsin family)
293249	7 transmembrane receptor (rhodopsin family)
293250	7 transmembrane receptor (rhodopsin family)
293251	7 transmembrane receptor (rhodopsin family)
293252	7 transmembrane receptor (rhodopsin family)
293253	7 transmembrane receptor (rhodopsin family)
293254	7 transmembrane receptor (rhodopsin family)
293255	7 transmembrane receptor (rhodopsin family)
293256	7 transmembrane receptor (rhodopsin family)
293257	7 transmembrane receptor (rhodopsin family)
293258	7 transmembrane receptor (rhodopsin family)
293259	7 transmembrane receptor (rhodopsin family)
293260	7 transmembrane receptor (rhodopsin family)
293261	7 transmembrane receptor (rhodopsin family)
293262	7 transmembrane receptor (rhodopsin family)
293263	7 transmembrane receptor (rhodopsin family)
293264	Globin
293265	Globin
293266	Globin
293267	Globin
293268	7 transmembrane receptor (rhodopsin family)
293269	7 transmembrane receptor (rhodopsin family)
293270	7 transmembrane receptor (rhodopsin family)
293271	7 transmembrane receptor (rhodopsin family)
293272	7 transmembrane receptor (rhodopsin family)
293273	7 transmembrane receptor (rhodopsin family)
293274	7 transmembrane receptor (rhodopsin family)
293275	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
293276	7 transmembrane receptor (rhodopsin family)
293278	7 transmembrane receptor (rhodopsin family)
293279	7 transmembrane receptor (rhodopsin family)
293280	7 transmembrane receptor (rhodopsin family)
293281	7 transmembrane receptor (rhodopsin family)
293282	7 transmembrane receptor (rhodopsin family)
293283	7 transmembrane receptor (rhodopsin family)
293285	7 transmembrane receptor (rhodopsin family)
293286	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
293286	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
293287	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
293288	Olfactomedin-like domain
293289	7 transmembrane receptor (rhodopsin family)
293290	7 transmembrane receptor (rhodopsin family)
293291	Ribosomal protein L36e
293292	7 transmembrane receptor (rhodopsin family)
293294	B-box zinc finger
293294	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
293295	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
293296	B-box zinc finger
293296	Zinc finger, C3HC4 type (RING finger)
293298	B-box zinc finger
293298	Zinc finger, C3HC4 type (RING finger)
293298	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
293299	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
293300	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
293301	7 transmembrane receptor (rhodopsin family)
293302	7 transmembrane receptor (rhodopsin family)
293303	7 transmembrane receptor (rhodopsin family)
293304	7 transmembrane receptor (rhodopsin family)
293305	7 transmembrane receptor (rhodopsin family)
293306	7 transmembrane receptor (rhodopsin family)
293307	7 transmembrane receptor (rhodopsin family)
293308	7 transmembrane receptor (rhodopsin family)
293309	7 transmembrane receptor (rhodopsin family)
293310	Ubiquitin carboxyl-terminal hydrolase family 2
293310	Ubiquitin carboxyl-terminal hydrolases family 2
293311	7 transmembrane receptor (rhodopsin family)
293312	7 transmembrane receptor (rhodopsin family)
293314	7 transmembrane receptor (rhodopsin family)
293315	7 transmembrane receptor (rhodopsin family)
293316	7 transmembrane receptor (rhodopsin family)
293317	Ubiquitin carboxyl-terminal hydrolase family 2
293317	Ubiquitin carboxyl-terminal hydrolases family 2
293318	7 transmembrane receptor (rhodopsin family)
293319	7 transmembrane receptor (rhodopsin family)
293320	7 transmembrane receptor (rhodopsin family)
293321	7 transmembrane receptor (rhodopsin family)
293322	7 transmembrane receptor (rhodopsin family)
293323	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
293324	7 transmembrane receptor (rhodopsin family)
293325	7 transmembrane receptor (rhodopsin family)
293326	7 transmembrane receptor (rhodopsin family)
293327	7 transmembrane receptor (rhodopsin family)
293328	7 transmembrane receptor (rhodopsin family)
293329	7 transmembrane receptor (rhodopsin family)
293330	7 transmembrane receptor (rhodopsin family)
293331	7 transmembrane receptor (rhodopsin family)
293332	Ribosomal L10
293333	7 transmembrane receptor (rhodopsin family)
293334	7 transmembrane receptor (rhodopsin family)
293335	7 transmembrane receptor (rhodopsin family)
293336	7 transmembrane receptor (rhodopsin family)
293337	7 transmembrane receptor (rhodopsin family)
293338	7 transmembrane receptor (rhodopsin family)
293339	7 transmembrane receptor (rhodopsin family)
293340	7 transmembrane receptor (rhodopsin family)
293341	7 transmembrane receptor (rhodopsin family)
293342	7 transmembrane receptor (rhodopsin family)
293345	Transcription initiation factor TFIID 23-30kDa subunit
293349	Ribosomal protein L24e
293350	7 transmembrane receptor (rhodopsin family)
293351	7 transmembrane receptor (rhodopsin family)
293352	7 transmembrane receptor (rhodopsin family)
293353	7 transmembrane receptor (rhodopsin family)
293354	7 transmembrane receptor (rhodopsin family)
293355	7 transmembrane receptor (rhodopsin family)
293356	7 transmembrane receptor (rhodopsin family)
293357	7 transmembrane receptor (rhodopsin family)
293358	7 transmembrane receptor (rhodopsin family)
293359	7 transmembrane receptor (rhodopsin family)
293361	7 transmembrane receptor (rhodopsin family)
293362	7 transmembrane receptor (rhodopsin family)
293363	7 transmembrane receptor (rhodopsin family)
293364	7 transmembrane receptor (rhodopsin family)
293365	7 transmembrane receptor (rhodopsin family)
293366	7 transmembrane receptor (rhodopsin family)
293367	7 transmembrane receptor (rhodopsin family)
293368	7 transmembrane receptor (rhodopsin family)
293369	7 transmembrane receptor (rhodopsin family)
293370	7 transmembrane receptor (rhodopsin family)
293371	7 transmembrane receptor (rhodopsin family)
293372	Endomembrane protein 70
293373	7 transmembrane receptor (rhodopsin family)
293374	7 transmembrane receptor (rhodopsin family)
293375	7 transmembrane receptor (rhodopsin family)
293376	7 transmembrane receptor (rhodopsin family)
293377	7 transmembrane receptor (rhodopsin family)
293378	PX domain. PX domains bind to phosphoinositides
293379	7 transmembrane receptor (rhodopsin family)
293380	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
293381	7 transmembrane receptor (rhodopsin family)
293382	7 transmembrane receptor (rhodopsin family)
293383	7 transmembrane receptor (rhodopsin family)
293384	7 transmembrane receptor (rhodopsin family)
293385	7 transmembrane receptor (rhodopsin family)
293386	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293386	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
293387	7 transmembrane receptor (rhodopsin family)
293388	7 transmembrane receptor (rhodopsin family)
293389	7 transmembrane receptor (rhodopsin family)
293390	7 transmembrane receptor (rhodopsin family)
293391	7 transmembrane receptor (rhodopsin family)
293392	7 transmembrane receptor (rhodopsin family)
293394	7 transmembrane receptor (rhodopsin family)
293395	7 transmembrane receptor (rhodopsin family)
293396	7 transmembrane receptor (rhodopsin family)
293398	7 transmembrane receptor (rhodopsin family)
293399	7 transmembrane receptor (rhodopsin family)
293400	7 transmembrane receptor (rhodopsin family)
293401	7 transmembrane receptor (rhodopsin family)
293402	7 transmembrane receptor (rhodopsin family)
293403	7 transmembrane receptor (rhodopsin family)
293404	7 transmembrane receptor (rhodopsin family)
293408	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
293409	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
293409	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
293410	PH domain. PH stands for pleckstrin homology
293411	Importin-beta N-terminal domain
293412	Ribosomal protein L36e
293417	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293417	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
293418	Ribosomal protein L15
293420	HMG (high mobility group) box
293421	7 transmembrane receptor (rhodopsin family)
293422	7 transmembrane receptor (rhodopsin family)
293423	7 transmembrane receptor (rhodopsin family)
293424	7 transmembrane receptor (rhodopsin family)
293425	7 transmembrane receptor (rhodopsin family)
293426	NB-ARC domain
293426	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
293427	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
293428	7 transmembrane receptor (rhodopsin family)
293429	7 transmembrane receptor (rhodopsin family)
293430	7 transmembrane receptor (rhodopsin family)
293431	7 transmembrane receptor (rhodopsin family)
293432	7 transmembrane receptor (rhodopsin family)
293435	7 transmembrane receptor (rhodopsin family)
293436	7 transmembrane receptor (rhodopsin family)
293437	7 transmembrane receptor (rhodopsin family)
293438	7 transmembrane receptor (rhodopsin family)
293439	7 transmembrane receptor (rhodopsin family)
293440	7 transmembrane receptor (rhodopsin family)
293441	7 transmembrane receptor (rhodopsin family)
293442	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
293443	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
293445	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
293446	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
293448	Insulinase (Peptidase family M16)
293451	Sulfotransferase protein
293451	Sulfotransferase protein
293453	Phosphopantetheine attachment site. A 4'-phosphopantetheine prosthetic group is attached through a serine. This prosthetic group acts as a a 'swinging arm' for the attachment of activated fatty acid and amino-acid groups. This domain forms a four helix bu
293455	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
293458	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293458	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
293461	Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.- . This family may need divides a the full alignment contains a significantly shorter mouse sequence
293462	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
293462	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
293463	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293463	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
293464	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
293464	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
293465	Sulfotransferase protein
293466	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
293467	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
293468	Sulfotransferase protein
293470	Ribosomal protein S8
293471	Hsp90 protein
293472	Ribosomal protein L13e
293480	Ribosomal protein L6
293481	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
293484	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
293485	CLN3 protein. This is a family of proteins from the CLN3 gene. A missense mutation of glutamic acid (E) to lysine (K) at position 295 in the human protein has been implicated in Juvenile neuronal ceroid lipofuscinosis (Batten disease)
293490	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be
293491	Yippee putative zinc-binding protein
293499	CUB domain
293504	Quinolinate phosphoribosyl transferase, C-terminal domain. Quinolinate phosphoribosyl transferase (QPRTase) or nicotinate-nucleotide pyrophosphorylase EC:2.4.2.19 is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses
293505	GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein
293507	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
293508	Protein kinase domain
293509	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
293510	Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates
293510	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
293511	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
293515	Ciliary neurotrophic factor
293516	Trypsin
293519	von Willebrand factor type A domain
293519	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
293520	von Willebrand factor type A domain
293523	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
293525	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
293533	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
293534	CUB domain
293534	Zona pellucida-like domain
293534	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
293537	Homeobox domain
293538	Homeobox domain
293544	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
293545	7 transmembrane receptor (rhodopsin family)
293546	7 transmembrane receptor (metabotropic glutamate family)
293553	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293553	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
293560	HMG (high mobility group) box
293565	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293565	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
293566	Zinc carboxypeptidase
293567	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
293568	Homeobox domain
293572	HMG (high mobility group) box
293573	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
293573	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
293575	Cyclophilin type peptidyl-prolyl cis-trans isomerase
293577	Ribosomal protein L31e
293578	Fimbrial Usher protein. This protein is involved in biogenesis of gram negative bacterial pili
293578	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
293582	Ribosomal protein S5, C-terminal domain
293582	Ribosomal protein S5, N-terminal domain
293584	Ribosomal protein L36e
293588	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
293590	7 transmembrane receptor (rhodopsin family)
293592	7 transmembrane receptor (rhodopsin family)
293593	7 transmembrane receptor (rhodopsin family)
293594	7 transmembrane receptor (rhodopsin family)
293595	7 transmembrane receptor (rhodopsin family)
293596	7 transmembrane receptor (rhodopsin family)
293597	7 transmembrane receptor (rhodopsin family)
293598	7 transmembrane receptor (rhodopsin family)
293599	7 transmembrane receptor (rhodopsin family)
293600	7 transmembrane receptor (rhodopsin family)
293601	7 transmembrane receptor (rhodopsin family)
293602	7 transmembrane receptor (rhodopsin family)
293603	7 transmembrane receptor (rhodopsin family)
293604	7 transmembrane receptor (rhodopsin family)
293605	7 transmembrane receptor (rhodopsin family)
293606	7 transmembrane receptor (rhodopsin family)
293607	7 transmembrane receptor (rhodopsin family)
293608	7 transmembrane receptor (rhodopsin family)
293609	7 transmembrane receptor (rhodopsin family)
293612	Ribosomal protein L21e
293613	Transthyretin precursor (formerly prealbumin). Transthyretin is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. Mutations in the human transthyretin are associated with several genetic disorders
293615	Sir2 family
293616	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
293620	Phosphatidyl serine synthase. Phosphatidyl serine synthase is also known as serine exchange enzyme. This family represents eukaryotic PSS I and II which are membrane bound proteins which catalyses the replacement of the head group of a phospholipid (phosp
293621	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
293621	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
293625	Death domain
293626	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
293627	Tetraspanin family
293628	Glycosyl hydrolases family 18
293631	Protein kinase domain
293639	7 transmembrane receptor (rhodopsin family)
293640	NAD synthase. NAD synthase (EC:6.3.5.1) is involved in the de novo synthesis of NAD and is induced by stress factors such as heat shock and glucose limitation
293641	Actin
293642	Ribosomal protein L21e
293643	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
293646	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
293648	7 transmembrane receptor (rhodopsin family)
293650	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
293652	4Fe-4S binding domain. Superfamily includes proteins containing domains which bind to iron-sulfur clusters. Members include bacterial ferredoxins, various dehydrogenases, and various reductases. Structure of the domain is an alpha-antiparallel beta sandwi
293654	C2 domain
293655	Respiratory-chain NADH dehydrogenase 51 Kd subunit
293656	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
293656	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
293657	DNA polymerase delta, subunit 4
293660	ADP-ribosylation factor family
293660	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
293661	PH domain. PH stands for pleckstrin homology
293662	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
293663	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
293664	Ribosomal L29e protein family
293665	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
293666	Ribosomal protein L11, RNA binding domain
293666	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
293670	ADP-ribosylation factor family
293670	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
293670	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
293672	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
293673	Eukaryotic initiation factor 1A
293676	Papain family cysteine protease
293676	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
293678	ERCC4 domain. This domain is predicted to be a nuclease domain
293686	Sugar (and other) transporter
293688	Ergosterol biosynthesis ERG4/ERG24 family
293690	SAC3/GANP family. This family of eukaryotic proteins brings together the yeast nuclear export factor Sac3, and mammalian GANP/MCM3-associated proteins, which facilitate the nuclear localization of MCM3, a protein that associates with chromatin in the G1 p
293691	MIT domain
293691	PX domain. PX domains bind to phosphoinositides
293693	Protein kinase domain
293693	Intermediate filament protein
293693	Protein kinase C terminal domain
293693	PH domain. PH stands for pleckstrin homology
293693	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
293693	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
293693	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
293694	Protein kinase domain
293694	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
293696	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
293697	Ribosomal L29e protein family
293699	ATP synthase subunit C
293700	Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest
293701	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
293702	FKBP-type peptidyl-prolyl cis-trans isomerase
293704	Domain of unknown function DUF60. The proteins in this family have no known function. Two proteins have been annotated as phosphotransferases, however this is not supported experimentally, and should be viewed with caution
293710	Galactoside-binding lectin
293715	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
293719	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
293721	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
293725	Elongation factor 1 gamma, conserved domain
293725	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
293729	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
293730	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
293733	Inner centromere protein, ARK binding region. This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregatio
293735	Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown
293737	Core histone H2A/H2B/H3/H4
293739	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
293740	PX domain. PX domains bind to phosphoinositides
293741	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
293742	Ribosomal protein L6e
293743	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit,
293743	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
293744	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
293746	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
293747	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
293747	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
293748	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
293749	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
293751	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
293752	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
293754	Ribosomal protein L16
293757	Oxysterol-binding protein
293758	7 transmembrane receptor (rhodopsin family)
293759	7 transmembrane receptor (rhodopsin family)
293760	7 transmembrane receptor (rhodopsin family)
293761	7 transmembrane receptor (rhodopsin family)
293762	7 transmembrane receptor (rhodopsin family)
293763	7 transmembrane receptor (rhodopsin family)
293764	7 transmembrane receptor (rhodopsin family)
293765	7 transmembrane receptor (rhodopsin family)
293767	7 transmembrane receptor (rhodopsin family)
293768	7 transmembrane receptor (rhodopsin family)
293769	7 transmembrane receptor (rhodopsin family)
293770	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
293771	7 transmembrane receptor (rhodopsin family)
293773	7 transmembrane receptor (rhodopsin family)
293776	Fibrillarin
293778	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293778	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
293780	7 transmembrane receptor (rhodopsin family)
293781	7 transmembrane receptor (rhodopsin family)
293783	LIM domain. This family represents two copies of the LIM structural domain
293784	7 transmembrane receptor (rhodopsin family)
293785	7 transmembrane receptor (rhodopsin family)
293786	7 transmembrane receptor (rhodopsin family)
293787	7 transmembrane receptor (rhodopsin family)
293788	7 transmembrane receptor (rhodopsin family)
293789	7 transmembrane receptor (rhodopsin family)
293790	7 transmembrane receptor (rhodopsin family)
293791	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
293792	7 transmembrane receptor (rhodopsin family)
293793	7 transmembrane receptor (rhodopsin family)
293794	7 transmembrane receptor (rhodopsin family)
293795	7 transmembrane receptor (rhodopsin family)
293796	7 transmembrane receptor (rhodopsin family)
293797	7 transmembrane receptor (rhodopsin family)
293798	BTG1 family. A novel family of anti-proliferative proteins
293799	7 transmembrane receptor (rhodopsin family)
293800	7 transmembrane receptor (rhodopsin family)
293801	7 transmembrane receptor (rhodopsin family)
293802	7 transmembrane receptor (rhodopsin family)
293803	7 transmembrane receptor (rhodopsin family)
293804	7 transmembrane receptor (rhodopsin family)
293805	7 transmembrane receptor (rhodopsin family)
293806	7 transmembrane receptor (rhodopsin family)
293808	7 transmembrane receptor (rhodopsin family)
293809	7 transmembrane receptor (rhodopsin family)
293810	7 transmembrane receptor (rhodopsin family)
293811	7 transmembrane receptor (rhodopsin family)
293812	7 transmembrane receptor (rhodopsin family)
293813	7 transmembrane receptor (rhodopsin family)
293814	7 transmembrane receptor (rhodopsin family)
293815	7 transmembrane receptor (rhodopsin family)
293816	7 transmembrane receptor (rhodopsin family)
293817	7 transmembrane receptor (rhodopsin family)
293818	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293820	Aminotransferase class-V
293827	Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi
293829	Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily
293831	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
293832	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
293833	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
293834	Intermediate filament protein
293835	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293836	HMG (high mobility group) box
293837	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
293839	LIM domain. This family represents two copies of the LIM structural domain
293842	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
293843	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
293843	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
293846	Ribosomal L28e protein family
293847	Protein-tyrosine phosphatase
293847	Putative tyrosine phosphatase family. This family consists of putative tyrosine phosphatase proteins, this function is inferred from several sequences at the top of the noise, EC:3.1.3.48
293847	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
293851	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
293855	Isocitrate/isopropylmalate dehydrogenase
293856	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
293858	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
293861	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
293864	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
293866	AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this
293867	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
293868	Ribosomal L29e protein family
293870	Ribosomal protein S19
293874	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
293875	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b
293876	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293876	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
293878	7 transmembrane receptor (rhodopsin family)
293880	Cyclophilin type peptidyl-prolyl cis-trans isomerase
293883	Phosphatidylethanolamine-binding protein
293884	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
293886	Cdc37 family. In the budding yeast Saccharomyces cerevisiae, Cdc37 is required for the productive formation of Cdc28-cyclin complexes. Cdc37 may be a kinase targeting subunit of Hsp90
293887	Dynein light chain type 1
293894	DNA gyrase/topoisomerase IV, subunit A
293894	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction a
293896	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
293899	Eukaryotic ribosomal protein L18
293903	DNA repair protein Rad4
293905	Ribosomal L15
293906	Alkaline phosphatase
293907	Ribosomal L10
293908	Occludin/ELL family
293909	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
293911	Eukaryotic porin
293913	HSF-type DNA-binding
293915	Mitochondrial carrier protein
293920	Ribosomal protein L13e
293921	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293921	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
293924	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
293924	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
293924	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
293927	Sulfatase
293928	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
293929	Ribosomal protein S2
293931	Ribosomal protein L21e
293932	Ribosomal L29e protein family
293935	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy
293937	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
293939	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
293940	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
293941	Ribosomal protein L10
293942	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
293945	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
293946	Thrombospondin N-terminal -like domain
293947	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
293950	Immunoglobulin A1 protease. This family consists of immunoglobulin A1 protease proteins. The immunoglobulin A1 protease cleaves immunoglobulin IgA and is found in pathogenic bacteria such as Neisseria gonorrhoeae. Not all of the members of this family are
293952	Josephin
293954	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
293955	Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular m
293957	Mitochondrial carrier protein
293958	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
293960	A20-like zinc finger. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappa B activation
293960	AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets in
293962	Ribosomal protein L21e
293964	Ribosomal protein S17
293967	SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily
293970	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
293971	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
293974	Phosphoglucomutase/phosphomannomutase, C-terminal domain
293974	pfam02880, PGM_PMM_III, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain III
293975	pfam02878, PGM_PMM_I, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I
293975	pfam02879, PGM_PMM_II, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II
293975	pfam02880, PGM_PMM_III, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain III
293976	DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown
293976	DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai
293978	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
293979	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
293980	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
293981	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
293982	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
293984	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
293985	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
293987	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
293988	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
293989	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
293990	wnt family
293994	Ribosomal protein L21e
293996	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
293997	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
293998	Homeobox domain
294000	Homeobox domain
294001	Homeobox domain
294006	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
294007	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
294011	Tricarboxylate carrier
294014	Domain of unknown function DUF21. This transmembrane region has no known function. Many of the sequences in this family are annotated as hemolysins, however this is due to a similarity to a protein which does not contain this domain. This domain is found
294017	Ribosomal protein S5, C-terminal domain
294018	WD domain, G-beta repeat
294021	Translation initiation factor SUI1
294027	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
294029	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
294035	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
294035	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
294035	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
294036	Ribosomal protein L21e
294046	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
294047	Ribosomal protein L36e
294051	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
294051	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to
294052	Ribosomal protein L21e
294053	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
294053	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
294056	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294056	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294057	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294057	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294058	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
294059	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
294059	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
294061	Proteasome A-type and B-type
294063	Ribosomal S17
294065	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
294065	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
294068	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
294069	Ribosomal protein S8e
294070	Ribosomal protein L15
294072	Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region
294075	7 transmembrane receptor (rhodopsin family)
294076	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
294077	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
294081	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
294081	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
294085	metallopeptidase family M24
294086	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
294087	Proteasome A-type and B-type
294088	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isoform
294092	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
294093	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
294094	ab-hydrolase associated lipase region
294094	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
294096	ab-hydrolase associated lipase region
294096	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
294097	ab-hydrolase associated lipase region
294097	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
294101	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
294102	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
294103	ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for PA
294106	Actin
294109	7 transmembrane receptor (rhodopsin family)
294111	Ribosomal protein S21e
294114	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
294115	HMG (high mobility group) box
294116	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
294117	Ribosomal protein L19e
294119	DOMON domain. The DOMON (named after dopamine beta-monooxygenase N-terminal) domain is 110-125 residues long. It is predicted to form an all beta fold with 7-8 strands. The domain may mediate extracellular adhesive interactions
294121	7 transmembrane receptor (rhodopsin family)
294122	7 transmembrane receptor (rhodopsin family)
294122	7 transmembrane receptor (rhodopsin family)
294123	7 transmembrane receptor (rhodopsin family)
294124	7 transmembrane receptor (rhodopsin family)
294124	7 transmembrane receptor (rhodopsin family)
294125	7 transmembrane receptor (rhodopsin family)
294125	7 transmembrane receptor (rhodopsin family)
294126	7 transmembrane receptor (rhodopsin family)
294126	7 transmembrane receptor (rhodopsin family)
294127	7 transmembrane receptor (rhodopsin family)
294128	7 transmembrane receptor (rhodopsin family)
294129	7 transmembrane receptor (rhodopsin family)
294130	7 transmembrane receptor (rhodopsin family)
294130	7 transmembrane receptor (rhodopsin family)
294131	7 transmembrane receptor (rhodopsin family)
294131	7 transmembrane receptor (rhodopsin family)
294132	7 transmembrane receptor (rhodopsin family)
294132	7 transmembrane receptor (rhodopsin family)
294133	7 transmembrane receptor (rhodopsin family)
294133	7 transmembrane receptor (rhodopsin family)
294134	7 transmembrane receptor (rhodopsin family)
294134	7 transmembrane receptor (rhodopsin family)
294136	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
294137	7 transmembrane receptor (rhodopsin family)
294138	7 transmembrane receptor (rhodopsin family)
294139	7 transmembrane receptor (rhodopsin family)
294140	7 transmembrane receptor (rhodopsin family)
294143	7 transmembrane receptor (rhodopsin family)
294143	Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans
294144	7 transmembrane receptor (rhodopsin family)
294145	7 transmembrane receptor (rhodopsin family)
294146	7 transmembrane receptor (rhodopsin family)
294147	7 transmembrane receptor (rhodopsin family)
294148	7 transmembrane receptor (rhodopsin family)
294149	7 transmembrane receptor (rhodopsin family)
294150	7 transmembrane receptor (rhodopsin family)
294151	7 transmembrane receptor (rhodopsin family)
294152	7 transmembrane receptor (rhodopsin family)
294153	7 transmembrane receptor (rhodopsin family)
294153	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
294155	7 transmembrane receptor (rhodopsin family)
294157	7 transmembrane receptor (rhodopsin family)
294158	7 transmembrane receptor (rhodopsin family)
294159	7 transmembrane receptor (rhodopsin family)
294160	7 transmembrane receptor (rhodopsin family)
294161	7 transmembrane receptor (rhodopsin family)
294162	7 transmembrane receptor (rhodopsin family)
294163	7 transmembrane receptor (rhodopsin family)
294164	7 transmembrane receptor (rhodopsin family)
294165	Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold
294165	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
294165	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
294166	7 transmembrane receptor (rhodopsin family)
294167	7 transmembrane receptor (rhodopsin family)
294168	7 transmembrane receptor (rhodopsin family)
294173	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
294174	Class I Histocompatibility antigen, domains alpha 1 and 2
294175	7 transmembrane receptor (rhodopsin family)
294176	7 transmembrane receptor (rhodopsin family)
294177	7 transmembrane receptor (rhodopsin family)
294179	7 transmembrane receptor (rhodopsin family)
294181	7 transmembrane receptor (rhodopsin family)
294183	7 transmembrane receptor (rhodopsin family)
294184	7 transmembrane receptor (rhodopsin family)
294185	7 transmembrane receptor (rhodopsin family)
294186	7 transmembrane receptor (rhodopsin family)
294187	7 transmembrane receptor (rhodopsin family)
294188	7 transmembrane receptor (rhodopsin family)
294189	7 transmembrane receptor (rhodopsin family)
294190	7 transmembrane receptor (rhodopsin family)
294191	7 transmembrane receptor (rhodopsin family)
294192	7 transmembrane receptor (rhodopsin family)
294193	7 transmembrane receptor (rhodopsin family)
294194	7 transmembrane receptor (rhodopsin family)
294195	Class I Histocompatibility antigen, domains alpha 1 and 2
294196	7 transmembrane receptor (rhodopsin family)
294197	7 transmembrane receptor (rhodopsin family)
294198	7 transmembrane receptor (rhodopsin family)
294199	7 transmembrane receptor (rhodopsin family)
294200	7 transmembrane receptor (rhodopsin family)
294201	7 transmembrane receptor (rhodopsin family)
294202	7 transmembrane receptor (rhodopsin family)
294203	7 transmembrane receptor (rhodopsin family)
294204	7 transmembrane receptor (rhodopsin family)
294205	Ribosomal protein S19
294208	B-box zinc finger
294208	Zinc finger, C3HC4 type (RING finger)
294208	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
294209	Zinc finger, C3HC4 type (RING finger)
294210	B-box zinc finger
294210	Zinc finger, C3HC4 type (RING finger)
294210	Phosphatidylethanolamine-binding protein
294210	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
294211	Class I Histocompatibility antigen, domains alpha 1 and 2
294212	Class I Histocompatibility antigen, domains alpha 1 and 2
294213	Class I Histocompatibility antigen, domains alpha 1 and 2
294213	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
294214	Domain of unknown function (DUF380). Domain of unknown function, present in a ribonuclease P subunit in humans. Possibly a metal-binding domain
294215	Class I Histocompatibility antigen, domains alpha 1 and 2
294216	Class I Histocompatibility antigen, domains alpha 1 and 2
294217	Class I Histocompatibility antigen, domains alpha 1 and 2
294218	Class I Histocompatibility antigen, domains alpha 1 and 2
294219	Class I Histocompatibility antigen, domains alpha 1 and 2
294220	Class I Histocompatibility antigen, domains alpha 1 and 2
294221	Class I Histocompatibility antigen, domains alpha 1 and 2
294222	Class I Histocompatibility antigen, domains alpha 1 and 2
294223	Class I Histocompatibility antigen, domains alpha 1 and 2
294224	Ribosomal protein S5, C-terminal domain
294224	Ribosomal protein S5, N-terminal domain
294225	Class I Histocompatibility antigen, domains alpha 1 and 2
294227	Class I Histocompatibility antigen, domains alpha 1 and 2
294228	Class I Histocompatibility antigen, domains alpha 1 and 2
294234	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
294234	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
294236	Transcription factor Tfb2
294237	Class I Histocompatibility antigen, domains alpha 1 and 2
294238	Class I Histocompatibility antigen, domains alpha 1 and 2
294239	Amidinotransferase. This family contains glycine (EC:2.1.4.1) and inosamine (EC:2.1.4.2) amidinotransferases, enzymes involved in creatine and streptomycin biosynthesis respectively. This family also includes arginine deiminases, EC:3.5.3.6. These enzymes
294247	Casein kinase II regulatory subunit
294248	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
294252	MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p
294254	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
294254	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
294256	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
294257	Trypsin
294258	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
294262	Fibrinogen beta and gamma chains, C-terminal globular domain
294266	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
294269	Class II histocompatibility antigen, alpha domain
294269	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
294270	Class II histocompatibility antigen, beta domain
294270	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
294271	Class II histocompatibility antigen, beta domain
294271	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
294272	SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins
294272	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
294272	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
294273	Class II histocompatibility antigen, beta domain
294274	Class II histocompatibility antigen, alpha domain
294278	Class II histocompatibility antigen, alpha domain
294278	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
294279	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
294282	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
294283	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
294286	Kinesin motor domain
294288	CutA1 divalent ion tolerance protein. Several gene loci with a possible involvement in cellular tolerance to copper have been identified. One such locus in eubacteria and archaebacteria, cutA, is thought to be involved in cellular tolerance to a wide vari
294289	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
294292	NUDIX domain
294293	Ets-domain
294293	Sterile alpha motif (SAM)/Pointed domain
294297	CUB domain
294299	TEA/ATTS domain family
294304	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
294307	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
294309	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
294312	SCP-like extracellular protein. This domain is also found in prokaryotes
294313	Mitochondrial carrier protein
294314	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
294316	NAC domain
294320	Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily
294326	DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117
294328	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
294328	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
294328	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
294331	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
294334	S-adenosyl-L-homocysteine hydrolase
294335	von Willebrand factor type D domain
294336	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
294337	von Willebrand factor type A domain
294338	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
294341	Sodium:solute symporter family
294342	Sodium:solute symporter family
294344	Ribosomal protein L13e
294347	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
294351	Ribosomal protein L21e
294352	Cadherin domain
294356	Actin
294358	Proteasome A-type and B-type
294359	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
294359	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
294361	Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones
294367	Ribosomal protein S5, N-terminal domain
294368	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
294370	Enolase, N-terminal domain
294370	Enolase, C-terminal TIM barrel domain
294376	Dynein light chain type 1
294377	Ribosomal protein S2
294379	Vinculin family
294380	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
294382	Vinculin family
294386	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps
294391	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
294391	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
294392	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
294393	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294393	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294394	7 transmembrane receptor (metabotropic glutamate family)
294394	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
294395	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
294396	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
294399	Ribosomal protein L5
294399	ribosomal L5P family C-terminus. This region is found associated with pfam00281
294400	Putative undecaprenyl diphosphate synthase. Previously known as uncharacterized protein family UPF0015, a single member of this family has been identified as an undecaprenyl diphosphate synthase
294401	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
294405	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294405	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294406	MCM2/3/5 family
294413	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
294415	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
294415	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
294416	Ribosomal L15
294417	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294417	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294418	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294418	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294420	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
294421	TMS membrane protein/tumour differentially expressed protein (TDE)
294421	TMS membrane protein/tumour differentially expressed protein (TDE)
294423	Homeobox domain
294424	Homeobox domain
294425	Homeobox domain
294426	Homeobox domain
294427	Homeobox domain
294428	Homeobox domain
294429	RanBP1 domain
294429	Cyclophilin type peptidyl-prolyl cis-trans isomerase
294434	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
294435	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
294436	Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids
294439	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
294440	Thrombospondin N-terminal -like domain
294444	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
294447	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294447	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294448	Ribosomal protein L13
294449	Sulfotransferase protein
294451	Cyclophilin type peptidyl-prolyl cis-trans isomerase
294453	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
294458	Homeobox domain
294459	Homeobox domain
294460	Homeobox domain
294461	Homeobox domain
294462	Homeobox domain
294463	Homeobox domain
294464	Homeobox domain
294465	Homeobox domain
294466	Homeobox domain
294467	Homeobox domain
294468	Homeobox domain
294469	Homeobox domain
294470	Homeobox domain
294471	Homeobox domain
294472	Homeobox domain
294473	Homeobox domain
294474	Homeobox domain
294475	Homeobox domain
294476	Homeobox domain
294477	Homeobox domain
294478	Homeobox domain
294479	Homeobox domain
294480	Homeobox domain
294481	Homeobox domain
294482	Homeobox domain
294483	Homeobox domain
294484	Homeobox domain
294485	Homeobox domain
294486	Homeobox domain
294487	Homeobox domain
294488	Homeobox domain
294489	Homeobox domain
294490	Homeobox domain
294491	Homeobox domain
294491	Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain
294492	Homeobox domain
294493	Homeobox domain
294494	Homeobox domain
294495	Homeobox domain
294496	Homeobox domain
294497	Homeobox domain
294498	Homeobox domain
294503	Transforming growth factor beta like domain
294504	Inorganic pyrophosphatase
294506	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
294509	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294509	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294510	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
294515	Fork head domain
294524	Translation initiation factor SUI1
294528	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
294529	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
294530	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
294532	'Cold-shock' DNA-binding domain
294533	'Cold-shock' DNA-binding domain
294534	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
294535	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
294540	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
294543	Ribosomal protein S6e
294545	Protein kinase domain
294547	Ribosomal protein L21e
294549	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
294554	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
294555	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
294559	Zona pellucida-like domain
294562	Homeobox domain
294562	Pou domain - N-terminal to homeobox domain
294564	TEA/ATTS domain family
294566	Skp1 family, tetramerisation domain
294567	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
294568	WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin
294569	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
294570	Homeobox domain
294571	Homeobox domain
294572	Homeobox domain
294573	Homeobox domain
294574	Homeobox domain
294575	Homeobox domain
294576	Homeobox domain
294577	Homeobox domain
294578	Homeobox domain
294579	Homeobox domain
294580	Homeobox domain
294581	Homeobox domain
294582	Homeobox domain
294583	Homeobox domain
294584	Homeobox domain
294585	Homeobox domain
294586	Homeobox domain
294587	Homeobox domain
294588	Homeobox domain
294589	Homeobox domain
294590	Vacuolar protein sorting-associated protein 35. Vacuolar protein sorting-associated protein (Vps) 35 is one of around 50 proteins involved in protein trafficking. In particular, Vps35 assembles into a retromer complex with at least four other proteins Vps
294592	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
294593	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
294595	TPR Domain
294598	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294600	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
294601	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
294602	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
294603	Stromal antigen (SA/STAG) protein
294606	Homeobox domain
294606	Pou domain - N-terminal to homeobox domain
294607	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
294608	TT viral orf 1. TT virus (TTV), isolated initially from a Japanese patient with hepatitis of unknown aetiology, has since been found to infect both healthy and diseased individuals and numerous prevalence studies have raised questions about its role in un
294609	Ribosomal protein L6e
294617	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
294617	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
294618	Uracil DNA glycosylase superfamily
294620	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
294621	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
294623	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294625	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294625	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294629	Eukaryotic porin
294635	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
294637	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
294638	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
294639	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
294640	Homeobox domain
294642	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
294644	Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13,
294647	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
294648	Ribosomal protein L31e
294649	MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p
294650	DNA mismatch repair proteins, mutS family
294650	MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p
294655	Ribosomal protein S6e
294657	Sushi domain (SCR repeat)
294657	Lectin C-type domain. This family includes both long and short form C-type
294657	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
294658	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294658	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294660	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
294661	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
294662	ATP synthase subunit C
294663	7 transmembrane receptor (rhodopsin family)
294664	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
294668	Ribosomal protein S26e
294668	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
294669	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
294672	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
294672	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
294673	Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold
294674	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
294676	Ribosomal protein S15
294679	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
294680	NAC domain
294689	Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat
294690	Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat
294691	Ribosomal protein L21e
294691	Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat
294692	ENV polyprotein (coat polyprotein)
294693	Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex
294698	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
294700	Ribosomal protein L21e
294702	Ribosomal protein L35Ae
294704	Nucleoside diphosphate kinase
294705	ICE-like protease (caspase) p20 domain
294706	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
294707	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
294708	TPR Domain
294711	Cyclophilin type peptidyl-prolyl cis-trans isomerase
294713	Ribosomal protein L21e
294716	pfam02872, 5_nucleotidaseC, 5'-nucleotidase, C-terminal domain
294717	Zinc finger, C3HC4 type (RING finger)
294718	Ribosomal RNA adenine dimethylase
294721	Ribosomal L29e protein family
294722	Ribosomal L29e protein family
294725	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294727	Mitochondrial carrier protein
294728	Mitochondrial carrier protein
294730	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
294731	Phosphoglycerate kinase
294732	Actin
294737	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
294739	Fibronectin type III domain
294741	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
294744	Glutathione peroxidase
294746	Trypsin
294748	Translation initiation factor SUI1
294753	MoaE protein. This family contains the MoaE protein that is involved in biosynthesis of molybdopterin. Molybdopterin, the universal component of the pterin molybdenum cofactors, contains a dithiolene group serving to bind Mo. Addition of the dithiolene su
294754	eRF1 domain 2. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
294754	eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
294764	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294764	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294771	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
294773	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
294776	Ets-domain
294777	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
294780	Fibronectin type III domain
294781	Ribosomal protein L21e
294788	Ribosomal protein S5, C-terminal domain
294788	Ribosomal protein S5, N-terminal domain
294790	F-box domain
294792	UDP-glucoronosyl and UDP-glucosyl transferase
294793	UDP-glucoronosyl and UDP-glucosyl transferase
294794	UDP-glucoronosyl and UDP-glucosyl transferase
294795	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
294798	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
294799	Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids
294800	Repair protein Rad1/Rec1/Rad17
294802	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294805	Profilin
294806	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
294807	7 transmembrane receptor (rhodopsin family)
294809	Ribosomal protein L23
294809	Thrombospondin type 1 domain
294809	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
294810	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
294811	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294814	RNase3 domain
294814	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea
294815	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
294816	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
294817	Ribosomal protein S26e
294818	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294821	Peptidase family M49
294823	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
294826	ENV polyprotein (coat polyprotein)
294829	HMG (high mobility group) box
294831	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
294832	Ribosomal L29 protein
294835	Hsp90 protein
294840	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294840	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294842	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
294844	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294848	Hsp90 protein
294849	Hsp90 protein
294852	Ribosomal protein L19e
294853	Intermediate filament protein
294854	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
294857	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
294858	Aconitase family (aconitate hydratase)
294861	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
294864	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
294869	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
294871	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
294872	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
294873	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
294874	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
294875	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
294876	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
294879	Eukaryotic-type carbonic anhydrase
294884	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
294885	Fructose-bisphosphate aldolase class-I
294885	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
294887	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
294888	Mitosis protein DIM1
294888	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
294889	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
294891	Mitosis protein DIM1
294892	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
294893	ENV polyprotein (coat polyprotein)
294894	Eukaryotic initiation factor 1A
294895	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294896	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription b
294897	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
294900	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction a
294901	Uncharacterized ACR, COG1490
294902	Eukaryotic porin
294903	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294903	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294908	Phosphoglycerate kinase
294909	FMN-dependent dehydrogenase
294909	2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor
294909	Conserved region in glutamate synthase. This family represents a region of the glutamate synthase protein. This region is expressed as a separate subunit in the glutamate synthase alpha subunit from archaebacteria, or part of a large multidomain enzyme in
294909	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
294912	Translocation protein Sec62
294913	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
294916	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
294917	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
294922	Ribosomal protein S6e
294923	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
294924	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
294926	Fibronectin type III domain
294933	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
294933	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
294934	Carboxylesterase
294935	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
294941	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
294942	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
294946	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
294947	3'5'-cyclic nucleotide phosphodiesterase
294950	Phosphoglycerate kinase
294953	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
294954	Helix-loop-helix DNA-binding domain
294955	Ribosomal protein L21e
294958	Tropomyosin
294959	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f
294961	Calcium-activated potassium channel, beta subunit
294961	Calcium-activated potassium channel, beta subunit
294962	WD domain, G-beta repeat
294962	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
294966	Microtubule associated protein (MAP65/ASE1 family)
294968	Macrophage migration inhibitory factor (MIF)
294970	ENV polyprotein (coat polyprotein)
294972	Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins
294974	7 transmembrane receptor (rhodopsin family)
294976	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
294979	Adenosine-deaminase (editase) domain
294983	Galactoside-binding lectin
294985	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
294985	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
294987	Ribosomal protein L23
294989	Ribosomal protein L31e
294990	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
294991	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
294991	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
294995	Coenzyme A transferase
295000	Ribosomal L15
295004	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
295007	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
295008	Fructose-bisphosphate aldolase class-I
295010	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
295013	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295014	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295015	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295016	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
295018	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
295019	Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene
295022	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295023	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295024	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295025	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295026	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295027	Cadherin domain
295028	Ribosomal L18ae protein family
295030	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
295031	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
295034	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
295037	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
295038	Eukaryotic initiation factor 4E
295041	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
295043	Uncharacterized ACR, YggU family COG1872
295047	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
295050	3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
295050	3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
295051	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
295052	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
295054	Helix-loop-helix DNA-binding domain
295057	Beige/BEACH domain
295065	Ribosomal protein S19e
295066	Zinc finger, C3HC4 type (RING finger)
295066	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
295066	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
295072	Carboxylesterase
295073	Carboxylesterase
295073	Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl
295074	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
295074	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
295075	Carboxylesterase
295076	Carboxylesterase
295077	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
295080	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
295081	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
295082	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
295084	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
295085	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
295085	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
295086	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
295088	GMP synthase C terminal domain. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. This family is the C-terminal domain specific to the GMP synthases EC:6.3.5.2. In prokaryotes this domain mediates dimerisation. E
295089	7 transmembrane receptor (metabotropic glutamate family)
295090	7 transmembrane receptor (metabotropic glutamate family)
295091	7 transmembrane receptor (metabotropic glutamate family)
295092	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295093	7 transmembrane receptor (metabotropic glutamate family)
295094	7 transmembrane receptor (metabotropic glutamate family)
295096	7 transmembrane receptor (metabotropic glutamate family)
295098	Ribosomal protein L44
295099	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295099	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295101	Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti
295103	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
295107	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
295107	SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily
295112	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
295112	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
295113	Rpp14 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40
295114	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
295117	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
295117	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
295118	Ribosomal protein L19e
295122	GTPase of unknown function
295123	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
295124	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
295129	Eukaryotic porin
295130	WD domain, G-beta repeat
295131	Intermediate filament protein
295133	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
295133	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
295134	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in comple
295135	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295135	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295138	Ribosomal protein S5, C-terminal domain
295139	Hsp90 protein
295140	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
295140	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
295144	7 transmembrane receptor (rhodopsin family)
295154	Hsp90 protein
295156	Enolase, N-terminal domain
295156	Enolase, C-terminal TIM barrel domain
295156	Ribosomal protein S5, N-terminal domain
295159	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
295160	Mnd1 family. This family of proteins includes MND1 from S. cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair
295162	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di
295162	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
295165	WD domain, G-beta repeat
295168	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
295169	Ribosomal protein L21e
295171	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
295172	Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase
295173	N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere
295175	N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere
295179	Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase
295180	N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere
295184	Serine hydroxymethyltransferase
295186	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
295189	Polyprenyl synthetase
295190	Spumavirus gag protein
295192	WD domain, G-beta repeat
295193	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
295195	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
295199	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
295199	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295200	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295201	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295202	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
295202	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295204	S-adenosylmethionine synthetase, central domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold
295204	S-adenosylmethionine synthetase, C-terminal domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold
295207	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
295208	Ribosomal protein S5, C-terminal domain
295209	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
295211	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
295213	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
295214	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
295214	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
295219	AMP-binding enzyme
295220	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
295221	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
295222	Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins
295227	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
295228	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
295229	YjeF-related protein N-terminus
295230	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
295234	Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyne
295239	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
295239	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
295240	Protein kinase domain
295242	Protein of unknown function DUF124. This prokaryotic protein family has no known function
295245	MtN3/saliva family. This family includes proteins such as drosophila saliva, MtN3 involved in root nodule development and a protein involved in activation and expression of recombination activation genes (RAGs). Although the molecular function of these pr
295249	Phosphoadenosine phosphosulfate reductase family. This domain is found in phosphoadenosine phosphosulfate (PAPS) reductase enzymes or PAPS sulfotransferase. PAPS reductase is part of the adenine nucleotide alpha hydrolases superfamily also including N typ
295250	Phosphotyrosine interaction domain (PTB/PID)
295251	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
295252	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
295254	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
295254	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
295254	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295255	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
295255	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295256	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
295256	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295257	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
295258	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
295258	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295259	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
295261	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
295262	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
295263	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
295268	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
295268	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
295269	Ribosomal L15
295271	BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems
295272	Core histone H2A/H2B/H3/H4
295273	Core histone H2A/H2B/H3/H4
295274	Core histone H2A/H2B/H3/H4
295274	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
295277	Core histone H2A/H2B/H3/H4
295278	Core histone H2A/H2B/H3/H4
295278	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
295280	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295280	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295284	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
295285	Extracellular link domain
295287	Intermediate filament protein
295288	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
295292	Protein kinase domain
295294	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
295295	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
295296	Ets-domain
295297	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
295298	HMG (high mobility group) box
295300	Enolase, C-terminal TIM barrel domain
295304	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
295304	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
295305	Histidine acid phosphatase
295307	ENV polyprotein (coat polyprotein)
295309	Disintegrin
295309	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
295309	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
295314	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
295316	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
295317	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
295319	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
295321	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
295322	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
295323	Filamin/ABP280 repeat
295324	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
295326	Mab-21 protein
295327	Helix-loop-helix DNA-binding domain
295329	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
295329	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
295330	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295330	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295332	Ribosomal L15
295333	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
295333	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
295333	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
295335	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
295339	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
295340	Ribosomal protein L10
295341	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
295342	ADP-ribosylation factor family
295342	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
295343	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
295344	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
295345	Ribosomal protein S6e
295346	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
295347	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
295349	Glycosyl hydrolases family 18
295350	Glycosyl hydrolases family 18
295351	Glycosyl hydrolases family 18
295352	Glycosyl hydrolases family 18
295353	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295353	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295355	7 transmembrane receptor (rhodopsin family)
295358	Ribosomal L29e protein family
295359	Ribosomal protein L34e
295360	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
295360	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
295361	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
295362	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
295362	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
295363	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
295367	Cadherin domain
295367	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
295367	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
295367	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
295370	WD domain, G-beta repeat
295373	Ribosomal protein L21e
295375	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
295376	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
295378	PH domain. PH stands for pleckstrin homology
295378	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
295379	Ribosomal protein S17
295380	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
295382	Laminin N-terminal (Domain VI)
295384	Conserved hypothetical protein 95
295385	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
295386	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295386	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295388	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str
295388	Alpha amylase, C-terminal all-beta domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding betwee
295389	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
295390	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
295393	Ribosomal protein L19e
295395	RNA 3'-terminal phosphate cyclase
295400	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
295401	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
295403	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
295406	AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-
295407	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295411	Ribonucleotide reductase, small chain
295412	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
295420	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295420	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295423	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295423	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295429	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
295430	Sulfotransferase protein
295435	Ribosomal protein S6e
295436	Ribosomal protein L19e
295437	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295439	Ribosomal protein L21e
295441	ENV polyprotein (coat polyprotein)
295442	ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for PA
295443	ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for PA
295446	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
295448	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
295449	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
295450	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
295452	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295452	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295453	Cyclophilin type peptidyl-prolyl cis-trans isomerase
295458	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
295460	Ribosomal protein S26e
295464	Ribosomal protein L31e
295468	Aminotransferase class-III
295470	Ribosomal L29e protein family
295471	Ribosomal protein S6e
295472	Ribosomal protein L21e
295473	Ribosomal L29e protein family
295475	Helix-loop-helix DNA-binding domain
295476	S-adenosyl-L-homocysteine hydrolase
295480	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
295481	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295487	Ribosomal L29e protein family
295491	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
295495	Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue
295496	Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue
295497	metallopeptidase family M24
295498	Zinc-binding dehydrogenase
295499	Zinc-binding dehydrogenase
295500	metallopeptidase family M24
295501	Cyclophilin type peptidyl-prolyl cis-trans isomerase
295501	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
295502	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
295503	Nucleoside diphosphate kinase
295505	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
295506	Eukaryotic-type carbonic anhydrase
295508	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295508	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295509	Zinc finger, C3HC4 type (RING finger)
295511	Death domain
295512	Ribosomal L39 protein
295518	SH2 domain
295523	Hr1 repeat
295523	Protein kinase domain
295523	Protein kinase C terminal domain
295524	Ribosomal protein S19e
295529	14-3-3 protein
295530	Protein kinase domain
295531	Protein kinase domain
295531	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
295532	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
295537	Eukaryotic porin
295538	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
295541	DNA mismatch repair proteins, mutS family
295541	MutS family, N-terminal putative DNA binding domain. This family consists of the N-terminal region of proteins in the mutS family of DNA mismatch repair proteins and is found associated with pfam00488 located in the C-terminal region. The mutS family of p
295543	Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this f
295544	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
295545	Ribosomal protein L6
295549	DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh
295551	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
295553	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
295554	Ribosomal protein S5, C-terminal domain
295556	Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family
295561	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
295562	Ribosomal protein L34
295563	GTPase of unknown function
295566	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
295567	TNF(Tumor Necrosis Factor) family
295568	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
295570	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295572	ENV polyprotein (coat polyprotein)
295574	Ribosomal protein S19
295575	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
295576	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295577	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
295577	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295578	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295580	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
295580	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295581	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
295581	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295582	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295585	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
295585	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
295586	Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. These proteins contain two pfam00571 domains
295587	Ribosomal protein L31e
295588	ADP-ribosylation factor family
295588	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
295590	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
295597	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295597	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295600	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
295603	Dihydropyridine sensitive L-type calcium channel (Beta subunit)
295603	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
295604	Ribosomal L29e protein family
295609	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
295610	Cyclophilin type peptidyl-prolyl cis-trans isomerase
295610	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
295611	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
295612	TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s
295614	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
295616	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295616	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295617	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295617	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295620	Phosphorylase family. Members of this family include: purine nucleoside phosphorylase (PNP) Uridine phosphorylase (UdRPase) 5'-methylthioadenosine phosphorylase (MTA phosphorylase)
295628	Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has
295630	Lectin C-type domain. This family includes both long and short form C-type
295632	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
295633	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
295635	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
295640	WD domain, G-beta repeat
295641	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
295641	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
295642	ENV polyprotein (coat polyprotein)
295643	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295643	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295644	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
295645	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
295648	ENV polyprotein (coat polyprotein)
295649	Shikimate kinase
295649	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
295650	DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117
295658	Ribosomal protein S7e
295660	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
295662	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
295664	Conserved hypothetical protein 95
295664	C-5 cytosine-specific DNA methylase
295664	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
295666	Protein kinase domain
295667	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
295669	Cytochrome b561
295670	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
295672	FKBP-type peptidyl-prolyl cis-trans isomerase
295673	FKBP-type peptidyl-prolyl cis-trans isomerase
295676	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
295681	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295681	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295682	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
295684	HMG (high mobility group) box
295686	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
295688	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
295690	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
295691	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
295691	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
295694	Eukaryotic ribosomal protein L18
295698	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
295698	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
295699	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
295700	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
295701	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
295702	Yippee putative zinc-binding protein
295703	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
295704	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
295705	Mitochondrial carrier protein
295706	Lectin C-type domain. This family includes both long and short form C-type
295708	7 transmembrane receptor (rhodopsin family)
295709	7 transmembrane receptor (rhodopsin family)
295710	7 transmembrane receptor (rhodopsin family)
295711	7 transmembrane receptor (rhodopsin family)
295712	7 transmembrane receptor (rhodopsin family)
295713	7 transmembrane receptor (rhodopsin family)
295714	Fatty acid desaturase
295715	7 transmembrane receptor (rhodopsin family)
295716	7 transmembrane receptor (rhodopsin family)
295717	7 transmembrane receptor (rhodopsin family)
295718	7 transmembrane receptor (rhodopsin family)
295719	7 transmembrane receptor (rhodopsin family)
295720	7 transmembrane receptor (rhodopsin family)
295721	7 transmembrane receptor (rhodopsin family)
295722	7 transmembrane receptor (rhodopsin family)
295723	7 transmembrane receptor (rhodopsin family)
295724	7 transmembrane receptor (rhodopsin family)
295725	7 transmembrane receptor (rhodopsin family)
295726	7 transmembrane receptor (rhodopsin family)
295727	7 transmembrane receptor (rhodopsin family)
295728	7 transmembrane receptor (rhodopsin family)
295729	7 transmembrane receptor (rhodopsin family)
295730	7 transmembrane receptor (rhodopsin family)
295731	Ribosomal protein S19
295732	7 transmembrane receptor (rhodopsin family)
295733	7 transmembrane receptor (rhodopsin family)
295734	7 transmembrane receptor (rhodopsin family)
295735	7 transmembrane receptor (rhodopsin family)
295736	7 transmembrane receptor (rhodopsin family)
295737	7 transmembrane receptor (rhodopsin family)
295738	7 transmembrane receptor (rhodopsin family)
295739	7 transmembrane receptor (rhodopsin family)
295740	7 transmembrane receptor (rhodopsin family)
295741	7 transmembrane receptor (rhodopsin family)
295742	7 transmembrane receptor (rhodopsin family)
295743	7 transmembrane receptor (rhodopsin family)
295744	7 transmembrane receptor (rhodopsin family)
295745	7 transmembrane receptor (rhodopsin family)
295746	7 transmembrane receptor (rhodopsin family)
295747	7 transmembrane receptor (rhodopsin family)
295748	7 transmembrane receptor (rhodopsin family)
295749	7 transmembrane receptor (rhodopsin family)
295750	7 transmembrane receptor (rhodopsin family)
295751	7 transmembrane receptor (rhodopsin family)
295752	7 transmembrane receptor (rhodopsin family)
295753	7 transmembrane receptor (rhodopsin family)
295754	7 transmembrane receptor (rhodopsin family)
295755	7 transmembrane receptor (rhodopsin family)
295756	7 transmembrane receptor (rhodopsin family)
295757	7 transmembrane receptor (rhodopsin family)
295758	7 transmembrane receptor (rhodopsin family)
295759	7 transmembrane receptor (rhodopsin family)
295760	7 transmembrane receptor (rhodopsin family)
295761	7 transmembrane receptor (rhodopsin family)
295762	7 transmembrane receptor (rhodopsin family)
295763	7 transmembrane receptor (rhodopsin family)
295764	7 transmembrane receptor (rhodopsin family)
295765	7 transmembrane receptor (rhodopsin family)
295765	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
295766	7 transmembrane receptor (rhodopsin family)
295767	7 transmembrane receptor (rhodopsin family)
295768	7 transmembrane receptor (rhodopsin family)
295769	7 transmembrane receptor (rhodopsin family)
295770	7 transmembrane receptor (rhodopsin family)
295771	7 transmembrane receptor (rhodopsin family)
295772	7 transmembrane receptor (rhodopsin family)
295773	7 transmembrane receptor (rhodopsin family)
295774	7 transmembrane receptor (rhodopsin family)
295775	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
295776	7 transmembrane receptor (rhodopsin family)
295777	7 transmembrane receptor (rhodopsin family)
295778	7 transmembrane receptor (rhodopsin family)
295778	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
295779	7 transmembrane receptor (rhodopsin family)
295780	7 transmembrane receptor (rhodopsin family)
295781	7 transmembrane receptor (rhodopsin family)
295782	7 transmembrane receptor (rhodopsin family)
295783	7 transmembrane receptor (rhodopsin family)
295784	7 transmembrane receptor (rhodopsin family)
295785	7 transmembrane receptor (rhodopsin family)
295786	7 transmembrane receptor (rhodopsin family)
295787	7 transmembrane receptor (rhodopsin family)
295788	7 transmembrane receptor (rhodopsin family)
295789	7 transmembrane receptor (rhodopsin family)
295790	7 transmembrane receptor (rhodopsin family)
295791	7 transmembrane receptor (rhodopsin family)
295792	7 transmembrane receptor (rhodopsin family)
295793	7 transmembrane receptor (rhodopsin family)
295795	7 transmembrane receptor (rhodopsin family)
295797	7 transmembrane receptor (rhodopsin family)
295798	7 transmembrane receptor (rhodopsin family)
295799	7 transmembrane receptor (rhodopsin family)
295800	7 transmembrane receptor (rhodopsin family)
295802	7 transmembrane receptor (rhodopsin family)
295803	7 transmembrane receptor (rhodopsin family)
295806	7 transmembrane receptor (rhodopsin family)
295807	7 transmembrane receptor (rhodopsin family)
295808	7 transmembrane receptor (rhodopsin family)
295809	7 transmembrane receptor (rhodopsin family)
295810	Actin
295811	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
295812	7 transmembrane receptor (rhodopsin family)
295813	7 transmembrane receptor (rhodopsin family)
295814	7 transmembrane receptor (rhodopsin family)
295815	7 transmembrane receptor (rhodopsin family)
295816	7 transmembrane receptor (rhodopsin family)
295817	7 transmembrane receptor (rhodopsin family)
295818	7 transmembrane receptor (rhodopsin family)
295819	7 transmembrane receptor (rhodopsin family)
295820	7 transmembrane receptor (rhodopsin family)
295821	7 transmembrane receptor (rhodopsin family)
295822	7 transmembrane receptor (rhodopsin family)
295823	7 transmembrane receptor (rhodopsin family)
295824	7 transmembrane receptor (rhodopsin family)
295825	7 transmembrane receptor (rhodopsin family)
295826	7 transmembrane receptor (rhodopsin family)
295827	7 transmembrane receptor (rhodopsin family)
295828	7 transmembrane receptor (rhodopsin family)
295829	7 transmembrane receptor (rhodopsin family)
295830	7 transmembrane receptor (rhodopsin family)
295831	7 transmembrane receptor (rhodopsin family)
295832	7 transmembrane receptor (rhodopsin family)
295833	7 transmembrane receptor (rhodopsin family)
295834	7 transmembrane receptor (rhodopsin family)
295835	7 transmembrane receptor (rhodopsin family)
295836	7 transmembrane receptor (rhodopsin family)
295837	7 transmembrane receptor (rhodopsin family)
295838	7 transmembrane receptor (rhodopsin family)
295839	7 transmembrane receptor (rhodopsin family)
295840	7 transmembrane receptor (rhodopsin family)
295841	7 transmembrane receptor (rhodopsin family)
295842	7 transmembrane receptor (rhodopsin family)
295843	7 transmembrane receptor (rhodopsin family)
295844	7 transmembrane receptor (rhodopsin family)
295845	7 transmembrane receptor (rhodopsin family)
295848	7 transmembrane receptor (rhodopsin family)
295849	7 transmembrane receptor (rhodopsin family)
295850	7 transmembrane receptor (rhodopsin family)
295851	7 transmembrane receptor (rhodopsin family)
295852	7 transmembrane receptor (rhodopsin family)
295852	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
295853	7 transmembrane receptor (rhodopsin family)
295854	7 transmembrane receptor (rhodopsin family)
295855	7 transmembrane receptor (rhodopsin family)
295856	7 transmembrane receptor (rhodopsin family)
295857	7 transmembrane receptor (rhodopsin family)
295858	7 transmembrane receptor (rhodopsin family)
295859	7 transmembrane receptor (rhodopsin family)
295860	7 transmembrane receptor (rhodopsin family)
295861	7 transmembrane receptor (rhodopsin family)
295862	7 transmembrane receptor (rhodopsin family)
295863	7 transmembrane receptor (rhodopsin family)
295864	7 transmembrane receptor (rhodopsin family)
295865	7 transmembrane receptor (rhodopsin family)
295866	7 transmembrane receptor (rhodopsin family)
295867	7 transmembrane receptor (rhodopsin family)
295868	7 transmembrane receptor (rhodopsin family)
295869	7 transmembrane receptor (rhodopsin family)
295870	7 transmembrane receptor (rhodopsin family)
295871	7 transmembrane receptor (rhodopsin family)
295872	7 transmembrane receptor (rhodopsin family)
295873	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
295874	7 transmembrane receptor (rhodopsin family)
295875	7 transmembrane receptor (rhodopsin family)
295876	7 transmembrane receptor (rhodopsin family)
295877	7 transmembrane receptor (rhodopsin family)
295878	7 transmembrane receptor (rhodopsin family)
295878	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
295879	7 transmembrane receptor (rhodopsin family)
295880	7 transmembrane receptor (rhodopsin family)
295881	7 transmembrane receptor (rhodopsin family)
295882	7 transmembrane receptor (rhodopsin family)
295883	7 transmembrane receptor (rhodopsin family)
295884	7 transmembrane receptor (rhodopsin family)
295885	7 transmembrane receptor (rhodopsin family)
295886	7 transmembrane receptor (rhodopsin family)
295887	7 transmembrane receptor (rhodopsin family)
295889	7 transmembrane receptor (rhodopsin family)
295890	7 transmembrane receptor (rhodopsin family)
295891	7 transmembrane receptor (rhodopsin family)
295892	7 transmembrane receptor (rhodopsin family)
295893	7 transmembrane receptor (rhodopsin family)
295894	7 transmembrane receptor (rhodopsin family)
295895	7 transmembrane receptor (rhodopsin family)
295896	7 transmembrane receptor (rhodopsin family)
295897	7 transmembrane receptor (rhodopsin family)
295898	7 transmembrane receptor (rhodopsin family)
295900	7 transmembrane receptor (rhodopsin family)
295901	7 transmembrane receptor (rhodopsin family)
295902	7 transmembrane receptor (rhodopsin family)
295903	7 transmembrane receptor (rhodopsin family)
295904	7 transmembrane receptor (rhodopsin family)
295905	7 transmembrane receptor (rhodopsin family)
295906	7 transmembrane receptor (rhodopsin family)
295907	7 transmembrane receptor (rhodopsin family)
295908	7 transmembrane receptor (rhodopsin family)
295909	7 transmembrane receptor (rhodopsin family)
295910	7 transmembrane receptor (rhodopsin family)
295911	7 transmembrane receptor (rhodopsin family)
295912	7 transmembrane receptor (rhodopsin family)
295913	7 transmembrane receptor (rhodopsin family)
295914	7 transmembrane receptor (rhodopsin family)
295915	7 transmembrane receptor (rhodopsin family)
295916	7 transmembrane receptor (rhodopsin family)
295917	7 transmembrane receptor (rhodopsin family)
295918	7 transmembrane receptor (rhodopsin family)
295919	7 transmembrane receptor (rhodopsin family)
295920	7 transmembrane receptor (rhodopsin family)
295921	7 transmembrane receptor (rhodopsin family)
295922	Mitochondrial carrier protein
295923	Respiratory-chain NADH dehydrogenase, 30 Kd subunit
295924	Elongation factor 1 gamma, conserved domain
295924	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
295925	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
295928	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
295929	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
295934	Sulfotransferase protein
295935	Cyclophilin type peptidyl-prolyl cis-trans isomerase
295938	OAR domain
295939	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
295939	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
295941	Ribosomal protein L21e
295942	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps
295943	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
295944	Hsp90 protein
295946	Hsp90 protein
295949	Ribosomal L15
295950	Enolase, C-terminal TIM barrel domain
295951	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
295953	Recombination activating protein 2. V-D-J recombination is the combinatorial process by which the huge range of immunoglobulin and T cell binding specificity is generated from a limited amount of genetic material. This process is synergistically activated
295954	Zinc finger, C3HC4 type (RING finger)
295958	Ribosomal protein L21e
295961	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
295964	Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function
295965	Ets-domain
295966	Ets-domain
295966	Sterile alpha motif (SAM)/Pointed domain
295969	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
295972	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy
295973	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
295974	Ribosomal protein L35Ae
295975	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
295977	Ribosomal protein L6e
295978	4-hydroxybenzoyl-CoA thioesterase. This enzyme (EC 3.1.2.23) catalyses the final step in the biosynthesis of 4-hydroxybenzoate from 4-chlorobenzoate in the soil dwelling microbe Pseudomonas CBS-3
295979	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
295980	Doublecortin
295981	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
295984	Ribosomal protein S6e
295985	MraW methylase family. Members of this family are probably SAM dependent methyltransferases. This family appears to be related to pfam01596
295986	SH2 domain
295987	MraW methylase family. Members of this family are probably SAM dependent methyltransferases. This family appears to be related to pfam01596
295988	Kinesin motor domain
295989	S-adenosyl-L-homocysteine hydrolase
295990	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
295990	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
295991	7 transmembrane receptor (rhodopsin family)
295993	Mitochondrial carrier protein
295996	Ribosomal protein L19e
295999	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
296000	7 transmembrane receptor (rhodopsin family)
296001	7 transmembrane receptor (rhodopsin family)
296002	7 transmembrane receptor (rhodopsin family)
296003	7 transmembrane receptor (rhodopsin family)
296004	7 transmembrane receptor (rhodopsin family)
296005	7 transmembrane receptor (rhodopsin family)
296007	Ribosomal protein S8e
296008	7 transmembrane receptor (rhodopsin family)
296009	7 transmembrane receptor (rhodopsin family)
296010	7 transmembrane receptor (rhodopsin family)
296011	7 transmembrane receptor (rhodopsin family)
296012	7 transmembrane receptor (rhodopsin family)
296013	7 transmembrane receptor (rhodopsin family)
296014	7 transmembrane receptor (rhodopsin family)
296015	7 transmembrane receptor (rhodopsin family)
296016	7 transmembrane receptor (rhodopsin family)
296017	7 transmembrane receptor (rhodopsin family)
296018	7 transmembrane receptor (rhodopsin family)
296019	7 transmembrane receptor (rhodopsin family)
296020	7 transmembrane receptor (rhodopsin family)
296021	7 transmembrane receptor (rhodopsin family)
296022	7 transmembrane receptor (rhodopsin family)
296023	7 transmembrane receptor (rhodopsin family)
296024	7 transmembrane receptor (rhodopsin family)
296025	7 transmembrane receptor (rhodopsin family)
296026	7 transmembrane receptor (rhodopsin family)
296027	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296028	7 transmembrane receptor (rhodopsin family)
296029	7 transmembrane receptor (rhodopsin family)
296031	7 transmembrane receptor (rhodopsin family)
296032	7 transmembrane receptor (rhodopsin family)
296033	7 transmembrane receptor (rhodopsin family)
296034	7 transmembrane receptor (rhodopsin family)
296035	7 transmembrane receptor (rhodopsin family)
296037	7 transmembrane receptor (rhodopsin family)
296038	7 transmembrane receptor (rhodopsin family)
296039	7 transmembrane receptor (rhodopsin family)
296040	7 transmembrane receptor (rhodopsin family)
296041	7 transmembrane receptor (rhodopsin family)
296042	7 transmembrane receptor (rhodopsin family)
296043	7 transmembrane receptor (rhodopsin family)
296044	7 transmembrane receptor (rhodopsin family)
296045	7 transmembrane receptor (rhodopsin family)
296046	7 transmembrane receptor (rhodopsin family)
296047	7 transmembrane receptor (rhodopsin family)
296054	Ribosomal protein L35Ae
296056	Zinc-binding dehydrogenase
296057	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
296059	Formin Homology 2 Domain
296060	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
296061	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
296063	Ribosomal protein L13e
296064	Actin
296070	Ribosomal L39 protein
296071	Enolase, N-terminal domain
296071	Enolase, C-terminal TIM barrel domain
296072	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl
296075	Protein kinase domain
296076	Signal recognition particle 14kD protein. The signal recognition particle (SRP) is a multimeric protein involved in targeting secretory proteins to the rough endoplasmic reticulum membrane. SRP14 and SRP9 form a complex essential for SRP RNA binding
296077	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
296078	Protein kinase domain
296078	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
296079	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
296082	Helix-hairpin-helix motif
296082	recA bacterial DNA recombination protein
296083	Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain
296084	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
296088	Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5
296088	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
296091	C2 domain
296091	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop
296094	Putative zinc finger in N-recognin
296096	Transglutaminase family, C-terminal ig like domain
296099	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
296102	Occludin/ELL family
296104	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
296106	Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration
296111	Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold
296111	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
296112	S-adenosyl-L-homocysteine hydrolase
296116	GHMP kinases putative ATP-binding protein
296117	GHMP kinases putative ATP-binding protein
296119	DTW domain. This presumed domain is found in bacterial and eukaryotic proteins. Its function is unknown. The domain contains multiple conserved motifs including a DTXW motif that this domain has been named after
296121	Ubiquitin carboxyl-terminal hydrolase family 2
296122	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
296124	Ribosomal protein S11
296126	pfam02889, Sec63, Sec63 domain
296129	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family
296133	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296134	Ribosomal protein S5, C-terminal domain
296138	Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold
296139	Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA
296139	Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle
296140	Ribosomal protein S17
296143	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
296143	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296150	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
296152	Transglutaminase family
296152	Transglutaminase family, C-terminal ig like domain
296152	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
296156	Zinc carboxypeptidase
296157	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296165	Ribosomal protein L11, RNA binding domain
296165	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
296167	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isoform
296169	Proteasome A-type and B-type
296171	Ribosomal S3Ae family
296172	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
296173	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
296175	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
296176	HMG (high mobility group) box
296182	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
296186	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
296186	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296190	ubiE/COQ5 methyltransferase family
296190	Putative methyltransferase. Members of this family of hypothetical plant proteins are probably methyltransferases: several of the aligned sequences either match methyltransferase profiles, or contain a SAM-binding motif. One member contains both
296190	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in comple
296190	SAM dependent carboxyl methyltransferase. This family of plant methyltransferases contains enzymes that act on a variety of substrates including salicylic acid, jasmonic acid and 7-Methylxanthine. Caffeine is synthesized through sequential three-step meth
296196	Ribosomal protein S6e
296197	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
296198	Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration
296203	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
296204	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
296204	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296206	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24
296209	Ribosomal L10
296210	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
296215	Eukaryotic ribosomal protein L18
296217	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
296218	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
296219	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
296220	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
296221	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
296222	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
296224	Translation initiation factor SUI1
296225	Translation initiation factor SUI1
296227	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
296228	Peptidase family M49
296231	Translation initiation factor SUI1
296232	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
296234	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
296235	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
296237	Ribosomal protein L13e
296240	Apoptosis regulator proteins, Bcl-2 family
296244	GHMP kinases putative ATP-binding protein
296245	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
296246	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296247	TEA/ATTS domain family
296248	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296249	Core histone H2A/H2B/H3/H4
296250	Protein kinase domain
296252	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
296255	GHMP kinases putative ATP-binding protein
296256	Protein kinase domain
296257	Ribosomal protein S2
296257	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
296258	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
296260	Ribosomal protein S19e
296261	AMP-binding enzyme
296262	Ribosomal protein S2
296265	Protein kinase domain
296266	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
296266	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
296266	Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has
296269	Fibrinogen beta and gamma chains, C-terminal globular domain
296272	Helix-loop-helix DNA-binding domain
296276	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
296279	Endomembrane protein 70
296280	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
296282	Ribosomal protein L15
296283	Ribosomal protein L21e
296286	Enolase, C-terminal TIM barrel domain
296287	EB1-like C-terminal motif. This motif is found at the C-terminus of proteins that are related to the EB1 protein. The EB1 proteins contain an N-terminal CH domain pfam00307. The human EB1 protein was originally discovered as a protein interacting with the
296290	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
296292	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
296296	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
296297	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
296298	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
296299	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
296299	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
296299	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
296301	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
296302	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
296304	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
296305	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
296305	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
296307	C2 domain
296310	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
296313	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
296314	Homeobox domain
296316	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
296317	Guanylate-kinase-associated protein (GKAP) protein
296318	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to
296321	LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
296321	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
296323	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
296325	Ribosomal protein L13e
296326	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
296331	Protein-tyrosine phosphatase
296332	Protein-tyrosine phosphatase
296333	Protein-tyrosine phosphatase
296336	Fibronectin type III domain
296339	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
296341	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
296341	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
296345	MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000
296349	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
296349	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
296350	TMS membrane protein/tumour differentially expressed protein (TDE)
296351	Poly-adenylate binding protein, unique domain
296351	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
296356	WAP-type (Whey Acidic Protein) 'four-disulfide core'
296357	WAP-type (Whey Acidic Protein) 'four-disulfide core'
296358	von Willebrand factor type A domain
296360	Gpi16 subunit, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to new
296362	Adenylate kinase
296365	Ribosomal protein L13e
296368	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
296369	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
296370	Serine carboxypeptidase
296371	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
296377	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
296381	CAS/CSE protein, C-terminus. Mammalian cellular apoptosis susceptibility (CAS) proteins are homologous to the yeast chromosome-segregation protein, CSE1. This family aligns the C-terminal halves (approximately). CAS is involved in both cellular apoptosis
296385	pfam02939, UcrQ, UcrQ family. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This family represents the 9.5 kDa subunit of the complex
296386	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
296389	Eukaryotic ribosomal protein L18
296390	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
296394	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
296395	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
296395	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
296401	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
296402	PTB domain (IRS-1 type)
296406	Ribosomal protein L15
296407	Arp2/3 complex, 34kD subunit p34-Arc. Arp2/3 protein complex has been implicated in the control of actin polymerization in cells. The human complex consists of seven subunits which include the actin related Arp2 and Arp3, and five others referred to as p4
296409	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
296412	Transcription factor AP-2
296413	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
296416	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
296417	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
296420	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
296420	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
296421	Ubiquitin carboxyl-terminal hydrolase, family 1
296423	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296424	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296425	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296426	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296427	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296428	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296429	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296431	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296432	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296433	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296434	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296435	Ribosomal protein L24e
296436	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296437	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296438	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296440	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296441	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296444	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296445	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
296446	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296447	Stathmin family
296448	Ribosomal protein S19
296451	Ribosomal protein L19e
296453	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
296453	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296455	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
296461	Oxysterol-binding protein
296462	GTP1/OBG family
296462	GTPase of unknown function
296462	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
296467	YT521-B-like family. This family of poorly characterised proteins contains YT521-B, a putative splicing factor from Rat. YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment f
296471	Protein kinase domain
296472	SH2 domain
296472	Protein kinase domain
296472	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
296473	Ribosomal protein L35Ae
296474	RNB-like protein. The function of this region of similarity is uncertain
296478	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
296480	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction a
296485	RNA polymerases M/15 Kd subunit
296485	Transcription factor S-II (TFIIS)
296489	Ribosomal protein L6
296492	Enolase, N-terminal domain
296492	Enolase, C-terminal TIM barrel domain
296492	Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain
296497	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
296498	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
296499	Homeobox domain
296502	Ribosomal protein L21e
296503	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
296503	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
296504	Glycosyltransferase family 6
296505	Cadherin domain
296506	Cytochrome c oxidase subunit Va. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit Va
296508	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
296510	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
296511	OAR domain
296511	Homeobox domain
296513	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
296515	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
296516	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
296517	Helix-loop-helix DNA-binding domain
296518	ENV polyprotein (coat polyprotein)
296519	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
296521	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296522	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296524	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
296525	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
296526	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
296526	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
296532	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
296532	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
296533	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
296534	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
296536	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
296536	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296537	Transcription factor TFIID (or TATA-binding protein, TBP)
296539	Transcription factor S-II (TFIIS)
296540	Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18
296541	Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18
296542	Ribosomal protein L23
296545	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296546	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296547	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296548	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296553	Ribosomal protein L14p/L23e
296554	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
296554	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
296558	Cullin family
296559	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
296560	UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate urid
296563	Ribosomal L28e protein family
296564	Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10
296566	Taurine catabolism dioxygenase TauD, TfdA family. This family consists of taurine catabolism dioxygenases of the TauD, TfdA family. TauD from E. coli is a alpha-ketoglutarate-dependent taurine dioxygenase. This enzyme catalyses the oxygenolytic release of
296568	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296570	Helix-turn-helix. This large family of DNA binding helix-turn helix proteins includes Cro and CI
296573	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296574	Eukaryotic initiation factor 4E
296574	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296575	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296576	Eukaryotic initiation factor 4E
296576	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296577	Glycosyltransferase family 6
296578	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296581	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
296582	Ribosomal protein S5, C-terminal domain
296582	Ribosomal protein S5, N-terminal domain
296586	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
296588	Insulinase (Peptidase family M16)
296592	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296593	Aminotransferase class I and II
296594	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296595	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296598	SURF4 family
296603	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
296605	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
296614	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
296619	Protein kinase domain
296620	DNA/RNA non-specific endonuclease
296624	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
296625	Choline/Carnitine o-acyltransferase
296625	Major royal jelly protein. Royal jelly is the food of queen bee larvae, and is responsible for the high reproductive ability of the queen. Major royal jelly proteins make up around 90% of larval jelly proteins. This family also the sequence-related yellow
296627	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
296630	Cyclophilin type peptidyl-prolyl cis-trans isomerase
296635	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
296641	Peptidyl-tRNA hydrolase
296644	Ribosomal protein L11, RNA binding domain
296644	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
296645	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
296646	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
296646	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
296647	Ribosomal L39 protein
296654	Gelsolin repeat
296655	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
296656	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
296657	Glycosyltransferase family 6
296659	NAC domain
296660	7 transmembrane receptor (rhodopsin family)
296661	7 transmembrane receptor (rhodopsin family)
296662	7 transmembrane receptor (rhodopsin family)
296663	7 transmembrane receptor (rhodopsin family)
296664	7 transmembrane receptor (rhodopsin family)
296665	7 transmembrane receptor (rhodopsin family)
296666	7 transmembrane receptor (rhodopsin family)
296667	7 transmembrane receptor (rhodopsin family)
296669	7 transmembrane receptor (rhodopsin family)
296670	7 transmembrane receptor (rhodopsin family)
296672	7 transmembrane receptor (rhodopsin family)
296673	7 transmembrane receptor (rhodopsin family)
296674	7 transmembrane receptor (rhodopsin family)
296675	7 transmembrane receptor (rhodopsin family)
296676	7 transmembrane receptor (rhodopsin family)
296677	7 transmembrane receptor (rhodopsin family)
296678	7 transmembrane receptor (rhodopsin family)
296679	7 transmembrane receptor (rhodopsin family)
296680	7 transmembrane receptor (rhodopsin family)
296681	7 transmembrane receptor (rhodopsin family)
296682	7 transmembrane receptor (rhodopsin family)
296683	7 transmembrane receptor (rhodopsin family)
296684	7 transmembrane receptor (rhodopsin family)
296685	7 transmembrane receptor (rhodopsin family)
296686	7 transmembrane receptor (rhodopsin family)
296687	7 transmembrane receptor (rhodopsin family)
296688	7 transmembrane receptor (rhodopsin family)
296688	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
296689	7 transmembrane receptor (rhodopsin family)
296690	7 transmembrane receptor (rhodopsin family)
296691	7 transmembrane receptor (rhodopsin family)
296692	7 transmembrane receptor (rhodopsin family)
296693	7 transmembrane receptor (rhodopsin family)
296694	7 transmembrane receptor (rhodopsin family)
296695	Nucleoside diphosphate kinase
296697	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
296698	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
296699	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
296702	Nucleoside diphosphate kinase
296703	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
296704	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
296706	Homeobox domain
296706	LIM domain. This family represents two copies of the LIM structural domain
296707	7 transmembrane receptor (Secretin family)
296708	Olfactomedin-like domain
296711	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
296712	ENV polyprotein (coat polyprotein)
296713	Ribosomal L10
296714	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
296717	Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine
296717	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296717	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
296718	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
296721	HMG (high mobility group) box
296723	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
296724	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
296730	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
296734	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
296734	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
296735	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
296735	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
296739	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
296744	Autophagy protein Apg9. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg9 plays a direct role in the formation of the cytoplasm to vacuole targetin
296748	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
296752	Ribosomal protein L21e
296754	Macrophage migration inhibitory factor (MIF)
296764	Guanylate kinase
296767	Intermediate filament protein
296767	Intermediate filament tail domain
296768	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
296770	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
296772	ENV polyprotein (coat polyprotein)
296773	Ribosomal protein L19e
296774	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
296775	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
296786	CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion
296787	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
296788	Cadherin domain
296789	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
296790	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
296791	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
296791	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296804	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
296806	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296807	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
296813	Mitochondrial carrier protein
296817	RanBP1 domain
296821	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
296822	Proteasome A-type and B-type
296823	HMG (high mobility group) box
296826	Protein kinase domain
296827	Protein kinase domain
296838	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
296846	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
296849	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
296851	Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. H
296852	Gelsolin repeat
296852	Villin headpiece domain
296853	Ribosomal L29e protein family
296854	Ribosomal L29e protein family
296855	Mitochondrial carrier protein
296856	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
296857	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
296857	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296858	Ribosomal L29e protein family
296859	Lyase
296860	Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration
296862	Guanylate-kinase-associated protein (GKAP) protein
296866	7 transmembrane receptor (rhodopsin family)
296867	7 transmembrane receptor (rhodopsin family)
296868	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
296868	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296869	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
296869	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296870	Ribosomal protein L34e
296874	7 transmembrane receptor (Secretin family)
296874	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296875	Enolase, N-terminal domain
296875	Enolase, C-terminal TIM barrel domain
296876	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
296877	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
296878	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
296879	Ribosomal protein S2
296880	Ribosomal protein L31e
296882	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
296882	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
296886	Actin
296889	HMG14 and HMG17
296891	GTPase of unknown function
296891	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
296891	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
296893	Stathmin family
296894	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
296896	Ribosomal protein S5, N-terminal domain
296897	Thrombospondin type 1 domain
296898	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
296903	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
296904	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
296905	Ribosomal protein L21e
296907	LIM domain. This family represents two copies of the LIM structural domain
296910	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
296911	ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain
296912	wnt family
296913	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
296915	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
296918	Nucleoside diphosphate kinase
296919	Tetraspanin family
296923	Translationally controlled tumor protein
296925	Saccharopine dehydrogenase. This family comprised of three structural domains that can not be separated in the linear sequence. In some organisms this enzyme is found as a bifunctional polypeptide with lysine ketoglutarate reductase (PF). The saccharopine
296926	Ribosomal protein S5, C-terminal domain
296930	Zinc finger, C3HC4 type (RING finger)
296930	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
296932	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
296936	WH2 motif. The WH2 motif (for Wiskott Aldrich syndrome homology region 2) has been shown in WASP and Scar1 (mammalian homolog) to interact via their WH2 motifs with actin
296937	Hyaluronidase
296938	Hyaluronidase
296939	Hyaluronidase
296940	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
296944	Thymidine kinase
296948	Tissue inhibitor of metalloproteinase. Members of this family are common in extracellular regions of vertebrate species
296949	Ribosomal protein L35Ae
296951	Filamin/ABP280 repeat
296955	Phosphoglycerate kinase
296956	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
296959	Zinc carboxypeptidase
296960	Zinc carboxypeptidase
296960	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol
296964	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
296967	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
296968	Guanylate kinase
296968	Minor capsid protein VI. This minor capsid protein may act as a link between the external capsid and the internal DNA-protein core. The C-terminal 11 residues may function as a protease cofactor leading to enzyme activation
296970	Phosphopantetheine attachment site. A 4'-phosphopantetheine prosthetic group is attached through a serine. This prosthetic group acts as a a 'swinging arm' for the attachment of activated fatty acid and amino-acid groups. This domain forms a four helix bu
296972	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
296973	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
296980	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
296981	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
296985	Lectin C-type domain. This family includes both long and short form C-type
296987	HMG (high mobility group) box
296991	Protein kinase domain
296993	Protein kinase domain
296996	Protein kinase domain
296998	7 transmembrane receptor (rhodopsin family)
296999	7 transmembrane receptor (rhodopsin family)
297000	Lectin C-type domain. This family includes both long and short form C-type
297002	Trypsin
297003	Trypsin
297004	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297005	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297009	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297010	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297012	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297015	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297016	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297017	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297019	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297020	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297022	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297025	Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be
297027	7 transmembrane receptor (rhodopsin family)
297029	2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways . This family also contains members with functions other than HC
297029	2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways . This family also contains members with functions other than HCC
297031	LIM domain. This family represents two copies of the LIM structural domain
297034	7 transmembrane receptor (rhodopsin family)
297035	7 transmembrane receptor (rhodopsin family)
297036	7 transmembrane receptor (rhodopsin family)
297037	7 transmembrane receptor (rhodopsin family)
297039	7 transmembrane receptor (rhodopsin family)
297040	7 transmembrane receptor (rhodopsin family)
297041	7 transmembrane receptor (rhodopsin family)
297042	7 transmembrane receptor (rhodopsin family)
297043	7 transmembrane receptor (rhodopsin family)
297044	7 transmembrane receptor (rhodopsin family)
297045	7 transmembrane receptor (rhodopsin family)
297046	7 transmembrane receptor (rhodopsin family)
297047	7 transmembrane receptor (rhodopsin family)
297048	7 transmembrane receptor (rhodopsin family)
297049	7 transmembrane receptor (rhodopsin family)
297050	7 transmembrane receptor (rhodopsin family)
297051	7 transmembrane receptor (rhodopsin family)
297053	Homeobox domain
297054	Calreticulin family
297056	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
297058	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
297063	Ribosomal protein L44
297064	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
297066	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
297066	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
297068	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297069	von Willebrand factor type D domain
297072	Thrombospondin type 1 domain
297072	von Willebrand factor type D domain
297072	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
297076	GTPase of unknown function
297076	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
297079	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
297081	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge
297082	Domain of unknown function DUF143. This domain has no known function nor do any of the proteins that possess it. The aligned region is approximately 100 amino acids long
297083	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
297084	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
297086	Ribosomal protein S17
297087	Protein kinase domain
297088	Animal haem peroxidase
297088	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
297089	Ribosomal protein L31e
297090	Protein kinase domain
297093	'chromo' (CHRromatin Organization MOdifier) domain
297093	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
297094	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
297095	MAS20 protein import receptor
297096	PX domain. PX domains bind to phosphoinositides
297097	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297098	Ribosomal protein L31e
297099	Homeobox domain
297100	Homeobox domain
297102	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
297103	ATP:guanido phosphotransferase, N-terminal domain. The N-terminal domain has an all-alpha fold
297103	ATP:guanido phosphotransferase, C-terminal catalytic domain. The substrate binding site is located in the cleft between N and C-terminal domains, but most of the catalytic residues are found in the larger C-terminal domain
297112	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
297112	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
297113	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
297114	NNMT/PNMT/TEMT family
297116	GrpE
297117	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
297118	Ribosomal protein S2
297119	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297120	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297121	7 transmembrane receptor (rhodopsin family)
297121	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297122	7 transmembrane receptor (rhodopsin family)
297122	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297123	FKBP-type peptidyl-prolyl cis-trans isomerase
297124	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297125	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297125	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297126	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297127	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297129	7 transmembrane receptor (rhodopsin family)
297129	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297130	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297131	7 transmembrane receptor (rhodopsin family)
297131	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297132	Hsp90 protein
297133	Hsp90 protein
297133	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297134	7 transmembrane receptor (rhodopsin family)
297134	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297135	Ribosomal protein S6e
297136	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297137	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297138	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
297139	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297140	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297141	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297142	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297143	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297144	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297145	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297146	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297147	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297150	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
297151	Ribosomal protein S7e
297153	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
297154	Cyclophilin type peptidyl-prolyl cis-trans isomerase
297155	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
297161	Enolase, N-terminal domain
297161	Enolase, C-terminal TIM barrel domain
297164	Ribosomal L29e protein family
297165	HMG (high mobility group) box
297167	Prothymosin/parathymosin family. Prothymosin alpha and parathymosin are two ubiquitous small acidic nuclear proteins that are thought to be involved in cell cycle progression, proliferation, and cell differentiation
297168	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
297170	7 transmembrane receptor (rhodopsin family)
297173	ADP-ribosylation factor family
297173	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
297174	'Cold-shock' DNA-binding domain
297175	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
297176	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
297177	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297178	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297180	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297182	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297183	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297185	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297187	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297189	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297189	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
297190	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297191	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297195	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
297197	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297202	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297203	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297204	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297207	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297208	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297209	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297211	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297213	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297216	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297217	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297218	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297219	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297220	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297221	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297224	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297225	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297226	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297228	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297229	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297230	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297231	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297232	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297234	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297235	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297236	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297239	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297240	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297243	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297247	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297248	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297249	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297254	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297256	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297258	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297259	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297260	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297261	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297262	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297263	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297264	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297268	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297270	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297272	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297274	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297279	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297280	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297281	Fibrillarin
297282	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297285	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297287	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297288	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297289	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
297290	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297291	Fibrillarin
297292	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297294	HMG (high mobility group) box
297294	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297300	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297304	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297305	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297306	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297307	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297308	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297309	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297311	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297312	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297314	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297316	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297317	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297318	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297321	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297322	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297323	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297324	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297325	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297326	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297328	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297330	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297331	Fork head domain
297336	Ubiquitin carboxyl-terminal hydrolase family 2
297337	Zinc finger, C3HC4 type (RING finger)
297338	SH2 domain
297339	Gelsolin repeat
297345	ENV polyprotein (coat polyprotein)
297346	7 transmembrane receptor (rhodopsin family)
297348	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
297352	HMG (high mobility group) box
297353	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
297354	Ribosomal protein L15
297356	Ribosomal protein S8
297357	Vinculin family
297358	Vinculin family
297359	Vinculin family
297360	Vinculin family
297362	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297363	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297363	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297365	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297366	Cyclophilin type peptidyl-prolyl cis-trans isomerase
297367	Sec1 family
297369	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
297369	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
297370	Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However
297371	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
297372	Ribosomal protein L19
297373	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
297374	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
297374	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
297376	Trypsin
297376	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
297377	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
297379	Homeobox domain
297386	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
297387	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc
297388	BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems
297389	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
297390	Ribosomal L29e protein family
297391	Lectin C-type domain. This family includes both long and short form C-type
297392	Protein of unknown function (DUF396). A family of conserved eukaryotic transmembrane proteins
297394	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
297395	Ribosomal protein S19
297399	Ribosomal protein L31e
297401	Protein kinase domain
297402	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
297404	Arginosuccinate synthase. This family contains a PP-loop motif
297405	Homeobox domain
297406	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
297413	Helix-loop-helix DNA-binding domain
297416	NifU-like domain. This is an alignment of the carboxy-terminal domain. This is the only common region between the NifU protein from nitrogen-fixing bacteria and rhodobacterial species. The biochemical function of NifU is unknown
297417	SIS domain. SIS (Sugar ISomerase) domains are found in many phosphosugar isomerases and phosphosugar binding proteins. SIS domains are also found in proteins that regulate the expression of genes involved in synthesis of phosphosugars. Presumably the SIS
297418	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
297419	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
297420	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
297421	Transforming growth factor beta like domain
297421	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
297423	PH domain. PH stands for pleckstrin homology
297424	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
297425	ADP-ribosylation factor family
297425	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
297428	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
297430	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
297432	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
297432	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
297437	Pyridine nucleotide-disulphide oxidoreductase, dimerisation domain. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases
297438	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297439	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297440	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297441	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297442	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297443	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
297448	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl
297449	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
297452	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
297452	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
297455	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
297460	Ribosomal protein L10
297463	Guanylate kinase
297464	Dynein light chain type 1
297468	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
297469	Proteasome A-type and B-type
297470	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
297471	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
297472	ATP-grasp domain. This family does not contain all known ATP-grasp domain members. This family includes a diverse set of enzymes that possess ATP-dependent carboxylate-amine ligase activity
297472	CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP
297473	NOL1/NOP2/sun family
297475	Ribosomal protein L35Ae
297478	Ribosomal protein L23
297479	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297479	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297480	Fork head domain
297481	Eukaryotic initiation factor 4E
297484	Actin
297487	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
297488	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
297489	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
297489	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
297491	Ribosomal protein L13e
297493	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
297494	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
297494	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
297495	Ribosomal protein L13e
297498	ATP-dependent protease La (LON) domain
297499	Ribosomal protein S7e
297500	Actin
297503	ADP-ribosylation factor family
297505	ENV polyprotein (coat polyprotein)
297506	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
297507	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
297513	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
297521	Death domain
297522	WD domain, G-beta repeat
297528	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
297532	Ribosomal protein L6
297534	Ribosomal protein L23
297536	7 transmembrane receptor (rhodopsin family)
297537	7 transmembrane receptor (rhodopsin family)
297538	7 transmembrane receptor (rhodopsin family)
297539	Hsp90 protein
297540	7 transmembrane receptor (rhodopsin family)
297542	7 transmembrane receptor (rhodopsin family)
297543	7 transmembrane receptor (rhodopsin family)
297544	7 transmembrane receptor (rhodopsin family)
297545	7 transmembrane receptor (rhodopsin family)
297546	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
297547	Ribosomal protein S6e
297548	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
297550	Enolase, N-terminal domain
297550	Enolase, C-terminal TIM barrel domain
297550	Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain
297553	ENV polyprotein (coat polyprotein)
297555	Cadherin domain
297555	Protein kinase domain
297556	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
297557	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
297558	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
297560	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
297563	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
297565	Ribosomal protein L21e
297566	ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit
297567	Eukaryotic porin
297568	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
297569	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
297571	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
297572	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
297574	Ribosomal protein S26e
297577	Transforming growth factor beta like domain
297577	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
297580	Homeobox domain
297581	Ribosomal L29e protein family
297583	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
297584	Lectin C-type domain. This family includes both long and short form C-type
297585	Lectin C-type domain. This family includes both long and short form C-type
297586	Lectin C-type domain. This family includes both long and short form C-type
297587	Lectin C-type domain. This family includes both long and short form C-type
297588	7 transmembrane receptor (metabotropic glutamate family)
297588	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
297589	7 transmembrane receptor (metabotropic glutamate family)
297590	7 transmembrane receptor (metabotropic glutamate family)
297597	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
297598	7 transmembrane receptor (rhodopsin family)
297602	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
297603	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
297603	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
297605	Ribosomal protein L23
297606	Cyclophilin type peptidyl-prolyl cis-trans isomerase
297608	Ribosomal protein S5, C-terminal domain
297610	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
297611	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
297611	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
297614	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
297615	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
297616	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
297617	TEA/ATTS domain family
297617	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
297618	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
297619	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
297620	TEA/ATTS domain family
297621	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
297622	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
297623	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
297624	TEA/ATTS domain family
297624	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
297626	TEA/ATTS domain family
297629	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
297630	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297632	Lectin C-type domain. This family includes both long and short form C-type
297633	Lectin C-type domain. This family includes both long and short form C-type
297634	Lectin C-type domain. This family includes both long and short form C-type
297635	Lectin C-type domain. This family includes both long and short form C-type
297637	Lectin C-type domain. This family includes both long and short form C-type
297638	Ribosomal L29e protein family
297640	Lectin C-type domain. This family includes both long and short form C-type
297641	Lectin C-type domain. This family includes both long and short form C-type
297644	Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5
297644	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
297646	Lectin C-type domain. This family includes both long and short form C-type
297647	Lectin C-type domain. This family includes both long and short form C-type
297648	Lectin C-type domain. This family includes both long and short form C-type
297649	Lectin C-type domain. This family includes both long and short form C-type
297650	Lectin C-type domain. This family includes both long and short form C-type
297651	Lectin C-type domain. This family includes both long and short form C-type
297652	Lectin C-type domain. This family includes both long and short form C-type
297654	Lectin C-type domain. This family includes both long and short form C-type
297656	Lectin C-type domain. This family includes both long and short form C-type
297657	Lectin C-type domain. This family includes both long and short form C-type
297661	Lectin C-type domain. This family includes both long and short form C-type
297664	Lectin C-type domain. This family includes both long and short form C-type
297665	Lectin C-type domain. This family includes both long and short form C-type
297666	Lectin C-type domain. This family includes both long and short form C-type
297667	Lectin C-type domain. This family includes both long and short form C-type
297668	Lectin C-type domain. This family includes both long and short form C-type
297669	Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the C
297670	Protein kinase domain
297676	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
297677	S-adenosylmethionine synthetase, central domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold
297677	S-adenosylmethionine synthetase, N-terminal domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold
297679	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
297682	Protein-tyrosine phosphatase
297682	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
297683	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
297685	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
297690	Core histone H2A/H2B/H3/H4
297697	Core histone H2A/H2B/H3/H4
297699	WD domain, G-beta repeat
297700	Deoxyribose-phosphate aldolase. This family includes the enzyme deoxyribose-phosphate aldolase EC:4.1.2.4, which is involved in nucleotide metabolism. The family also includes a group of related bacterial proteins of unknown function
297701	TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway
297702	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
297703	C2 domain
297703	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
297703	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
297704	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
297706	Ribosomal protein L44
297706	Elongation factor 1 gamma, conserved domain
297707	ENV polyprotein (coat polyprotein)
297707	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
297708	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
297712	Ribosomal protein L44
297713	Phosphorylase family 2
297714	Ribosomal L28e protein family
297715	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
297719	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
297723	Ribosomal protein L31e
297731	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
297739	Ribosomal L10
297743	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
297752	Ribosomal family S4e
297755	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
297759	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
297762	ENV polyprotein (coat polyprotein)
297765	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
297767	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
297768	Metallo-beta-lactamase superfamily
297769	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
297769	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
297770	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297770	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297771	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
297771	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
297772	'Cold-shock' DNA-binding domain
297777	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297777	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297785	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297786	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297790	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
297791	Kinesin motor domain
297793	Aminotransferase class I and II
297795	7 transmembrane receptor (rhodopsin family)
297796	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
297797	V-ATPase subunit H
297798	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
297799	Ribosomal protein L15 amino terminal region. This family is always associated with pfam00256. This family is diagnostic of ribosomal L15 proteins
297802	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
297803	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
297805	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
297806	Protein-tyrosine phosphatase
297806	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
297807	Cyclophilin type peptidyl-prolyl cis-trans isomerase
297809	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297809	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297810	Clathrin adaptor complex small chain
297813	Translation initiation factor SUI1
297814	Eukaryotic-type carbonic anhydrase
297816	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
297819	ribosomal L5P family C-terminus. This region is found associated with pfam00281
297820	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
297821	Squalene/phytoene synthase
297826	Enolase, N-terminal domain
297826	Enolase, C-terminal TIM barrel domain
297827	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
297832	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
297835	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
297836	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
297837	AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-
297838	MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also i
297840	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
297844	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
297845	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
297848	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
297849	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
297851	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
297852	Hsp90 protein
297852	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
297853	HMG (high mobility group) box
297855	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
297855	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
297860	Eukaryotic porin
297863	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
297864	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
297864	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
297869	Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity
297880	Sushi domain (SCR repeat)
297881	Ribosomal protein S12
297885	Protein of unknown function DUF101. The members of this family are uncharacterised. The alignment of these proteins contains several conserved polar residues that might be potential catalytic residues
297886	Ribosomal L29e protein family
297888	Ribosomal protein L35Ae
297889	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
297891	Protein kinase domain
297892	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
297893	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
297894	von Willebrand factor type A domain
297897	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
297898	Ribosomal L10
297899	Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates
297899	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
297902	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
297902	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
297902	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
297907	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
297908	Ribosomal L29e protein family
297909	DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds
297909	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
297910	Sulfate transporter family. Mutations may lead to several human diseases
297911	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
297915	ENV polyprotein (coat polyprotein)
297916	Protein kinase domain
297918	Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13,
297918	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
297919	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
297920	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
297920	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
297922	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
297927	HMG (high mobility group) box
297930	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
297931	Ribosomal protein L13e
297932	ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae
297934	ICE-like protease (caspase) p10 domain
297934	ICE-like protease (caspase) p20 domain
297937	Hsp90 protein
297938	Hsp90 protein
297939	Ribosomal L29e protein family
297945	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297945	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297946	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297946	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297947	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
297948	Ribosomal protein L6
297949	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
297951	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
297952	7 transmembrane receptor (rhodopsin family)
297954	LIM domain. This family represents two copies of the LIM structural domain
297956	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297958	Uncharacterised protein family (UPF0131). This family of proteins are uncharacterised, however BtrG is part of a butirosin-biosynthetic gene cluster from Bacillus circulans
297966	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
297969	tRNA synthetases class I (R). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only arginyl tRNA synthetase
297970	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
297974	Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a
297974	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc
297976	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
297980	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
297981	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
297981	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
297982	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
297982	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
297986	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
297986	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
297987	Ribosomal protein S6e
297988	Trypsin
297992	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
297994	BAG domain. Domain present in Hsp70 regulators
297994	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
297996	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
297998	NUDIX domain
298003	Galactose-1-phosphate uridyl transferase, C-terminal domain. SCOP reports fold duplication with N-terminal domain. Both involved in Zn and Fe binding
298005	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
298006	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
298008	Eukaryotic ribosomal protein L18
298009	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298009	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298010	Choline/Carnitine o-acyltransferase
298011	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298011	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298012	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
298013	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
298014	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
298015	Ribosomal protein S2
298016	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298016	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298018	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
298019	Vinculin family
298021	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
298025	Sushi domain (SCR repeat)
298026	von Willebrand factor type A domain
298031	Sushi domain (SCR repeat)
298033	PX domain. PX domains bind to phosphoinositides
298036	Ribosomal protein L23
298038	7 transmembrane receptor (rhodopsin family)
298039	7 transmembrane receptor (rhodopsin family)
298040	7 transmembrane receptor (rhodopsin family)
298042	7 transmembrane receptor (rhodopsin family)
298043	7 transmembrane receptor (rhodopsin family)
298044	7 transmembrane receptor (rhodopsin family)
298045	7 transmembrane receptor (rhodopsin family)
298046	7 transmembrane receptor (rhodopsin family)
298047	7 transmembrane receptor (rhodopsin family)
298048	7 transmembrane receptor (rhodopsin family)
298049	7 transmembrane receptor (rhodopsin family)
298054	Iron/manganese superoxide dismutases, alpha-hairpin domain. superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the Mn/Fe-binding family is one
298054	Iron/manganese superoxide dismutases, C-terminal domain. superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the Mn/Fe-binding family is one. I
298057	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
298060	Fructose-bisphosphate aldolase class-I
298062	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
298063	7 transmembrane receptor (rhodopsin family)
298067	Proteasome A-type and B-type
298068	Sec61beta family. This family consists of homologues of Sec61beta - a component of the Sec61/SecYEG protein secretory system. The domain is found in eukaryotes and archaea and is possibly homologous to the bacterial SecG
298070	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
298071	Antifreeze-like domain. This family contains type III antifreeze proteins as well as a variety of enzymes. This domain is presumed to be involved in sugar binding in the enzyme proteins
298071	NeuB family. NeuB is the prokaryotic N-acetylneuraminic acid (Neu5Ac) synthase. It catalyses the direct formation of Neu5Ac (the most common sialic acid) by condensation of phosphoenolpyruvate (PEP) and N-acetylmannosamine (ManNAc). This reaction has only
298073	Zinc-binding dehydrogenase
298073	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
298074	XPA protein
298079	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
298080	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
298081	Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue
298082	Ribosomal protein L24e
298084	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
298085	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
298087	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298088	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298090	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
298091	Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hig
298095	WD domain, G-beta repeat
298097	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
298098	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
298099	Herpesvirus glycoprotein D. Herpesviruses are dsDNA viruses with no RNA stage. This is a family consists of glycoprotein-D (gD or gIV) which is common to herpes simplex virus types 1 and 2, as well as equine herpes, bovine herpes and Marek's disease virus
298101	Fibrillar collagen C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc
298103	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
298105	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
298107	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
298108	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298108	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
298109	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
298110	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298111	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
298111	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298112	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
298113	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298114	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
298115	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298115	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
298116	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
298117	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
298117	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
298118	Enolase, N-terminal domain
298118	Enolase, C-terminal TIM barrel domain
298123	Core histone H2A/H2B/H3/H4
298125	Ribosomal protein L13e
298126	Ribosomal protein L31e
298128	ENV polyprotein (coat polyprotein)
298135	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298136	Ribosomal protein L34e
298137	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
298138	DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh
298145	Eukaryotic initiation factor 4E
298148	Elongation factor 1 gamma, conserved domain
298148	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
298148	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
298152	Riboflavin kinase / FAD synthetase. This family consists part of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activitys. They catalyse the 5'-phos
298153	Ribosomal protein S2
298154	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
298155	GTP1/OBG family
298156	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
298158	WD domain, G-beta repeat
298159	WD domain, G-beta repeat
298161	HMG (high mobility group) box
298162	ENV polyprotein (coat polyprotein)
298163	ENV polyprotein (coat polyprotein)
298169	Actin
298171	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
298172	ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with other
298173	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298176	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
298177	Lyase
298180	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
298180	PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi
298180	TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s
298184	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
298187	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
298193	Cyclophilin type peptidyl-prolyl cis-trans isomerase
298196	Thrombospondin type 1 domain
298197	Thrombospondin type 1 domain
298197	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
298199	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
298200	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298200	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298203	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
298204	Ribosomal protein L21e
298205	Interferon alpha/beta domain
298206	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
298207	Interferon alpha/beta domain
298208	Interferon alpha/beta domain
298209	Interferon alpha/beta domain
298210	Interferon alpha/beta domain
298211	Interferon alpha/beta domain
298212	Interferon alpha/beta domain
298213	Interferon alpha/beta domain
298214	Interferon alpha/beta domain
298215	Interferon alpha/beta domain
298216	Interferon alpha/beta domain
298217	Interferon alpha/beta domain
298218	Ribosomal protein S24e
298219	Interferon alpha/beta domain
298220	Interferon alpha/beta domain
298221	Interferon alpha/beta domain
298222	Interferon alpha/beta domain
298223	Interferon alpha/beta domain
298224	Interferon alpha/beta domain
298225	Interferon alpha/beta domain
298226	Myristoyl-CoA:protein N-myristoyltransferase, N-terminal domain. The N and C-terminal domains of NMT are structurally similar, each adopting an acyl-CoA N-acyltransferase-like fold
298227	Phosphorylase family 2
298229	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
298230	Intermediate filament tail domain
298234	Eukaryotic porin
298237	Ribosomal protein S7e
298238	HMG (high mobility group) box
298239	Ribosomal protein S27
298241	Chaperonin 10 Kd subunit
298243	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
298245	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
298246	Ribosomal protein L19e
298247	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
298248	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
298248	Jun-like transcription factor. The c-Jun NH(2)-terminal kinase (JNK) is a member of an evolutionarily conserved sub-family of mitogen-activated protein (MAP) kinases
298249	FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain
298250	FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain
298252	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
298253	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
298254	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
298258	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298258	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298260	Ribosomal protein L21e
298262	Ribosomal protein L21e
298267	Cache domain
298268	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
298270	Ribosomal protein L6
298278	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
298279	Hsp90 protein
298284	Respiratory-chain NADH dehydrogenase, 49 Kd subunit
298285	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298285	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298288	Low-density lipoprotein receptor domain class A
298288	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl
298289	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
298293	Actin
298296	Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold cont
298297	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298297	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298302	START domain
298305	Oxidoreductase FAD-binding domain
298305	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
298305	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous
298306	CUB domain
298313	Mitochondrial carrier protein
298314	Mitochondrial carrier protein
298316	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
298316	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
298317	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
298318	Mitochondrial carrier protein
298319	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
298322	Zinc finger, C3HC4 type (RING finger)
298322	ATP-dependent protease La (LON) domain
298326	LIM domain. This family represents two copies of the LIM structural domain
298327	LIM domain. This family represents two copies of the LIM structural domain
298328	LIM domain. This family represents two copies of the LIM structural domain
298331	LIM domain. This family represents two copies of the LIM structural domain
298332	LIM domain. This family represents two copies of the LIM structural domain
298334	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
298336	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
298336	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
298338	Zn-finger in Ran binding protein and others
298339	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
298340	Sodium:neurotransmitter symporter family
298344	Homeobox domain
298345	Homeobox domain
298346	Homeobox domain
298347	Homeobox domain
298348	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
298349	BTG1 family. A novel family of anti-proliferative proteins
298350	CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond
298351	CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond
298352	Homeobox domain
298353	Homeobox domain
298358	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298358	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298359	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
298360	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
298361	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
298362	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
298367	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298367	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298369	Oxysterol-binding protein
298370	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
298371	NAC domain
298374	PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is about
298376	Glutathione peroxidase
298381	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
298385	Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the C
298386	Ribosomal protein S8
298392	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
298392	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
298393	7 transmembrane receptor (rhodopsin family)
298394	7 transmembrane receptor (rhodopsin family)
298395	7 transmembrane receptor (rhodopsin family)
298396	7 transmembrane receptor (rhodopsin family)
298397	7 transmembrane receptor (rhodopsin family)
298403	Tropomyosin
298408	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
298408	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
298410	Adenylate kinase
298410	Thymidylate kinase
298412	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
298414	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
298415	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
298417	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
298418	Core histone H2A/H2B/H3/H4
298419	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
298420	Core histone H2A/H2B/H3/H4
298421	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
298422	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
298423	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
298423	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
298424	Polyprenyl synthetase
298428	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
298429	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
298431	Tetraspanin family
298432	GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis of
298435	GNT-I family. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GNT-I, GLCNAC-T I) EC:2.4.1.101 transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide. This is an essential step in the synthesis of
298436	Tetraspanin family
298437	Ribosomal protein L21e
298439	SpoVR like protein. Bacillus subtilis stage V sporulation protein R is involved in spore cortex formation. Little is known about cortex biosynthesis, except that it depends on several sigma E controlled genes, including spoVR
298439	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
298444	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
298445	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
298448	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
298451	ATP synthase subunit C
298452	Putative diphthamide synthesis protein. One member is a candidate tumour suppressor gene. DPH2 from yeast, which confers resistance to diphtheria toxin has been found to be involved in diphthamide synthesis. Diphtheria toxin inhibits eukaryotic protein sy
298454	7 transmembrane receptor (metabotropic glutamate family)
298455	7 transmembrane receptor (metabotropic glutamate family)
298455	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
298457	7 transmembrane receptor (metabotropic glutamate family)
298457	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
298459	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
298461	Protein kinase domain
298463	Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes
298465	Ribosomal protein L21e
298468	7 transmembrane receptor (rhodopsin family)
298469	7 transmembrane receptor (rhodopsin family)
298470	7 transmembrane receptor (rhodopsin family)
298471	7 transmembrane receptor (rhodopsin family)
298472	7 transmembrane receptor (rhodopsin family)
298473	7 transmembrane receptor (rhodopsin family)
298474	7 transmembrane receptor (rhodopsin family)
298475	7 transmembrane receptor (rhodopsin family)
298476	7 transmembrane receptor (rhodopsin family)
298477	7 transmembrane receptor (rhodopsin family)
298478	7 transmembrane receptor (rhodopsin family)
298479	7 transmembrane receptor (rhodopsin family)
298480	7 transmembrane receptor (rhodopsin family)
298481	7 transmembrane receptor (rhodopsin family)
298482	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
298483	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
298485	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
298487	PMP-22/EMP/MP20/Claudin family
298488	Cyclophilin type peptidyl-prolyl cis-trans isomerase
298490	DNA / pantothenate metabolism flavoprotein. The DNA/pantothenate metabolism flavoprotein (EC:4.1.1.36) affects synthesis of DNA, and pantothenate metabolism
298491	CAP protein
298492	Ribosomal protein S5, N-terminal domain
298492	Ribosomal protein S5, C-terminal domain
298494	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
298495	Ribosomal protein L35Ae
298496	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
298496	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
298501	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
298503	Palmitoyl protein thioesterase
298506	Helix-loop-helix DNA-binding domain
298506	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invol
298508	Cyclophilin type peptidyl-prolyl cis-trans isomerase
298509	Ribosomal protein L21e
298510	Poly-adenylate binding protein, unique domain
298511	Growth-Arrest-Specific Protein 2 Domain
298512	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa
298517	Ribosomal protein S15
298519	14-3-3 protein
298519	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
298520	Hsp90 protein
298521	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
298521	Bacterial extracellular solute-binding proteins, family 3
298521	Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors
298521	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
298528	Protein kinase domain
298528	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
298532	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
298533	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
298535	Ribosomal L29e protein family
298541	Putative undecaprenyl diphosphate synthase. Previously known as uncharacterized protein family UPF0015, a single member of this family has been identified as an undecaprenyl diphosphate synthase
298542	'Cold-shock' DNA-binding domain
298550	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
298558	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298558	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
298559	Transcription factor S-II (TFIIS)
298561	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
298562	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298564	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
298566	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
298566	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
298567	Trypsin
298569	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
298570	Rap/ran-GAP
298570	LGN motif, putative GEF specific for G-alpha GTPase
298573	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
298574	Kinesin motor domain
298575	Protein kinase domain
298577	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
298578	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t
298579	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t
298582	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
298584	F-actin capping protein, beta subunit
298590	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
298595	Protein-arginine deiminase (PAD). Members of this family are found in mammals. In the presence of calcium ions, PAD enzymes EC:3.5.3.15 catalyse the post-translational modification reaction responsible for the formation of citrulline residues: Protein L-a
298606	Mitochondrial carrier protein
298607	Arginase family
298611	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
298614	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
298623	Carboxylesterase
298625	MtN3/saliva family. This family includes proteins such as drosophila saliva, MtN3 involved in root nodule development and a protein involved in activation and expression of recombination activation genes (RAGs). Although the molecular function of these pr
298627	Ribosomal protein L36e
298628	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298629	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298635	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298639	Protein kinase domain
298640	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298642	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
298644	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh
298648	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
298649	Ribosomal protein L36e
298650	NAD binding domain of 6-phosphogluconate dehydrogenase. The NAD binding domain of 6-phosphogluconate dehydrogenase adopts a Rossman fold
298653	Cytidylyltransferase. This family includes: Cholinephosphate cytidylyltransferase. Glycerol-3-phosphate cytidylyltransferase
298654	Mitochondrial carrier protein
298655	Glucose-6-phosphate dehydrogenase, C-terminal domain
298655	Glucose-6-phosphate dehydrogenase, NAD binding domain
298655	Glucosamine-6-phosphate isomerases/6-phosphogluconolactonase
298657	Eukaryotic-type carbonic anhydrase
298661	Ribosomal protein S7e
298671	Actin
298674	Zinc finger, C3HC4 type (RING finger)
298675	Rer1 family. RER1 family protein are involved in involved in the retrieval of some endoplasmic reticulum membrane proteins from the early golgi compartment. The C terminus of yeast Rer1p interacts with a coatomer complex
298679	Protein kinase domain
298682	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
298683	Fibronectin type III domain
298683	von Willebrand factor type A domain
298684	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
298685	Ribosomal protein L20
298686	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
298689	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
298690	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
298692	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
298693	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
298696	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
298696	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
298698	Fibrinogen beta and gamma chains, C-terminal globular domain
298701	SART-1 family. This family of proteins appear to contain a leucine zipper and may therefore be a family of transcription factors
298702	Marek's disease glycoprotein A
298702	Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signaling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, d
298703	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
298704	GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do
298706	Protein kinase domain
298707	Ribosomal protein S19
298709	Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes
298710	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
298713	Core histone H2A/H2B/H3/H4
298715	TRAF-type zinc finger
298717	Protein of unknown function (DUF409). Family of eukaryotic membrane proteins with unknown function
298719	WD domain, G-beta repeat
298724	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
298725	7 transmembrane receptor (rhodopsin family)
298726	KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif
298727	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298727	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298730	BTG1 family. A novel family of anti-proliferative proteins
298731	Glutamate/Leucine/Phenylalanine/Valine dehydrogenase
298733	7 transmembrane receptor (rhodopsin family)
298734	7 transmembrane receptor (rhodopsin family)
298735	Protein kinase domain
298735	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
298736	Protein kinase domain
298736	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
298739	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
298740	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
298742	Ribosomal protein S19
298744	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
298746	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
298748	Uncharacterized ACR, COG1565
298749	Ribosomal S17
298750	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298750	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298754	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298754	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298758	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
298760	Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions
298762	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
298764	Synaptophysin / synaptoporin
298766	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
298774	Cyclophilin type peptidyl-prolyl cis-trans isomerase
298777	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
298778	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
298778	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
298779	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
298783	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
298785	Ribosomal protein S26e
298786	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
298787	Protein of unknown function DUF52. This family contains members from all branches of life. The function of this protein is unknown
298788	HMG (high mobility group) box
298791	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
298792	Yippee putative zinc-binding protein
298793	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298793	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298795	14-3-3 protein
298798	Enolase, C-terminal TIM barrel domain
298801	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
298803	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an ess
298804	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an ess
298805	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
298807	Glutaredoxin
298810	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298811	Ribosomal protein L19e
298814	Glutaredoxin
298815	Ribosomal protein L19e
298816	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298817	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298818	Ribosomal protein L19e
298819	Ribosomal protein L19e
298820	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298821	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298822	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298823	Ribosomal protein L19e
298825	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298826	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298827	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298828	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298829	Ribosomal protein L19e
298830	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298832	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298834	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
298835	Ribosomal protein L19e
298839	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
298840	Aspartate/ornithine carbamoyltransferase, Asp/Orn binding domain
298842	WD domain, G-beta repeat
298848	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
298848	EB1-like C-terminal motif. This motif is found at the C-terminus of proteins that are related to the EB1 protein. The EB1 proteins contain an N-terminal CH domain pfam00307. The human EB1 protein was originally discovered as a protein interacting with the
298849	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
298855	Ribosomal protein L36e
298857	7 transmembrane receptor (Secretin family)
298857	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
298858	NAC domain
298859	ADP-ribosylation factor family
298859	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
298859	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
298862	Ribosomal L29e protein family
298863	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
298865	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
298866	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
298867	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
298867	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
298871	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
298873	Amino acid permease
298874	Pumilio-family RNA binding repeat. Puf repeats (aka PUM-HD, Pumilio homology domain) are necessary and sufficient for sequence specific RNA binding in fly Pumilio and worm FBF-1 and FBF-2. Both proteins function as translational repressors in early embryo
298875	Golgi 4-transmembrane spanning transporter
298880	Ribosomal protein S5, C-terminal domain
298882	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
298883	XPG I-region
298883	XPG N-terminal domain
298886	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
298887	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
298893	Ribosomal protein S8
298894	Helix-loop-helix DNA-binding domain
298894	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invol
298895	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
298896	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
298900	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
298901	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
298902	Uracil DNA glycosylase superfamily
298904	ribosomal L5P family C-terminus. This region is found associated with pfam00281
298908	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
298908	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
298912	Fibrinogen beta and gamma chains, C-terminal globular domain
298913	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
298917	Lipase/Acylhydrolase with GDSL-like motif
298919	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
298920	Ribonucleotide reductase, small chain
298921	Ribosomal L10
298921	Ribosomal protein L31e
298922	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298923	D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain
298924	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
298931	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
298932	Allantoicase repeat. This family is found in pairs in Allantoicases, forming the majority of the protein. These proteins allow the use of purines as secondary nitrogen sources in nitrogen-limiting conditions through the reaction: allantoate + H(2)0 = (-)-
298933	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
298934	ARD/ARD' family. The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+. ARD and ARD' can be e
298942	Pyridine nucleotide-disulphide oxidoreductase, dimerisation domain. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases
298942	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
298944	7 transmembrane receptor (rhodopsin family)
298946	Phosphatidylinositol 3- and 4-kinase
298947	Phosphatidylinositol 3- and 4-kinase
298947	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
298947	Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation
298947	PI3-kinase family, ras-binding domain. Certain members of the PI3K family possess Ras-binding domains in their N-termini. These regions show some similarity (although not highly significant similarity) to Ras-binding pfam00788 domains
298948	Eukaryotic-type carbonic anhydrase
298949	Synaptophysin / synaptoporin
298950	Helix-loop-helix DNA-binding domain
298951	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
298954	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
298956	PXA domain. This domain is associated with PX domains pfam00787
298957	Hsp90 protein
298958	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298958	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298960	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298962	Tetraspanin family
298965	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
298967	Chaperonin 10 Kd subunit
298968	Ribosomal protein S5, C-terminal domain
298968	Ribosomal protein S5, N-terminal domain
298972	HMG (high mobility group) box
298973	Ribosomal L29e protein family
298974	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
298976	Gelsolin repeat
298981	Homeobox domain
298984	Ribosomal protein L21e
298985	Ribosomal protein S26e
298987	Ribosomal protein L6
298988	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
298989	Enolase, N-terminal domain
298989	Enolase, C-terminal TIM barrel domain
298990	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298990	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298991	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298991	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
298992	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
298992	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
298993	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
298994	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
298994	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
298996	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
298997	ATP synthase subunit C
298999	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
298999	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299001	Ribosomal protein L31e
299002	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
299004	Clathrin adaptor complex small chain
299004	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
299006	Uncharacterized ACR, COG1490
299007	7 transmembrane receptor (rhodopsin family)
299009	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
299021	Ribosomal protein S19e
299022	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
299025	14-3-3 protein
299026	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
299028	Mitochondrial carrier protein
299029	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299029	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299031	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
299031	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
299032	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
299035	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal
299036	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
299037	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
299038	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
299039	Mitochondrial carrier protein
299040	Protein kinase domain
299041	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
299052	Fructose-bisphosphate aldolase class-I
299054	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
299055	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299055	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299056	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
299056	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
299058	Homeobox domain
299059	Proline dehydrogenase
299060	Insulinase (Peptidase family M16)
299061	Homeobox domain
299062	Ribosomal L15
299063	Ribosomal L15
299064	Homeobox domain
299068	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
299068	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
299069	PXA domain. This domain is associated with PX domains pfam00787
299070	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
299071	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
299073	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
299075	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
299076	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24
299079	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
299080	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299080	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299081	Hsp90 protein
299082	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
299083	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
299084	Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA
299087	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
299090	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1
299091	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299091	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299092	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
299094	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299094	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299094	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
299099	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
299100	Cyclophilin type peptidyl-prolyl cis-trans isomerase
299102	Ribosomal protein S6e
299103	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
299103	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
299104	FKBP-type peptidyl-prolyl cis-trans isomerase
299105	Eukaryotic porin
299106	Ribosomal L10
299106	Ribosomal protein L16
299107	Cyclophilin type peptidyl-prolyl cis-trans isomerase
299108	Ribosomal protein L5
299108	ribosomal L5P family C-terminus. This region is found associated with pfam00281
299110	Delta-aminolevulinic acid dehydratase
299111	Ribosomal protein S8e
299112	DNA polymerase epsilon subunit B. DNA polymerase epsilon is essential for cell viability and chromosomal DNA replication in budding yeast. In addition, DNA polymerase epsilon may be involved in DNA repair and cell-cycle checkpoint control. The enzyme cons
299116	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
299119	Domain of unknown function (DUF341)
299119	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
299125	Ribosomal protein S2
299126	Ribosomal L15
299128	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
299128	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
299131	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
299134	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
299135	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
299136	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
299138	Homeobox domain
299142	C2 domain
299144	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
299145	ADP-ribosylation factor family
299145	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
299146	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
299147	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
299148	Ribosomal S17
299149	Ribosomal protein L31e
299150	Ribosomal S17
299156	ADP-ribosylation factor family
299156	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
299156	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
299157	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
299158	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
299159	ATP synthase subunit D. This is a family of subunit D form various ATP synthases including V-type H+ transporting and Na+ dependent. Subunit D is suggested to be an integral part of the catalytic sector of the V-ATPase
299163	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
299166	Ribosomal L39 protein
299167	Macrophage migration inhibitory factor (MIF)
299168	Core histone H2A/H2B/H3/H4
299171	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
299172	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
299172	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
299174	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
299175	Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene
299176	S-adenosyl-L-homocysteine hydrolase
299177	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299178	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t
299182	Ribosomal protein L21e
299182	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
299184	Poly-adenylate binding protein, unique domain
299185	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
299188	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
299189	Phosphotyrosine interaction domain (PTB/PID)
299190	Ribosomal protein L23
299194	Zinc-binding dehydrogenase
299195	Monooxygenase. This family includes diverse enzymes that utilise FAD
299196	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
299197	Filamin/ABP280 repeat
299197	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
299201	2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain
299201	Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins
299202	MIT domain
299202	Protein kinase domain
299203	Acylphosphatase
299205	Ribosomal protein L21e
299206	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
299207	Uncharacterised protein family (UPF0143). This family of uncharacterised proteins are integral membrane proteins. They may contain 4 transmembrane helices. The family contains a conserved arginine and histidine that may be functionally important
299210	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
299210	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
299211	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
299212	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
299213	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299223	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
299224	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
299227	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299228	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
299232	metallopeptidase family M24
299236	Fibronectin type III domain
299236	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
299239	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
299241	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
299242	7 transmembrane receptor (rhodopsin family)
299244	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
299251	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
299254	Ribosomal protein S2
299255	Kinesin motor domain
299266	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
299267	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
299270	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299271	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299271	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299272	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299273	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299274	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299276	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299277	ENV polyprotein (coat polyprotein)
299277	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299277	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
299278	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299279	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299280	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299281	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299282	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299282	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299283	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
299284	RNase3 domain
299285	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
299286	Ribosomal protein L6e
299286	Ribosomal protein L6, N-terminal domain
299289	TCL1/MTCP1 family. Two related oncogenes, TCL-1 and MTCP-1, are overexpressed in T cell prolymphocytic leukemias as a result of chromosomal rearrangements that involve the translocation of one T cell receptor gene to either chromosome 14q32 or Xq28
299294	Ribosomal protein L21e
299298	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
299301	Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5
299301	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
299302	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
299303	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
299304	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
299306	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
299307	Core histone H2A/H2B/H3/H4
299308	Ribosomal protein S5, C-terminal domain
299308	Ribosomal protein S5, N-terminal domain
299309	Fork head domain
299313	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription b
299316	Mitochondrial carrier protein
299325	Ribosomal protein L3
299326	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
299327	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299327	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299328	Ribosomal protein S21e
299329	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
299331	Hsp90 protein
299331	Hsp90 protein
299331	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
299332	Protein kinase domain
299333	Ribosomal protein L44
299343	Fibrillarin
299343	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
299343	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
299346	NUDIX domain
299355	Marek's disease glycoprotein A
299358	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299360	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299362	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299366	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299366	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
299367	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299368	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299369	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299372	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299375	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299376	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299382	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299384	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299386	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299388	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299389	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299390	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299391	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299393	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299397	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299398	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299400	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299402	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299404	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299405	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299406	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299409	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299411	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299413	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299415	Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids
299418	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299420	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299423	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299428	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299434	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299435	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299437	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299438	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299440	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
299441	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299443	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299450	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299451	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299455	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299458	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299460	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299462	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299464	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299467	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299468	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299469	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299471	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299473	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299475	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299478	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299479	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299483	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299484	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299488	C2 domain
299489	C2 domain
299491	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
299491	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
299492	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
299496	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
299497	Macrophage migration inhibitory factor (MIF)
299498	Protein kinase domain
299500	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299500	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299501	Yippee putative zinc-binding protein
299502	GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein
299503	Ribosomal protein S3, C-terminal domain. This family contains a central domain pfam00013, hence the amino and carboxyl terminal domains are stored separately. This is a minimal carboxyl-terminal domain. Some are much longer
299504	Glutamine amidotransferase class-I
299505	Aminotransferase class I and II
299510	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
299511	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
299512	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
299513	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
299513	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
299514	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
299519	Translationally controlled tumor protein
299520	Ribosomal protein S5, C-terminal domain
299520	Ribosomal protein S5, N-terminal domain
299523	Hsp90 protein
299524	Hsp90 protein
299526	Heme oxygenase
299530	7 transmembrane receptor (rhodopsin family)
299535	Translationally controlled tumor protein
299535	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
299538	Ribosomal protein S7e
299544	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299546	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299551	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299552	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
299553	7 transmembrane receptor (rhodopsin family)
299555	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
299556	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
299557	jmjC domain
299557	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
299557	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
299558	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299558	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299560	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
299562	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
299564	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
299566	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
299570	7 transmembrane receptor (rhodopsin family)
299572	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
299573	7 transmembrane receptor (rhodopsin family)
299574	7 transmembrane receptor (rhodopsin family)
299575	7 transmembrane receptor (rhodopsin family)
299576	7 transmembrane receptor (rhodopsin family)
299578	7 transmembrane receptor (metabotropic glutamate family)
299578	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
299578	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
299579	7 transmembrane receptor (metabotropic glutamate family)
299579	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
299581	7 transmembrane receptor (metabotropic glutamate family)
299582	7 transmembrane receptor (rhodopsin family)
299583	7 transmembrane receptor (metabotropic glutamate family)
299586	7 transmembrane receptor (rhodopsin family)
299587	ICE-like protease (caspase) p10 domain
299587	ICE-like protease (caspase) p20 domain
299588	7 transmembrane receptor (rhodopsin family)
299589	7 transmembrane receptor (rhodopsin family)
299590	7 transmembrane receptor (rhodopsin family)
299591	7 transmembrane receptor (rhodopsin family)
299592	7 transmembrane receptor (rhodopsin family)
299593	7 transmembrane receptor (rhodopsin family)
299594	7 transmembrane receptor (rhodopsin family)
299595	7 transmembrane receptor (rhodopsin family)
299596	7 transmembrane receptor (rhodopsin family)
299597	7 transmembrane receptor (rhodopsin family)
299598	7 transmembrane receptor (metabotropic glutamate family)
299602	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
299603	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
299604	DNA-dependent RNA polymerase. This is a family of single chain RNA polymerases
299606	Trypsin
299609	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
299610	RNA polymerase Rpb5, C-terminal domain. The assembly domain of Rpb5. The archaeal equivalent to this domain is subunit H. Subunit H lacks the N-terminal domain
299610	RNA polymerase Rpb5, N-terminal domain. Rpb5 has a bipartite structure which includes a eukaryote-specific N-terminal domain and a C-terminal domain resembling the archaeal RNAP subunit H. The N-terminal domain is involved in DNA binding and is part of th
299614	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
299615	Homeobox domain
299617	Dienelactone hydrolase family
299617	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
299622	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299622	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299623	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
299624	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299625	Intermediate filament protein
299626	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
299627	pfam02891, zf-MIZ, MIZ zinc finger
299632	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
299636	Groucho/TLE N-terminal Q-rich domain. The N-terminal domain of the Grouch/TLE co-repressor proteins are involved in oligomerisation
299638	Sir2 family
299640	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
299644	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
299649	7 transmembrane receptor (rhodopsin family)
299649	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
299650	7 transmembrane receptor (rhodopsin family)
299650	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
299651	Uncharacterized protein family UPF0023
299652	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
299653	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
299653	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
299654	7 transmembrane receptor (metabotropic glutamate family)
299655	7 transmembrane receptor (metabotropic glutamate family)
299655	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
299657	7 transmembrane receptor (metabotropic glutamate family)
299659	7 transmembrane receptor (metabotropic glutamate family)
299660	Protein kinase domain
299662	Protein kinase domain
299663	Protein kinase domain
299664	Protein kinase domain
299667	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
299668	Protein kinase domain
299670	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
299671	Protein kinase domain
299672	Protein kinase domain
299673	Protein kinase domain
299674	Protein kinase domain
299676	Protein kinase domain
299680	7 transmembrane receptor (metabotropic glutamate family)
299682	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
299684	7 transmembrane receptor (rhodopsin family)
299685	LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
299685	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
299686	Uncharacterized protein family UPF0027
299687	Helix-loop-helix DNA-binding domain
299689	'Cold-shock' DNA-binding domain
299692	RNA polymerase beta subunit. RNA polymerases catalyse the DNA dependent polymerisation of RNA. Prokaryotes contain a single RNA polymerase compared to three in eukaryotes (not including mitochondrial. and chloroplast polymerases). Each RNA polymerase comp
299694	Protein kinase domain
299698	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
299699	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
299703	HMG (high mobility group) box
299705	14-3-3 protein
299706	WD domain, G-beta repeat
299710	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
299711	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299711	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299712	Ribosomal protein S5, C-terminal domain
299712	Ribosomal protein S5, N-terminal domain
299713	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
299718	Cyclophilin type peptidyl-prolyl cis-trans isomerase
299724	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
299726	Ribosomal protein L21e
299727	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299727	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299732	Proteasome A-type and B-type
299733	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
299737	Laminin N-terminal (Domain VI)
299737	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
299737	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
299740	Ribosomal protein L31e
299744	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299744	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299745	Transaldolase
299746	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299746	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299749	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
299749	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
299749	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
299750	Citrate synthase
299752	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299754	Ribosomal protein S15
299756	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299758	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299761	ENV polyprotein (coat polyprotein)
299762	TPR Domain
299763	Intermediate filament protein
299766	Helix-loop-helix DNA-binding domain
299766	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates s
299767	Helix-loop-helix DNA-binding domain
299767	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates s
299768	Ribosomal protein S6e
299769	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
299772	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
299775	Ribosomal protein L44
299776	Mitosis protein DIM1
299777	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
299779	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
299780	Oxysterol-binding protein
299783	SCP-like extracellular protein. This domain is also found in prokaryotes
299784	SCP-like extracellular protein. This domain is also found in prokaryotes
299786	ENV polyprotein (coat polyprotein)
299788	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
299790	ADP-ribosylation factor family
299791	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
299792	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
299796	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
299797	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
299799	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
299800	Integral membrane protein DUF92. Members of this family have several predicted transmembrane helices. The function of these prokaryotic proteins is unknown
299801	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
299802	7 transmembrane receptor (rhodopsin family)
299804	Ribosomal protein L19e
299805	NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5
299806	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
299808	Uncharacterized protein family UPF0027
299809	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
299810	YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity
299813	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
299815	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299815	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299820	Protein-tyrosine phosphatase
299820	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
299822	Ribosomal L29e protein family
299823	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
299824	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
299825	Sulfatase
299827	Protein kinase domain
299832	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
299833	Ribosomal protein L19e
299835	Actin
299840	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
299842	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
299843	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
299844	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
299845	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
299846	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
299847	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
299848	Ribosomal protein S26e
299851	WHEP-TRS domain
299852	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299853	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299853	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299854	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
299857	Serine hydroxymethyltransferase
299868	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
299868	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
299870	ENV polyprotein (coat polyprotein)
299870	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
299872	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
299875	Intermediate filament protein
299878	Cadherin domain
299879	Fibrinogen beta and gamma chains, C-terminal globular domain
299881	Fibrinogen beta and gamma chains, C-terminal globular domain
299886	Ribosomal L38e protein family
299888	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299888	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299890	CUB domain
299891	CUB domain
299891	Sushi domain (SCR repeat)
299892	CUB domain
299893	Sushi domain (SCR repeat)
299894	Sushi domain (SCR repeat)
299899	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
299901	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
299903	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
299904	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
299906	HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats
299913	Proteasome A-type and B-type
299914	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
299915	Protein kinase domain
299917	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
299919	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t
299921	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
299923	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
299923	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to
299927	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299928	Lectin C-type domain. This family includes both long and short form C-type
299928	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
299929	Lectin C-type domain. This family includes both long and short form C-type
299929	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
299930	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299930	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299931	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
299934	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
299935	Ribosomal protein L31e
299936	Translation initiation factor SUI1
299937	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
299938	Ribosomal protein L13
299942	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
299943	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
299944	Sugar (and other) transporter
299945	Mitochondrial carrier protein
299947	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
299948	Ribosomal protein L31e
299951	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
299952	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
299958	Isocitrate/isopropylmalate dehydrogenase
299961	GCM motif protein
299962	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
299964	NAD-dependent glycerol-3-phosphate dehydrogenase
299965	ENV polyprotein (coat polyprotein)
299968	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
299968	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
299969	WD domain, G-beta repeat
299969	Sof1-like domain. Sof1 is essential for cell growth and is a component of the nucleolar rRNA processing machinery
299970	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
299971	V-ATPase subunit C
299972	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
299976	Ribonucleotide reductase, small chain
299980	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
299982	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
299983	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
299983	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
299985	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
299988	ADP-ribosylglycohydrolase. This family includes enzymes that ADP-ribosylations, for example ADP-ribosylarginine hydrolase EC:3.2.2.19 cleaves ADP-ribose-L-arginine. The family also includes dinitrogenase reductase activating glycohydrolase. Most surprisin
299990	Protein kinase domain
299991	Protein kinase domain
299995	ENV polyprotein (coat polyprotein)
299996	von Willebrand factor type A domain
300001	Ribosomal protein L36e
300005	Actin
300007	Core histone H2A/H2B/H3/H4
300009	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
300011	Ribosomal protein S19e
300014	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
300015	4-hydroxybenzoyl-CoA thioesterase. This enzyme (EC 3.1.2.23) catalyses the final step in the biosynthesis of 4-hydroxybenzoate from 4-chlorobenzoate in the soil dwelling microbe Pseudomonas CBS-3
300016	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
300017	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
300018	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
300020	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
300021	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
300023	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
300024	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
300025	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
300026	Ribosomal protein L21e
300029	Eukaryotic DNA topoisomerase I, catalytic core. Topoisomerase I promotes the relaxation of DNA superhelical tension by introducing a transient single-stranded break in duplex DNA and are vital for the processes of replication, transcription, and recombina
300030	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
300031	Ribosomal protein L21e
300033	EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains
300034	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di
300034	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
300035	Delta 1-pyrroline-5-carboxylate reductase
300036	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
300036	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
300044	Adenylate and Guanylate cyclase catalytic domain
300045	3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
300046	Gaa1-like, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to newly s
300047	Cytochrome C1 family
300050	WD domain, G-beta repeat
300051	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
300052	VPS28 protein
300053	Kinesin motor domain
300054	Fork head domain
300055	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
300062	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
300063	Octicosapeptide repeat. Short motif that may bind Ca2+
300066	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
300068	Aminotransferase class I and II
300071	Domain of unknown function
300077	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
300078	Dynein light chain type 1
300079	Ribosomal protein L3
300082	Ribosomal protein L21e
300083	Mitochondrial carrier protein
300088	RNA polymerase Rpb7, N-terminal domain. Rpb7 bind to Rpb4 to form a heterodimer. This complex is thought to interact with the nascent RNA strand during Pol II elongation
300089	Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions
300090	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
300092	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
300093	Cytochrome C oxidase subunit II, periplasmic domain
300095	Helix-loop-helix DNA-binding domain
300099	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
300101	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
300103	Ribosomal protein L36e
300104	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
300105	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
300108	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of
300111	Bacterial surface antigen. This entry includes the following surface antigens
300112	Bacterial surface antigen. This entry includes the following surface antigens
300115	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
300117	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
300121	SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily
300123	Cyclophilin type peptidyl-prolyl cis-trans isomerase
300127	7 transmembrane receptor (Secretin family)
300127	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
300127	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are
300128	Cadherin domain
300144	Protein kinase domain
300147	PH domain. PH stands for pleckstrin homology
300150	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
300156	Ribosomal protein S19e
300156	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
300158	WD domain, G-beta repeat
300161	Protein kinase domain
300162	HMG (high mobility group) box
300165	Respiratory-chain NADH dehydrogenase, 49 Kd subunit
300170	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
300172	LIM domain. This family represents two copies of the LIM structural domain
300173	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
300174	Ribosomal protein L35Ae
300175	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
300177	Protein kinase domain
300178	Homeobox domain
300179	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
300181	Ribosomal protein S5, N-terminal domain
300182	Ribosomal protein S5, N-terminal domain
300185	Ribosomal protein L31e
300187	NAC domain
300188	Translationally controlled tumor protein
300194	7 transmembrane receptor (rhodopsin family)
300195	7 transmembrane receptor (rhodopsin family)
300197	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
300197	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
300198	7 transmembrane receptor (rhodopsin family)
300199	7 transmembrane receptor (rhodopsin family)
300200	7 transmembrane receptor (rhodopsin family)
300202	7 transmembrane receptor (rhodopsin family)
300203	7 transmembrane receptor (rhodopsin family)
300204	7 transmembrane receptor (rhodopsin family)
300205	7 transmembrane receptor (rhodopsin family)
300208	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
300211	FKBP-type peptidyl-prolyl cis-trans isomerase
300212	wnt family
300216	Ribosomal protein L13e
300217	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
300217	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
300220	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
300222	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
300224	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
300227	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
300231	AMP-binding enzyme
300232	Trypsin
300232	Immunoglobulin A1 protease. This family consists of immunoglobulin A1 protease proteins. The immunoglobulin A1 protease cleaves immunoglobulin IgA and is found in pathogenic bacteria such as Neisseria gonorrhoeae. Not all of the members of this family are
300238	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
300239	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
300241	S25 ribosomal protein
300242	Intermediate filament protein
300243	Intermediate filament protein
300244	Intermediate filament protein
300245	Intermediate filament protein
300245	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
300246	Intermediate filament protein
300246	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
300247	Intermediate filament protein
300247	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
300247	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
300248	Intermediate filament protein
300248	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
300249	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
300250	Intermediate filament protein
300251	Intermediate filament protein
300251	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
300255	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
300257	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription b
300258	Uncharacterised protein family (UPF0160). This family of proteins contains a large number of metal binding residues. The patterns are suggestive of a phosphoesterase function. The conserved DHH motif may mean this family is related to pfam01368
300260	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can b
300260	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
300262	Homeobox domain
300263	Homeobox domain
300264	Homeobox domain
300264	Occludin/ELL family
300266	'chromo' (CHRromatin Organization MOdifier) domain
300266	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
300268	Zinc finger, C3HC4 type (RING finger)
300271	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
300274	Zinc finger, C3HC4 type (RING finger)
300274	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
300275	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
300276	C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome
300277	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
300278	Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However
300278	S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea
300283	Ribosomal protein L10
300284	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
300291	Actin
300295	Hsp90 protein
300296	Cyclophilin type peptidyl-prolyl cis-trans isomerase
300300	Stathmin family
300300	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
300302	HIT family
300303	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
300305	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
300306	Cyclophilin type peptidyl-prolyl cis-trans isomerase
300306	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
300307	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
300308	Protein kinase domain
300313	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
300315	Ribosomal protein L19e
300320	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
300321	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
300323	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
300328	4'-phosphopantetheinyl transferase superfamily. Members of this family transfers the 4'-phosphopantetheine (4'-PP) moiety from coenzyme A (CoA) to the invariant serine of pfam00550. This post-translational modification renders holo-ACP capable of acyl gro
300329	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
300330	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
300331	Phosphoglycerate kinase
300332	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
300334	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
300335	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
300337	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
300338	Fragilysin metallopeptidase (M10C) enterotoxin
300338	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
300338	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
300338	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases
300339	Fragilysin metallopeptidase (M10C) enterotoxin
300339	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
300339	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
300339	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family
300339	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases
300340	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
300340	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
300340	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases
300341	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
300341	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
300341	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases
300344	Galactoside-binding lectin
300345	ENV polyprotein (coat polyprotein)
300346	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
300347	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
300347	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
300347	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
300350	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
300350	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
300352	ENV polyprotein (coat polyprotein)
300353	Protein kinase domain
300353	Protein kinase C terminal domain
300354	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
300355	ENV polyprotein (coat polyprotein)
300357	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
300358	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
300358	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
300363	Peptidase family M49
300364	Peptidase family M49
300365	Ribosomal protein S24e
300377	7 transmembrane receptor (rhodopsin family)
300378	7 transmembrane receptor (rhodopsin family)
300379	7 transmembrane receptor (rhodopsin family)
300380	7 transmembrane receptor (rhodopsin family)
300381	HMG (high mobility group) box
300382	EB1-like C-terminal motif. This motif is found at the C-terminus of proteins that are related to the EB1 protein. The EB1 proteins contain an N-terminal CH domain pfam00307. The human EB1 protein was originally discovered as a protein interacting with the
300383	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
300384	Peptidase family M28D
300384	Transferring receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure
300385	7 transmembrane receptor (rhodopsin family)
300386	HMG (high mobility group) box
300388	Ribosomal L15
300391	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
300392	7 transmembrane receptor (rhodopsin family)
300393	7 transmembrane receptor (rhodopsin family)
300394	7 transmembrane receptor (rhodopsin family)
300394	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
300395	7 transmembrane receptor (rhodopsin family)
300396	7 transmembrane receptor (rhodopsin family)
300397	7 transmembrane receptor (rhodopsin family)
300398	7 transmembrane receptor (rhodopsin family)
300399	7 transmembrane receptor (rhodopsin family)
300400	7 transmembrane receptor (rhodopsin family)
300401	7 transmembrane receptor (rhodopsin family)
300402	7 transmembrane receptor (rhodopsin family)
300403	7 transmembrane receptor (rhodopsin family)
300404	7 transmembrane receptor (rhodopsin family)
300405	7 transmembrane receptor (rhodopsin family)
300406	7 transmembrane receptor (rhodopsin family)
300407	7 transmembrane receptor (rhodopsin family)
300408	7 transmembrane receptor (rhodopsin family)
300409	7 transmembrane receptor (rhodopsin family)
300410	7 transmembrane receptor (rhodopsin family)
300411	7 transmembrane receptor (rhodopsin family)
300413	7 transmembrane receptor (rhodopsin family)
300414	7 transmembrane receptor (rhodopsin family)
300415	7 transmembrane receptor (rhodopsin family)
300416	7 transmembrane receptor (rhodopsin family)
300417	7 transmembrane receptor (rhodopsin family)
300417	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
300418	7 transmembrane receptor (rhodopsin family)
300419	7 transmembrane receptor (rhodopsin family)
300420	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
300421	7 transmembrane receptor (rhodopsin family)
300422	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
300423	7 transmembrane receptor (rhodopsin family)
300425	7 transmembrane receptor (rhodopsin family)
300426	7 transmembrane receptor (rhodopsin family)
300426	'chromo' (CHRromatin Organization MOdifier) domain
300427	7 transmembrane receptor (rhodopsin family)
300428	7 transmembrane receptor (rhodopsin family)
300430	Fork head domain
300431	7 transmembrane receptor (rhodopsin family)
300432	ENV polyprotein (coat polyprotein)
300433	7 transmembrane receptor (rhodopsin family)
300434	7 transmembrane receptor (rhodopsin family)
300435	7 transmembrane receptor (rhodopsin family)
300436	7 transmembrane receptor (rhodopsin family)
300438	Low-density lipoprotein receptor domain class A
300438	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
300438	Low-density lipoprotein receptor domain class A
300438	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
300444	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
300444	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
300446	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
300448	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
300453	S25 ribosomal protein
300455	von Willebrand factor type D domain
300455	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
300458	7 transmembrane receptor (rhodopsin family)
300465	Triosephosphate isomerase
300467	Glycosyl hydrolases family 35
300472	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps
300473	Eukaryotic porin
300474	Thrombospondin type 1 domain
300474	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
300474	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
300475	Thrombospondin type 1 domain
300475	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
300476	Ribosomal L29e protein family
300478	Ribosomal protein L21e
300479	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
300479	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
300480	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
300481	Ribosomal protein S19e
300482	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
300482	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
300484	Cyclophilin type peptidyl-prolyl cis-trans isomerase
300485	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
300489	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
300490	Fibronectin type III domain
300491	Enolase, N-terminal domain
300491	Enolase, C-terminal TIM barrel domain
300491	Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain
300511	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
300515	Ribosomal protein L34e
300518	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
300518	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
300522	7 transmembrane receptor (rhodopsin family)
300523	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
300524	7 transmembrane receptor (rhodopsin family)
300525	7 transmembrane receptor (rhodopsin family)
300526	7 transmembrane receptor (rhodopsin family)
300527	7 transmembrane receptor (rhodopsin family)
300528	7 transmembrane receptor (rhodopsin family)
300529	7 transmembrane receptor (rhodopsin family)
300530	7 transmembrane receptor (rhodopsin family)
300531	7 transmembrane receptor (rhodopsin family)
300533	Hsp90 protein
300533	7 transmembrane receptor (rhodopsin family)
300533	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
300534	7 transmembrane receptor (rhodopsin family)
300535	7 transmembrane receptor (rhodopsin family)
300536	7 transmembrane receptor (rhodopsin family)
300537	7 transmembrane receptor (rhodopsin family)
300538	7 transmembrane receptor (rhodopsin family)
300539	7 transmembrane receptor (rhodopsin family)
300540	7 transmembrane receptor (rhodopsin family)
300543	Myosin head (motor domain)
300545	7 transmembrane receptor (rhodopsin family)
300546	7 transmembrane receptor (rhodopsin family)
300547	7 transmembrane receptor (rhodopsin family)
300548	7 transmembrane receptor (rhodopsin family)
300549	7 transmembrane receptor (rhodopsin family)
300550	7 transmembrane receptor (rhodopsin family)
300551	7 transmembrane receptor (rhodopsin family)
300552	7 transmembrane receptor (rhodopsin family)
300554	7 transmembrane receptor (rhodopsin family)
300556	7 transmembrane receptor (rhodopsin family)
300557	7 transmembrane receptor (rhodopsin family)
300558	7 transmembrane receptor (rhodopsin family)
300559	7 transmembrane receptor (rhodopsin family)
300560	7 transmembrane receptor (rhodopsin family)
300561	7 transmembrane receptor (rhodopsin family)
300562	7 transmembrane receptor (rhodopsin family)
300563	7 transmembrane receptor (rhodopsin family)
300564	7 transmembrane receptor (rhodopsin family)
300565	7 transmembrane receptor (rhodopsin family)
300566	7 transmembrane receptor (rhodopsin family)
300567	7 transmembrane receptor (rhodopsin family)
300568	7 transmembrane receptor (rhodopsin family)
300569	7 transmembrane receptor (rhodopsin family)
300570	7 transmembrane receptor (rhodopsin family)
300571	7 transmembrane receptor (rhodopsin family)
300572	7 transmembrane receptor (rhodopsin family)
300573	7 transmembrane receptor (rhodopsin family)
300574	7 transmembrane receptor (rhodopsin family)
300575	7 transmembrane receptor (rhodopsin family)
300576	7 transmembrane receptor (rhodopsin family)
300577	7 transmembrane receptor (rhodopsin family)
300578	7 transmembrane receptor (rhodopsin family)
300579	7 transmembrane receptor (rhodopsin family)
300580	7 transmembrane receptor (rhodopsin family)
300581	7 transmembrane receptor (rhodopsin family)
300582	7 transmembrane receptor (rhodopsin family)
300583	7 transmembrane receptor (rhodopsin family)
300584	7 transmembrane receptor (rhodopsin family)
300585	7 transmembrane receptor (rhodopsin family)
300587	7 transmembrane receptor (rhodopsin family)
300588	7 transmembrane receptor (rhodopsin family)
300589	7 transmembrane receptor (rhodopsin family)
300590	7 transmembrane receptor (rhodopsin family)
300591	7 transmembrane receptor (rhodopsin family)
300592	7 transmembrane receptor (rhodopsin family)
300593	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
300595	7 transmembrane receptor (rhodopsin family)
300596	7 transmembrane receptor (rhodopsin family)
300597	7 transmembrane receptor (rhodopsin family)
300598	7 transmembrane receptor (rhodopsin family)
300599	7 transmembrane receptor (rhodopsin family)
300600	7 transmembrane receptor (rhodopsin family)
300602	7 transmembrane receptor (rhodopsin family)
300603	7 transmembrane receptor (rhodopsin family)
300604	7 transmembrane receptor (rhodopsin family)
300605	7 transmembrane receptor (rhodopsin family)
300606	7 transmembrane receptor (rhodopsin family)
300607	7 transmembrane receptor (rhodopsin family)
300609	7 transmembrane receptor (rhodopsin family)
300610	7 transmembrane receptor (rhodopsin family)
300613	7 transmembrane receptor (rhodopsin family)
300614	7 transmembrane receptor (rhodopsin family)
300615	7 transmembrane receptor (rhodopsin family)
300616	7 transmembrane receptor (rhodopsin family)
300617	Ribosomal protein S5, C-terminal domain
300617	Ribosomal protein S5, N-terminal domain
300618	7 transmembrane receptor (rhodopsin family)
300620	7 transmembrane receptor (rhodopsin family)
300621	7 transmembrane receptor (rhodopsin family)
300622	7 transmembrane receptor (rhodopsin family)
300623	7 transmembrane receptor (rhodopsin family)
300624	7 transmembrane receptor (rhodopsin family)
300625	7 transmembrane receptor (rhodopsin family)
300626	7 transmembrane receptor (rhodopsin family)
300627	7 transmembrane receptor (rhodopsin family)
300628	7 transmembrane receptor (rhodopsin family)
300629	7 transmembrane receptor (rhodopsin family)
300630	7 transmembrane receptor (rhodopsin family)
300631	7 transmembrane receptor (rhodopsin family)
300632	7 transmembrane receptor (rhodopsin family)
300633	7 transmembrane receptor (rhodopsin family)
300634	7 transmembrane receptor (rhodopsin family)
300635	7 transmembrane receptor (rhodopsin family)
300636	7 transmembrane receptor (rhodopsin family)
300637	7 transmembrane receptor (rhodopsin family)
300638	7 transmembrane receptor (rhodopsin family)
300639	7 transmembrane receptor (rhodopsin family)
300640	7 transmembrane receptor (rhodopsin family)
300643	7 transmembrane receptor (rhodopsin family)
300645	Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in
300646	Ribosomal L15
300647	Herpesvirus UL6 like. This family consists of various proteins from the herpesviridae that are similar to herpes simplex virus type I UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome
300647	EB1-like C-terminal motif. This motif is found at the C-terminus of proteins that are related to the EB1 protein. The EB1 proteins contain an N-terminal CH domain pfam00307. The human EB1 protein was originally discovered as a protein interacting with the
300648	Homeobox domain
300651	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
300651	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
300653	Zona pellucida-like domain
300653	von Willebrand factor type D domain
300653	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
300654	Ribosomal L29e protein family
300655	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
300656	B-box zinc finger
300657	B-box zinc finger
300660	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
300662	CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and
300668	Core histone H2A/H2B/H3/H4
300669	Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses
300679	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
300680	Trypsin
300681	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
300682	Trypsin
300682	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
300685	ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with other
300686	ZPR1 zinc-finger domain. The zinc-finger protein ZPR1 is ubiquitous among eukaryotes. It is indeed known to be an essential protein in yeast. In quiescent cells, ZPR1 is localized to the cytoplasm. But in proliferating cells treated with EGF or with other
300688	Ribosomal protein S21e
300689	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
300691	NNMT/PNMT/TEMT family
300692	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
300697	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
300697	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
300698	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
300701	HMG14 and HMG17
300706	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
300707	NAC domain
300709	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
300709	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
300710	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
300710	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
300714	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
300715	WD domain, G-beta repeat
300716	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
300716	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
300718	SH2 domain
300721	DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh
300721	DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds
300722	WD domain, G-beta repeat
300723	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
300726	Electron transfer flavoprotein alpha subunit. This protein is distantly related to and forms a heterodimer with pfam01012
300728	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
300729	Sodium / potassium ATPase beta chain
300731	Ribosomal protein L21e
300732	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
300733	Tetraspanin family
300735	Ribosomal protein S8
300736	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
300737	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
300738	Protein-tyrosine phosphatase
300739	Eukaryotic porin
300740	Ribosomal protein L11, RNA binding domain
300741	Phosphomannose isomerase type I
300742	MIT domain
300742	Protein kinase domain
300744	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
300745	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
300748	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
300748	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
300757	Glycosyl hydrolase family 20, domain 2. This domain has a zincin-like fold
300757	Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold
300758	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
300767	Ribosomal protein S28e
300770	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
300771	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
300774	Cyclophilin type peptidyl-prolyl cis-trans isomerase
300775	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
300778	RNB-like protein. The function of this region of similarity is uncertain
300784	Fibronectin type III domain
300785	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
300789	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
300789	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
300792	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
300794	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
300796	PX domain. PX domains bind to phosphoinositides
300797	Domain of unknown function DUF59. This family includes prokaryotic proteins of unknown function. The family also includes PhaH from Pseudomonas putida. PhaH forms a complex with PhaF, PhaG and PhaI, which hydroxylates phenylacetic acid to 2-hydroxyphenyla
300798	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
300799	Protein kinase domain
300800	Ribosomal L15
300807	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
300810	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
300812	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
300812	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
300815	Disintegrin
300815	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
300816	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
300817	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
300821	Ribosomal L29 protein
300823	Ribosomal protein S7e
300824	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
300827	Ribosomal protein L24e
300828	Translation initiation factor SUI1
300831	C2 domain
300831	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
300832	Ribosomal protein L34e
300833	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
300835	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
300838	Fimbrial Usher protein. This protein is involved in biogenesis of gram negative bacterial pili
300838	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
300840	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
300841	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuron
300844	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
300845	Papain family cysteine protease
300845	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
300850	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
300850	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
300852	Glucose inhibited division protein A
300853	Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular m
300856	Ribosomal S3Ae family
300857	LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo
300860	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
300861	14-3-3 protein
300863	Translationally controlled tumor protein
300867	Ribosomal protein L11, RNA binding domain
300867	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
300868	Proteasome A-type and B-type
300871	Ribosomal protein L31e
300872	ENV polyprotein (coat polyprotein)
300874	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
300876	Stathmin family
300877	Phosphoglucomutase/phosphomannomutase, C-terminal domain
300881	Ribosomal protein S8
300882	Ribosomal protein S19
300884	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
300884	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
300886	5-formyltetrahydrofolate cyclo-ligase family. 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyses the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF i
300888	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
300889	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
300890	Thymidine kinase
300895	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
300895	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
300898	Ribosomal protein L31e
300899	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved
300900	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved
300901	2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2
300902	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
300903	Ribosomal protein S2
300904	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
300904	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
300908	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
300913	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
300914	Eukaryotic ribosomal protein L18
300915	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
300920	PMP-22/EMP/MP20/Claudin family
300924	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
300926	Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a
300928	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
300929	Ribosomal protein L19e
300930	Fibrillarin
300931	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
300931	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
300932	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
300933	Metallo-beta-lactamase superfamily
300935	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
300935	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
300939	Hsp90 protein
300940	Calx-beta domain
300940	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
300943	Phosphatidylinositol 3- and 4-kinase
300943	FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal
300945	FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain
300946	Ribosomal protein L31e
300947	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
300949	Cyclophilin type peptidyl-prolyl cis-trans isomerase
300950	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
300951	Apoptosis regulator proteins, Bcl-2 family
300953	PMP-22/EMP/MP20/Claudin family
300954	HMG (high mobility group) box
300955	SH2 domain
300955	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
300956	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
300957	Stromal antigen (SA/STAG) protein
300959	Ribosomal protein L19e
300960	Protein kinase domain
300960	Fibronectin type III domain
300960	Giardia variant-specific surface protein
300960	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
300960	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
300962	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
300964	Transferrin
300965	ADP-ribosylation factor family
300965	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
300968	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
300969	Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle
300970	Ribosomal protein S6e
300972	C2 domain
300973	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
300974	Ribosomal protein L3
300975	NUDIX domain
300978	WD domain, G-beta repeat
300979	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
300981	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
300983	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
300985	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
300992	Cache domain
300993	Hyaluronidase
300995	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
300996	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
300997	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
300998	Enolase, N-terminal domain
300998	Enolase, C-terminal TIM barrel domain
300999	Protein kinase domain
300999	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
300999	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
300999	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
301003	tRNA synthetases class I (E and Q), anti-codon binding domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase a
301005	FMN-dependent dehydrogenase
301005	2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor
301005	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
301015	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
301016	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
301017	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
301021	Ribosomal protein L13e
301025	Protein-tyrosine phosphatase
301026	Cathelicidin. A novel protein family, showing a conserved proregion and a variable carboxyl-terminal antimicrobial domain. This region shows similarity to cystatins
301027	Trypsin
301027	Immunoglobulin A1 protease. This family consists of immunoglobulin A1 protease proteins. The immunoglobulin A1 protease cleaves immunoglobulin IgA and is found in pathogenic bacteria such as Neisseria gonorrhoeae. Not all of the members of this family are
301028	Trypsin
301034	Transferrin
301039	Extracellular link domain
301042	B-box zinc finger
301042	Filamin/ABP280 repeat
301042	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
301042	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
301042	NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in some members, proteolysis has shown that the Peptidyl-alph
301047	NAD-dependent glycerol-3-phosphate dehydrogenase
301048	Oligosaccharyl transferase STT3 subunit. This family consists of the oligsacharyl transferase STT3 subunit and related proteins. The STT3 subunit is part of the oligosccharyl transferase (OTase) complex of proteins and is required for its activity. OTase
301050	Aminotransferase class-V
301050	Aminotransferase class I and II
301050	Pyridoxal-dependent decarboxylase conserved domain
301050	Herpesvirus transcription activation factor (transactivator). This family includes EBV BRLF1 and similar ORF 50 proteins from other herpesviruses
301051	Pyridoxal-dependent decarboxylase conserved domain
301053	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
301056	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
301057	Villin headpiece domain
301058	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
301059	Death domain
301059	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
301060	Protein kinase domain
301060	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with t
301063	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
301066	7 transmembrane receptor (rhodopsin family)
301067	Mitochondrial carrier protein
301068	Translation initiation factor SUI1
301069	Ribosomal protein S19
301073	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
301076	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
301076	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
301077	Ribosomal protein L21e
301078	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
301081	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
301083	Sodium:neurotransmitter symporter family
301084	7 transmembrane receptor (rhodopsin family)
301085	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
301085	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
301086	7 transmembrane receptor (rhodopsin family)
301087	Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone
301088	7 transmembrane receptor (rhodopsin family)
301089	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
301090	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
301091	7 transmembrane receptor (rhodopsin family)
301093	7 transmembrane receptor (rhodopsin family)
301096	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
301106	Fibronectin type III domain
301107	Transferrin
301108	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
301108	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
301113	Mitochondrial carrier protein
301114	Mitochondrial carrier protein
301115	Adenylate kinase
301115	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
301117	Clp protease. The Clp protease has an active site catalytic triad. In E. coli Clp protease, ser-111, his-136 and asp-185 form the catalytic triad. One member has lost all of these active site residues and is therefore inactive, others contain one or two l
301120	AMP-binding enzyme
301122	Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrour
301125	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
301127	Zinc finger, C3HC4 type (RING finger)
301130	Dienelactone hydrolase family
301132	TNF(Tumor Necrosis Factor) family
301133	TNF(Tumor Necrosis Factor) family
301134	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
301134	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
301135	HMG (high mobility group) box
301136	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
301137	7 transmembrane receptor (metabotropic glutamate family)
301137	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
301138	7 transmembrane receptor (metabotropic glutamate family)
301139	Hsp90 protein
301140	7 transmembrane receptor (metabotropic glutamate family)
301143	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
301149	7 transmembrane receptor (Secretin family)
301149	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
301149	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
301152	Telomere-binding protein alpha subunit, N-terminal domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. Al
301154	Protein kinase domain
301154	7 transmembrane receptor (metabotropic glutamate family)
301154	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
301155	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
301156	Enolase, N-terminal domain
301156	Enolase, C-terminal TIM barrel domain
301158	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
301160	Transaldolase
301160	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
301163	Actin
301164	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
301165	Protein kinase domain
301166	Protein kinase domain
301170	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
301174	Ribosomal L10
301218	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
301221	Radical SAM superfamily
301223	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
301224	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
301225	Death domain
301225	ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function
301231	Ribosomal protein L13
301234	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
301236	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
301237	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
301240	Ribosomal Proteins L2, C-terminal domain
301240	Ribosomal Proteins L2, RNA binding domain
301241	Protein kinase domain
301244	LIM domain. This family represents two copies of the LIM structural domain
301248	Ribosomal protein S5, C-terminal domain
301248	Ribosomal protein S5, N-terminal domain
301249	Ribosomal protein S18
301250	Ribosomal protein L14p/L23e
301251	Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length
301252	Hsp90 protein
301252	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
301254	tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases
301259	Runt domain
301261	Sulfatase
301261	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
301262	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
301264	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
301265	Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl
301269	7 transmembrane receptor (Secretin family)
301269	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
301270	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
301271	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
301272	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
301274	Ribosomal protein L21e
301275	B12 binding domain. This domain binds to B12 (adenosylcobamide), it is found in several enzymes, such as glutamate mutase, methionine synthase and methylmalonyl-CoA mutase
301276	Methylmalonyl-CoA mutase. The enzyme methylmalonyl-CoA mutase is a member of a class of enzymes that uses coenzyme B12 (adenosylcobalamin) as a cofactor. The enzyme induces the formation of an adenosyl radical from the cofactor. This radical then initiate
301280	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
301281	Cullin family
301282	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
301283	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
301284	Transcription factor AP-2
301285	Transcription factor AP-2
301290	Cyclophilin type peptidyl-prolyl cis-trans isomerase
301296	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
301297	Ribosomal L39 protein
301299	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
301300	P53
301303	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
301307	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
301310	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
301312	Thrombospondin N-terminal -like domain
301314	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
301319	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
301319	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
301320	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
301322	Zn-finger in Ran binding protein and others
301323	Eukaryotic-type DNA primase, large subunit. DNA primase is the polymerase that synthesises small RNA primers for the Okazaki fragments made during discontinuous DNA replication. DNA primase is a heterodimer of two subunits, the small subunit Pri1 (48 kDa
301324	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
301325	BAG domain. Domain present in Hsp70 regulators
301327	Ribosomal protein L36e
301328	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
301329	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
301333	Protein-tyrosine phosphatase
301336	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
301338	Ribosomal protein S11
301339	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
301341	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
301343	Legume-like lectin family. Lectins are structurally diverse proteins that bind to specific carbohydrates. This family includes the VIP36 and ERGIC-53 lectins. These two proteins were the first recognized members of a family of animal lectins similar (19-2
301346	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
301346	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
301352	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
301353	Transglycosylase SLT domain. This family is distantly related to pfam00062
301360	Zinc finger, C3HC4 type (RING finger)
301360	ATP-dependent protease La (LON) domain
301361	Zinc finger, C3HC4 type (RING finger)
301361	ATP-dependent protease La (LON) domain
301363	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
301364	Core histone H2A/H2B/H3/H4
301365	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
301366	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
301371	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
301372	7 transmembrane receptor (rhodopsin family)
301379	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
301381	Dienelactone hydrolase family
301384	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
301387	Ribosomal protein S12
301388	LacY proton/sugar symporter. This family is closely related to the sugar transporter family
301390	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
301390	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
301392	Ribosomal protein L11, RNA binding domain
301393	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
301395	Ribosomal protein S7e
301399	Hsp90 protein
301401	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
301405	Enolase, N-terminal domain
301405	Enolase, C-terminal TIM barrel domain
301407	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
301408	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
301412	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
301414	Ribosomal protein L31e
301415	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
301416	Uncharacterised protein family (UPF0083)
301417	Ribosomal protein L6e
301424	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
301424	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
301426	Protein kinase domain
301429	ICE-like protease (caspase) p20 domain
301430	Origin recognition complex subunit 2. All DNA replication initiation is driven by a single conserved eukaryotic initiator complex termed he origin recognition complex (ORC). The ORC is a six protein complex. The function of ORC is reviewed in
301431	Domain of unknown function DUF34. One member of this family NIF3 (NGG1p interacting factor 3) interacts with the yeast transcriptional coactivator NGG1p which is part of the ADA complex the exact function of this interaction is unknown
301432	Cyclophilin type peptidyl-prolyl cis-trans isomerase
301433	Protein kinase domain
301434	Protein kinase domain
301435	[2Fe-2S] binding domain
301435	CO dehydrogenase flavoprotein C-terminal domain
301435	FAD binding domain in molybdopterin dehydrogenase
301435	Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain
301435	Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain
301436	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
301438	Ribosomal protein S5, C-terminal domain
301438	Ribosomal protein S5, N-terminal domain
301439	Protein kinase domain
301440	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
301440	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled are
301442	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
301444	Acyl CoA binding protein
301445	WD domain, G-beta repeat
301447	Beige/BEACH domain
301448	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
301449	Ribosomal protein S5, C-terminal domain
301450	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
301455	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
301458	Molybdopterin oxidoreductase
301459	Cytochrome c oxidase subunit Vb
301461	Disintegrin
301461	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
301462	Ribosomal L29e protein family
301468	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
301469	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
301471	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
301472	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
301475	Enolase, N-terminal domain
301475	Enolase, C-terminal TIM barrel domain
301484	Microtubule associated protein (MAP65/ASE1 family)
301484	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
301485	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
301486	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
301488	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
301489	SCP-like extracellular protein. This domain is also found in prokaryotes
301490	ENV polyprotein (coat polyprotein)
301491	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
301492	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
301494	Mitosis protein DIM1
301495	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
301496	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
301496	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
301499	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
301499	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
301501	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
301501	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
301502	Mitosis protein DIM1
301502	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
301507	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
301510	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
301510	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
301511	Arp2/3 complex, 34kD subunit p34-Arc. Arp2/3 protein complex has been implicated in the control of actin polymerization in cells. The human complex consists of seven subunits which include the actin related Arp2 and Arp3, and five others referred to as p4
301512	WD domain, G-beta repeat
301513	Cell differentiation family, Rcd1-like. Two of the members in this family have been characterised as being involved in regulation of Ste11 regulated sex genes
301514	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
301517	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
301517	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
301518	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
301518	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh
301519	Cyclophilin type peptidyl-prolyl cis-trans isomerase
301520	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
301522	ATP synthase subunit C
301525	Glycosyl hydrolases family 35
301526	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
301527	Protein-tyrosine phosphatase
301528	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
301529	Aminopeptidase I zinc metalloprotease (M18)
301530	Aminopeptidase I zinc metalloprotease (M18)
301531	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
301532	Protein kinase domain
301533	Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars
301534	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
301539	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
301544	tRNA synthetase B5 domain. This domain is found in phenylalanine-tRNA synthetase beta subunits
301546	Ribosomal protein S2
301547	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
301549	WD domain, G-beta repeat
301549	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
301553	Ribosomal protein S5, C-terminal domain
301553	Ribosomal protein S5, N-terminal domain
301555	Cullin family
301568	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
301570	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
301571	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
301572	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
301573	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
301574	Mo25 protein family
301575	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
301576	7 transmembrane receptor (rhodopsin family)
301577	Signal recognition particle 14kD protein. The signal recognition particle (SRP) is a multimeric protein involved in targeting secretory proteins to the rough endoplasmic reticulum membrane. SRP14 and SRP9 form a complex essential for SRP RNA binding
301582	Ribosomal protein L21e
301584	Alkaline phosphatase
301584	Ribosomal protein L31e
301585	Alkaline phosphatase
301586	Alkaline phosphatase
301587	Alkaline phosphatase
301589	Eukaryotic initiation factor 4E
301591	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
301592	Zinc-binding dehydrogenase
301593	UDP-glucoronosyl and UDP-glucosyl transferase
301594	UDP-glucoronosyl and UDP-glucosyl transferase
301595	UDP-glucoronosyl and UDP-glucosyl transferase
301595	UDP-glucoronosyl and UDP-glucosyl transferase
301595	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
301597	Cyclophilin type peptidyl-prolyl cis-trans isomerase
301600	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
301600	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
301604	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
301604	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
301607	14-3-3 protein
301608	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
301609	Ribosomal protein L36e
301610	7 transmembrane receptor (rhodopsin family)
301611	7 transmembrane receptor (rhodopsin family)
301612	7 transmembrane receptor (rhodopsin family)
301613	7 transmembrane receptor (rhodopsin family)
301614	7 transmembrane receptor (rhodopsin family)
301615	7 transmembrane receptor (rhodopsin family)
301619	PH domain. PH stands for pleckstrin homology
301622	Thymidylate kinase
301624	pfam02913, FAD-oxidase_C, FAD linked oxidases, C-terminal domain. This domain has a ferredoxin-like fold
301626	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
301627	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
301628	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
301630	S-adenosyl-L-homocysteine hydrolase
301631	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
301632	S25 ribosomal protein
301633	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
301635	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
301637	Proteasome A-type and B-type
301638	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
301639	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
301640	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
301642	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
301643	Translationally controlled tumor protein
301645	Core histone H2A/H2B/H3/H4
301646	ENV polyprotein (coat polyprotein)
301650	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
301653	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
301657	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
301657	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
301658	Protein kinase domain
301663	Ribosomal S17
301664	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
301670	SH2 domain
301671	SH2 domain
301672	Ribosomal protein L31e
301673	DNA mismatch repair proteins, mutS family
301674	F-box domain
301674	Putative zinc finger in N-recognin
301677	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
301677	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
301678	Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are repr
301679	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
301679	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
301681	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
301683	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
301685	Enolase, C-terminal TIM barrel domain
301685	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
301687	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
301690	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
301690	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
301691	Ribosomal protein S7e
301693	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
301694	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
301704	Ribosomal protein L24e
301706	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
301707	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
301708	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
301709	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
301710	Ribosomal L18ae protein family
301713	MAM domain. An extracellular domain found in many receptors
301715	Fibronectin type III domain
301715	MAM domain. An extracellular domain found in many receptors
301719	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301720	Protein kinase domain
301722	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301723	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301724	Protein kinase domain
301726	Protein kinase domain
301728	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
301730	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
301731	Eukaryotic ribosomal protein L18
301732	Ribosomal protein L19e
301735	Enolase, N-terminal domain
301735	Enolase, C-terminal TIM barrel domain
301737	Protein kinase domain
301750	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
301753	impB/mucB/samB family. These proteins are involved in UV protection
301754	Resolvase, N terminal domain. The N-terminal domain of the resolvase family (this family) contains the active site and the dimer interface. The extended arm at the C-terminus of this domain connects to the C-terminal helix-turn-helix domain of resolvase -
301755	Transposase, Mutator family
301756	Conserved hypothetical protein 95
301756	ubiE/COQ5 methyltransferase family
301756	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in comple
301756	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to
301759	HMG (high mobility group) box
301761	Ribosomal protein S7e
301764	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
301766	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
301767	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
301768	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301771	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301772	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301773	Ribosomal protein L19e
301774	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301775	Ribosomal protein L19e
301777	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301778	Ribosomal protein L19e
301779	Ribosomal protein L19e
301780	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301785	WD domain, G-beta repeat
301786	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301788	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301792	ADP-ribosylation factor family
301792	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
301793	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
301794	7 transmembrane receptor (rhodopsin family)
301795	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
301796	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
301797	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
301797	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
301799	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
301799	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
301801	Ribosomal protein L19e
301802	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301803	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301807	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301808	Ribosomal protein L19e
301810	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301813	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301814	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301816	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301818	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301819	7 transmembrane receptor (rhodopsin family)
301822	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301824	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301827	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301830	Ribosomal protein L19e
301831	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301832	Ribosomal protein L19e
301833	Ribosomal protein L19e
301835	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301839	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301840	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301841	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301842	Ribosomal protein L19e
301843	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301844	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301848	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301852	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301853	Ribosomal protein L19e
301854	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301856	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301857	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301860	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301862	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301866	Ribosomal protein L19e
301867	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301868	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301869	Ribosomal protein L19e
301870	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301872	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301873	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301876	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301877	Glutaredoxin
301878	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301879	Ribosomal protein L19e
301880	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301881	Ribosomal protein L19e
301882	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301883	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301885	Uncharacterized ACR, COG1579
301885	Intermediate filament protein
301885	7 transmembrane receptor (rhodopsin family)
301885	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
301885	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
301885	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
301886	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301887	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
301890	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301891	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301893	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
301894	7 transmembrane receptor (metabotropic glutamate family)
301894	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
301897	Protein kinase domain
301898	Protein kinase domain
301899	Protein kinase domain
301900	Protein kinase domain
301901	Protein kinase domain
301902	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
301903	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
301910	WD domain, G-beta repeat
301911	ADP-ribosylation factor family
301911	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
301912	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301913	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301914	Ribosomal protein L19e
301915	Ribosomal protein L19e
301918	Ribosomal protein L19e
301923	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301925	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301926	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301928	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301929	Ribosomal protein L19e
301930	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301931	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301932	Ribosomal protein L19e
301932	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301934	Ribosomal protein L19e
301934	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301935	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301943	Ribosomal protein L19e
301944	Ribosomal protein L19e
301944	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301945	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301947	Ribosomal protein L19e
301947	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301948	Intermediate filament protein
301949	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301950	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301951	Ribosomal protein L19e
301953	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
301961	ENV polyprotein (coat polyprotein)
301962	Hsp90 protein
301963	Hsp90 protein
301966	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301967	Ribosomal protein L19e
301968	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301969	Ribosomal protein L19e
301970	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
301971	Kinesin motor domain
301981	HesB-like domain. This family includes HesB which may be involved in nitrogen fixation
301989	Glutaredoxin
301990	Glutaredoxin
301991	Glutaredoxin
301992	Glutaredoxin
301995	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
302000	Glutaredoxin
302002	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302003	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302005	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302006	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302007	7 transmembrane receptor (rhodopsin family)
302009	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302011	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302012	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302013	Ribosomal protein L19e
302017	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302018	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302021	Mitosis protein DIM1
302021	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
302023	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
302024	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
302025	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
302026	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
302027	7 transmembrane receptor (rhodopsin family)
302028	7 transmembrane receptor (metabotropic glutamate family)
302028	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
302033	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302033	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302034	S-adenosyl-L-homocysteine hydrolase
302035	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
302040	7 transmembrane receptor (rhodopsin family)
302041	7 transmembrane receptor (rhodopsin family)
302043	7 transmembrane receptor (rhodopsin family)
302044	7 transmembrane receptor (rhodopsin family)
302045	7 transmembrane receptor (rhodopsin family)
302046	7 transmembrane receptor (rhodopsin family)
302047	7 transmembrane receptor (rhodopsin family)
302048	7 transmembrane receptor (rhodopsin family)
302048	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
302052	7 transmembrane receptor (rhodopsin family)
302053	7 transmembrane receptor (rhodopsin family)
302054	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
302055	7 transmembrane receptor (rhodopsin family)
302056	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
302060	Protein kinase domain
302061	Protein kinase domain
302061	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
302062	Protein kinase domain
302063	Protein kinase domain
302063	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
302064	Protein kinase domain
302064	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
302066	Protein kinase domain
302066	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
302070	Protein kinase domain
302071	Protein kinase domain
302072	Protein kinase domain
302073	Protein kinase domain
302074	Protein kinase domain
302075	Protein kinase domain
302076	Protein kinase domain
302086	7 transmembrane receptor (rhodopsin family)
302087	7 transmembrane receptor (rhodopsin family)
302087	Nop14-like family. Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production
302089	Myosin head (motor domain)
302097	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302103	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302104	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
302104	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302104	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
302105	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302106	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302107	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302109	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302110	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
302110	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302110	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
302111	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302113	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302114	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302115	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302116	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302117	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302118	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302119	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302120	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302121	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302123	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302123	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302124	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
302124	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302125	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302126	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302127	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302128	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302129	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302131	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302132	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302133	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
302135	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302136	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302137	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302138	Ribosomal protein L19e
302139	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302140	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302141	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302142	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302143	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302146	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302147	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302149	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302152	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302153	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302154	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302156	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302157	Ribosomal protein L19e
302158	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302159	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302160	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302162	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302163	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302166	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
302167	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302167	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302168	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
302170	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
302179	ADP-ribosylation factor family
302179	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
302183	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
302183	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
302205	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
302213	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302217	ENV polyprotein (coat polyprotein)
302218	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302220	7 transmembrane receptor (rhodopsin family)
302221	ENV polyprotein (coat polyprotein)
302224	ADP-ribosylation factor family
302224	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
302226	Ribosomal protein L19e
302227	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302228	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302229	Ribosomal protein L19e
302230	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302231	Ribosomal protein L19e
302232	Glutaredoxin
302233	Hsp90 protein
302235	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302236	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302237	Ribosomal protein L19e
302240	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302240	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302241	Hsp90 protein
302250	Ribosomal protein L19e
302251	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302254	7 transmembrane receptor (rhodopsin family)
302255	7 transmembrane receptor (rhodopsin family)
302259	Protein kinase domain
302260	Protein kinase domain
302261	Protein kinase domain
302262	Protein kinase domain
302263	Protein kinase domain
302267	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302270	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302272	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
302273	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
302274	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
302275	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
302276	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
302279	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
302282	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
302287	Ribosomal protein S5, C-terminal domain
302288	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
302289	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
302290	Clathrin adaptor complex small chain
302290	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
302291	ADP-ribosylation factor family
302291	Clathrin adaptor complex small chain
302291	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
302292	Mnd1 family. This family of proteins includes MND1 from S. cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair
302293	ADP-ribosylation factor family
302293	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
302294	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302294	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302296	Galactoside-binding lectin
302299	Actin
302300	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
302303	Actin
302303	Regulator of chromosome condensation (RCC1)
302304	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
302306	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
302310	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
302313	Tetraspanin family
302314	Hsp90 protein
302315	ENV polyprotein (coat polyprotein)
302316	Fructose-bisphosphate aldolase class-I
302317	HMG (high mobility group) box
302318	Phosphorylase family 2
302319	Ribosomal protein S2
302321	HMG (high mobility group) box
302324	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
302325	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
302326	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
302329	Nucleoside diphosphate kinase
302330	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
302333	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
302339	Fibrillarin
302340	Actin
302345	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302345	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302349	ENV polyprotein (coat polyprotein)
302351	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302351	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302352	Homeobox domain
302356	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
302357	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
302360	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
302360	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
302366	Zinc finger, C3HC4 type (RING finger)
302369	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
302370	Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates
302371	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
302372	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
302373	7 transmembrane receptor (rhodopsin family)
302374	Enolase, C-terminal TIM barrel domain
302375	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
302376	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
302377	7 transmembrane receptor (rhodopsin family)
302378	7 transmembrane receptor (rhodopsin family)
302380	Ribosomal protein L21e
302381	Galactoside-binding lectin
302382	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302384	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
302387	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302387	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302388	Ribosomal protein L19e
302389	HMG (high mobility group) box
302391	Ribosomal protein L19e
302392	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302393	Ribosomal protein S8e
302394	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302395	SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily
302395	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
302398	Mitochondrial carrier protein
302400	Homeobox domain
302402	Ribosomal L18ae protein family
302404	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
302405	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302406	Ribosomal protein S7e
302408	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302409	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
302410	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
302411	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
302412	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
302415	Fork head domain
302416	Ribosomal protein S5, C-terminal domain
302422	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
302423	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
302425	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
302426	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
302426	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
302428	Ribosomal L29e protein family
302429	Proliferating cell nuclear antigen, N-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA
302431	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302432	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302433	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302434	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302435	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302437	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
302438	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302439	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
302439	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
302440	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302441	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302442	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302445	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
302453	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
302454	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
302456	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
302460	Intermediate filament protein
302460	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
302462	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
302465	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
302466	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
302467	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
302468	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
302469	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
302472	NOL1/NOP2/sun family
302473	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
302477	'chromo' (CHRromatin Organization MOdifier) domain
302478	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
302479	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
302485	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
302487	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
302488	Ubiquitin carboxyl-terminal hydrolase family 2
302488	Ubiquitin carboxyl-terminal hydrolases family 2
302494	ATP synthase
302495	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
302495	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
302496	Actin
302497	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
302500	PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi
302501	Cullin family
302501	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
302502	Cullin family
302506	Quinolinate phosphoribosyl transferase, N-terminal domain. Quinolinate phosphoribosyl transferase (QPRTase) or nicotinate-nucleotide pyrophosphorylase EC:2.4.2.19 is involved in the de novo synthesis of NAD in both prokaryotes and eukaryotes. It catalyses
302508	TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme
302509	Proteasome A-type and B-type
302510	Cyclophilin type peptidyl-prolyl cis-trans isomerase
302512	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
302514	Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in
302515	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302516	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
302517	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
302518	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302518	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302521	ENV polyprotein (coat polyprotein)
302522	Cyclophilin type peptidyl-prolyl cis-trans isomerase
302523	Ribosomal protein S27
302525	Ribosomal protein L19e
302528	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
302530	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
302532	Ribosomal protein S6e
302533	Cyclophilin type peptidyl-prolyl cis-trans isomerase
302535	Elongation factor 1 gamma, conserved domain
302535	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
302536	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
302538	C2 domain
302538	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
302538	C2 domain
302538	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
302539	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
302541	DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds
302543	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
302546	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
302547	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
302547	HpcH/HpaI aldolase family. This family includes 2,4-dihydroxyhept-2-ene-1,7-dioic acid aldolase and 4-hydroxy-2-oxovalerate aldolase
302547	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
302548	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
302549	ADP-ribosylation factor family
302549	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
302550	Protein kinase domain
302551	Ribosomal protein L31e
302552	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
302553	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
302554	Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily
302555	Zn-finger in ubiquitin-hydrolases and other protein
302556	ADP-ribosylation factor family
302556	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
302556	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
302557	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
302558	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
302561	Cyclophilin type peptidyl-prolyl cis-trans isomerase
302564	Ribosomal protein L3
302566	Ubiquitin carboxyl-terminal hydrolase family 2
302566	Ubiquitin carboxyl-terminal hydrolases family 2
302567	LIM domain. This family represents two copies of the LIM structural domain
302568	NOL1/NOP2/sun family
302568	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
302571	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302572	Core histone H2A/H2B/H3/H4
302574	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea
302575	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
302576	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
302578	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
302579	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
302579	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
302579	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
302583	Fork head domain
302585	Apoptosis regulator proteins, Bcl-2 family
302586	Regulator of chromosome condensation (RCC1)
302588	Thymosin beta-4 family
302589	ENV polyprotein (coat polyprotein)
302590	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
302591	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
302591	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucle
302592	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
302595	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
302595	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
302601	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
302602	Ribosomal protein L19e
302605	PH domain. PH stands for pleckstrin homology
302606	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
302607	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
302610	Enolase, C-terminal TIM barrel domain
302610	Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain
302611	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
302613	Ribosomal protein L31e
302614	Phosphatidylinositol 3- and 4-kinase
302615	Ribosomal protein L35Ae
302617	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
302619	Ocular albinism type 1 protein
302619	7 transmembrane receptor (Secretin family)
302620	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
302625	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
302626	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
302630	F-actin capping protein alpha subunit
302635	7 transmembrane receptor (rhodopsin family)
302636	Glutaredoxin
302637	Glutaredoxin
302638	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302639	Zinc finger, C3HC4 type (RING finger)
302640	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-Co
302642	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
302643	Octicosapeptide repeat. Short motif that may bind Ca2+
302645	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302646	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302647	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
302650	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302655	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
302658	'Cold-shock' DNA-binding domain
302659	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
302659	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
302660	Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses
302666	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
302667	PH domain. PH stands for pleckstrin homology
302667	BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region
302668	Angiotensin-converting enzyme. Members of this family are dipeptidyl carboxydipeptidases (cleave carboxyl dipeptides) and most notably convert angiotensin I to angiotensin II. Many members of this family contain a tandem duplication of the 600 amino acid
302669	Eukaryotic-type carbonic anhydrase
302670	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302671	Clathrin adaptor complex small chain
302671	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
302674	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
302675	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302677	Glutamine amidotransferase class-I
302677	Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates
302677	Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold
302678	BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function
302679	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
302681	Proteasome A-type and B-type
302682	Mitochondrial carrier protein
302685	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
302686	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
302686	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
302688	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
302690	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302690	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302694	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
302694	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
302695	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
302697	Eukaryotic initiation factor 1A
302698	Ribosomal L29e protein family
302699	Hsp90 protein
302702	Nucleoside diphosphate kinase
302703	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
302703	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
302705	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
302706	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
302707	Ribosomal protein S24e
302708	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
302710	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
302711	WD domain, G-beta repeat
302713	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
302714	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
302715	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
302716	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
302719	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
302720	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
302721	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
302721	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
302722	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302723	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302725	SKIP/SNW domain. This domain is found in chromatin proteins
302728	Ribosomal L10
302729	Death domain
302740	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
302742	HMG (high mobility group) box
302743	Actin
302744	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302745	Phosphoenolpyruvate carboxykinase. Catalyses the formation of phosphoenolpyruvate by decarboxylation of oxaloacetate
302746	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302747	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302748	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302749	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302750	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
302753	GDP dissociation inhibitor
302755	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
302757	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
302760	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302761	Ribosomal protein L44
302762	WD domain, G-beta repeat
302763	Actin
302766	HMG (high mobility group) box
302768	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302769	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
302771	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
302773	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
302774	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
302775	Zinc finger, C3HC4 type (RING finger)
302779	ENV polyprotein (coat polyprotein)
302780	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302781	ENV polyprotein (coat polyprotein)
302782	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302782	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302784	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
302785	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
302786	Hsp90 protein
302787	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
302788	HMG (high mobility group) box
302789	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
302789	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
302790	Hsp90 protein
302791	Hsp90 protein
302792	HMG (high mobility group) box
302793	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
302793	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
302794	ENV polyprotein (coat polyprotein)
302795	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302795	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302796	Actin
302797	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
302798	HMG (high mobility group) box
302799	HMG (high mobility group) box
302800	Ribosomal S17
302801	HMG (high mobility group) box
302803	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
302804	Phosphomannose isomerase type I
302804	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
302806	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302806	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302807	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
302808	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
302809	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
302809	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
302811	Ets-domain
302813	7 transmembrane receptor (rhodopsin family)
302813	7 transmembrane receptor (rhodopsin family)
302816	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302816	Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15
302817	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302819	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
302820	7 transmembrane receptor (rhodopsin family)
302821	7 transmembrane receptor (rhodopsin family)
302822	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involv
302822	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
302823	7 transmembrane receptor (rhodopsin family)
302824	7 transmembrane receptor (rhodopsin family)
302825	7 transmembrane receptor (rhodopsin family)
302826	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
302827	Actin
302829	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302831	Eukaryotic porin
302835	Tropomyosin
302836	Ribosomal protein S2
302838	Eukaryotic porin
302840	HMG (high mobility group) box
302842	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
302842	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy
302845	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
302846	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
302847	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
302848	NAC domain
302851	Microtubule associated protein (MAP65/ASE1 family)
302852	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
302855	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302856	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
302856	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
302858	Core histone H2A/H2B/H3/H4
302859	Hsp90 protein
302862	NAC domain
302862	Ribosomal protein L19e
302865	Clathrin adaptor complex small chain
302867	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
302869	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
302870	Ribosomal protein S8e
302871	Oxidoreductase FAD-binding domain
302871	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
302872	Common central domain of tyrosinase. This family also contains polyphenol oxidases and some hemocyanins. Binds two copper ions via two sets of three histidines. This family is related to pfam00372
302873	Zinc finger, C3HC4 type (RING finger)
302876	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
302879	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302880	WD domain, G-beta repeat
302881	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
302886	Endomembrane protein 70
302891	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
302893	PX domain. PX domains bind to phosphoinositides
302898	Putative replicase 1 (ORF2)
302899	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
302903	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
302904	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
302914	Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase
302915	Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions
302918	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
302920	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302931	PWI domain
302931	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
302937	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
302937	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
302939	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
302940	Ribosomal protein L10
302943	Ribosomal protein L6
302944	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
302945	Fibronectin type III domain
302950	Adenylate and Guanylate cyclase catalytic domain
302952	Glutathione peroxidase
302953	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
302954	7 transmembrane receptor (rhodopsin family)
302957	7 transmembrane receptor (rhodopsin family)
302958	7 transmembrane receptor (rhodopsin family)
302961	Ribosomal protein S5, C-terminal domain
302961	Ribosomal protein S5, N-terminal domain
302962	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
302963	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
302963	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
302963	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases
302967	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
302970	Trypsin
302971	Trypsin
302971	Ribosomal L39 protein
302972	N-acetylglucosamine-6-phosphate deacetylase. This family are predicted to adopt a TIM barrel fold. This is family of N-acetylglucosamine-6-phosphate deacetylases, EC:3.5.1.25 These enzymes catalyse the reaction N-acetyl-D-glucosamine 6-phosphate + H2O <=>
302975	Synaptogyrin. This family of proteins is distantly related to pfam01284
302976	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
302978	Sulfotransferase protein
302980	Fumarylacetoacetate (FAA) hydrolase family. This family consists of fumarylacetoacetate (FAA) hydrolase, or fumarylacetoacetate hydrolase (FAH) and it also includes HHDD isomerase/OPET decarboxylase from E. coli strain W. FAA is the last enzyme in the tyr
302981	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
302983	Uncharacterized ACR, COG1579
302983	Intermediate filament protein
302983	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
302983	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
302988	C2 domain
302989	Trypsin
302990	Trypsin
302991	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
302993	HMG (high mobility group) box
302995	Metallo-beta-lactamase superfamily
303001	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
303002	Ribosomal protein L3
303003	Nucleoside diphosphate kinase
303004	Per1-like. A member of this family has been implemented in protein processing in the endoplasmic reticulum
303005	Protein of unknown function, DUF259
303007	Globin
303008	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa
303021	PX domain. PX domains bind to phosphoinositides
303021	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
303023	DAD family. Members of this family are thought to be integral membrane proteins. Some members of this family have been shown to cause apoptosis if mutated, these proteins are known as DAD for defender against death. The family also includes the epsilon su
303024	F-box domain
303024	WD domain, G-beta repeat
303025	Ribosomal protein L6
303029	Importin-beta N-terminal domain
303031	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
303032	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
303033	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
303039	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
303039	Arginyl tRNA synthetase N terminal domain. This domain is found at the amino terminus of Arginyl tRNA synthetase, also called additional domain 1 (Add-1). It is about 140 residues long and it has been suggested that this domain will be involved in tRNA re
303040	Actin
303049	HMG (high mobility group) box
303051	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
303051	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
303053	Ribosomal protein S19e
303054	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
303058	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
303059	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
303059	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
303060	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
303068	HMG (high mobility group) box
303083	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
303086	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
303088	B-box zinc finger
303088	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
303089	B-box zinc finger
303089	Zinc finger, C3HC4 type (RING finger)
303089	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
303090	Protein of unknown function, DUF258
303091	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
303093	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
303096	7 transmembrane receptor (rhodopsin family)
303105	7 transmembrane receptor (rhodopsin family)
303107	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
303113	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
303115	Ribosomal L10
303129	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
303130	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
303133	pfam02939, UcrQ, UcrQ family. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This family represents the 9.5 kDa subunit of the complex
303134	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
303135	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
303136	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
303140	Sugar (and other) transporter
303141	Cyclic nucleotide-binding domain
303141	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
303141	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
303141	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
303145	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
303146	HIT family
303148	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
303159	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. M
303160	WD domain, G-beta repeat
303162	Ribosomal protein S8
303166	Phospholipase A2 inhibitor
303167	B-box zinc finger
303167	Zinc finger, C3HC4 type (RING finger)
303167	7 transmembrane receptor (rhodopsin family)
303167	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
303168	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
303168	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
303170	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
303170	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
303170	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
303171	7 transmembrane receptor (rhodopsin family)
303171	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
303172	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
303173	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
303175	Core histone H2A/H2B/H3/H4
303175	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
303176	Core histone H2A/H2B/H3/H4
303176	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
303177	Core histone H2A/H2B/H3/H4
303178	B-box zinc finger
303178	Zinc finger, C3HC4 type (RING finger)
303178	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
303179	Connexin
303181	wnt family
303182	wnt family
303189	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
303189	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
303191	Myosin head (motor domain)
303192	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
303194	Topoisomerase DNA binding C4 zinc finger
303195	NAC domain
303201	Ubiquitin carboxyl-terminal hydrolase family 2
303201	Ubiquitin carboxyl-terminal hydrolases family 2
303201	Zn-finger in ubiquitin-hydrolases and other protein
303202	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
303203	Uncharacterized membrane protein family UPF0013. These proteins are integral membrane proteins of unknown function
303205	Sodium:solute symporter family
303206	Protein kinase domain
303209	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
303212	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
303212	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
303213	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
303213	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
303214	B-box zinc finger
303214	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
303215	WD domain, G-beta repeat
303215	High molecular weight glutenin subunit. Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin
303218	Sulfotransferase protein
303220	Sulfotransferase protein
303222	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
303226	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
303226	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
303226	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
303228	Ribosomal protein S5, C-terminal domain
303228	Ribosomal protein S5, N-terminal domain
303229	Enolase, N-terminal domain
303229	Enolase, C-terminal TIM barrel domain
303233	Ubiquitin carboxyl-terminal hydrolase family 2
303233	Ubiquitin carboxyl-terminal hydrolases family 2
303235	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
303236	PMP-22/EMP/MP20/Claudin family
303237	Mitochondrial carrier protein
303241	'chromo' (CHRromatin Organization MOdifier) domain
303241	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
303241	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
303241	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
303243	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
303244	PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localization of cystinosin to lysosomes. Howe
303245	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
303247	Protein kinase domain
303252	Lipoxygenase
303252	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
303257	Vitelline membrane outer layer protein I (VOMI). VOMI binds tightly to ovomucin fibrils of the egg yolk membrane. The structure that consists of three beta-sheets forming Greek key motifs, which are related by an internal pseudo three-fold symmetry. Furth
303260	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
303270	Ribosomal protein L15
303272	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
303272	BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems
303274	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
303275	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
303276	GTPase of unknown function
303284	Ribosomal protein L23
303284	Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a
303285	TRAF-type zinc finger
303285	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
303287	F-box domain
303289	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
303290	Gamma-glutamyltranspeptidase
303291	Sugar (and other) transporter
303291	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
303293	Ribosomal protein S27
303293	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
303294	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
303296	7 transmembrane receptor (rhodopsin family)
303297	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
303298	Rap/ran-GAP
303304	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
303304	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
303306	Pyridoxal-phosphate dependent enzyme. Members of this family are all pyridoxal-phosphate dependent enzymes. This family includes: serine dehydratase EC:4.2.1.13 P20132, threonine dehydratase EC:4.2.1.16, tryptophan synthase beta chain EC:4.2.1.20, threoni
303307	SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop
303308	SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop
303310	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
303312	Beige/BEACH domain
303312	WD domain, G-beta repeat
303314	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
303321	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
303322	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
303323	Ribosomal protein S2
303328	Protein kinase domain
303329	Protein kinase domain
303336	WD domain, G-beta repeat
303337	Intermediate filament protein
303337	Microtubule associated protein (MAP65/ASE1 family)
303337	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
303337	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
303337	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
303340	Stathmin family
303341	Intermediate filament protein
303342	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
303346	Caldesmon
303347	Helix-loop-helix DNA-binding domain
303349	Nucleoside diphosphate kinase
303349	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
303350	Zinc finger, C3HC4 type (RING finger)
303350	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
303353	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
303358	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
303361	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
303361	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
303371	Fibronectin type III domain
303372	WD domain, G-beta repeat
303374	Transaldolase
303376	Fibronectin type III domain
303377	NB-ARC domain
303377	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
303379	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
303379	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
303382	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
303384	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
303384	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
303385	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
303388	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
303389	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
303391	Myosin head (motor domain)
303391	IQ calmodulin-binding motif. Calmodulin-binding motif
303394	Ubiquitin carboxyl-terminal hydrolase family 2
303394	Ubiquitin carboxyl-terminal hydrolases family 2
303398	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
303399	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
303405	Yippee putative zinc-binding protein
303407	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be
303408	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be
303413	Animal haem peroxidase
303414	Animal haem peroxidase
303415	7 transmembrane receptor (rhodopsin family)
303419	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
303439	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
303448	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
303448	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
303449	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
303454	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
303455	Eukaryotic-type carbonic anhydrase
303462	AMP-binding enzyme
303463	AMP-binding enzyme
303466	Cyclophilin type peptidyl-prolyl cis-trans isomerase
303467	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
303469	Homeobox domain
303470	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
303470	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
303470	MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases
303471	Stathmin family
303476	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
303477	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
303477	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
303478	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
303479	Herpesvirus transcription activation factor (transactivator). This family includes EBV BRLF1 and similar ORF 50 proteins from other herpesviruses
303479	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
303479	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p
303480	Homeobox domain
303484	Homeobox domain
303485	Homeobox domain
303486	Homeobox domain
303488	Homeobox domain
303489	Homeobox domain
303490	Homeobox domain
303491	Homeobox domain
303493	PX domain. PX domains bind to phosphoinositides
303494	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
303495	Clathrin adaptor complex small chain
303496	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
303497	Oxysterol-binding protein
303497	PH domain. PH stands for pleckstrin homology
303499	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
303500	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
303501	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
303505	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
303506	ADP-ribosylation factor family
303506	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
303506	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
303516	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an
303518	Intermediate filament protein
303519	Intermediate filament protein
303520	Intermediate filament protein
303521	Intermediate filament protein
303522	Intermediate filament protein
303522	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
303523	Intermediate filament protein
303523	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
303526	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
303527	Intermediate filament protein
303527	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
303528	Intermediate filament protein
303529	Intermediate filament protein
303529	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
303530	Intermediate filament protein
303530	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
303531	Intermediate filament protein
303531	Flagellar hook-associated protein 2. The flagellar hook-associated protein 2 (HAP2 or FliD) forms the distal end of the flagella, and plays a role in mucin specific adhesion of the bacteria
303531	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
303533	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
303533	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
303535	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
303538	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
303538	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
303540	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
303540	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
303541	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
303541	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
303542	GH3 auxin-responsive promoter
303549	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
303550	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
303550	Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
303550	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge
303552	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
303553	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
303554	Octicosapeptide repeat. Short motif that may bind Ca2+
303554	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
303557	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
303559	ADP-ribosylation factor family
303559	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
303559	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
303561	L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7
303561	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
303561	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
303562	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
303565	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
303567	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
303569	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
303571	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
303575	Kinesin motor domain
303576	Ribosomal L22e protein family
303577	Acyl CoA binding protein
303583	PH domain. PH stands for pleckstrin homology
303583	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
303583	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
303584	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
303584	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
303586	wnt family
303600	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
303601	Cytochrome b561
303604	Protein kinase domain
303605	Protein kinase domain
303605	LIM domain. This family represents two copies of the LIM structural domain
303607	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
303607	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
303612	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
303614	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
303615	C2 domain
303615	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
303618	Phosphatidylinositol transfer protein
303623	Protein kinase domain
303623	Protein kinase C terminal domain
303624	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
303629	Macrophage migration inhibitory factor (MIF)
303631	Sulfatase
303632	Sulfatase
303632	Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism
303634	ADP-ribosylation factor family
303634	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
303637	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
303639	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
303651	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
303658	Ribosomal L38e protein family
303659	Kinesin motor domain
303660	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
303661	7 transmembrane receptor (rhodopsin family)
303663	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
303665	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
303665	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
303666	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
303667	Mitochondrial carrier protein
303668	ADP-ribosylation factor family
303668	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
303669	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
303670	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
303671	Fatty acid desaturase
303673	Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons
303674	Sugar (and other) transporter
303676	Mitochondrial carrier protein
303676	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
303678	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
303678	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
303678	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
303682	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
303683	B-box zinc finger
303683	Zinc finger, C3HC4 type (RING finger)
303683	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
303684	Zinc finger, C3HC4 type (RING finger)
303684	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
303685	Phosphatidylethanolamine-binding protein
303687	Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen
303689	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
303690	Domain of unknown function DUF20. This transmembrane region is found in putative permeases and predicted transmembrane proteins it has no known function. It is not clear what source suggested that these proteins may be permeases and this information shoul
303690	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa
303692	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
303694	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
303694	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
303700	Ubiquitin carboxyl-terminal hydrolase family 2
303701	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
303701	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
303702	Glycosyl hydrolase family 85. Family of endo-beta-N-acetylglucosaminidases. These enzymes work on a broad spectrum of substrates
303703	Myosin head (motor domain)
303708	Myosin head (motor domain)
303711	Myosin head (motor domain)
303713	Myosin head (motor domain)
303717	Myosin head (motor domain)
303718	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
303718	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
303723	Myosin head (motor domain)
303725	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
303728	Myosin head (motor domain)
303729	Sulfatase
303729	Metalloenzyme superfamily. This family includes phosphopentomutase and 2,3-bisphosphoglycerate-independent phosphoglycerate mutase. This family is also related to pfam00245. The alignment contains the most conserved residues that are probably involved in
303729	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
303730	'chromo' (CHRromatin Organization MOdifier) domain
303731	'chromo' (CHRromatin Organization MOdifier) domain
303732	'chromo' (CHRromatin Organization MOdifier) domain
303734	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
303736	Zinc finger, C3HC4 type (RING finger)
303738	RNA polymerase Rpb8. Rpb8 is a subunit common to the three yeast RNA polymerases, pol I, II and III. Rpb8 interacts with the largest subunit Rpb1, and with Rpb3 and Rpb11, two smaller subunits
303743	Pectinacetylesterase
303745	Sir2 family
303750	Glycosyl hydrolase family 20, catalytic domain. This domain has a TIM barrel fold
303751	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
303752	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
303753	Fork head domain
303753	pfam02906, Fe_hyd_lg_C, Iron only hydrogenase large subunit, C-terminal domain
303754	ADP-ribosylation factor family
303754	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
303755	Fructosamine kinase. This family includes eukaryotic fructosamine-3-kinase enzymes. The family also includes bacterial members that have not been characterised but probably have a similar or identical function
303755	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
303760	SH2 domain
303761	7 transmembrane receptor (metabotropic glutamate family)
303761	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
303762	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
303770	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
303772	Sugar (and other) transporter
303773	jmjC domain
303775	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
303777	AMP-binding enzyme
303778	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
303778	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
303781	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
303784	Yippee putative zinc-binding protein
303785	Fibronectin type III domain
303785	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
303786	Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines
303786	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
303787	Amino acid permease
303788	Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyne
303789	Roadblock/LC7 domain. This family includes proteins that are about 100 amino acids long and have been shown to be related. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dyne
303792	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
303793	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
303795	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
303796	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
303798	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
303801	Lysosome-associated membrane glycoprotein (Lamp)
303803	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
303809	Ribosomal protein S7e
303810	Ribosomal L15
303811	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
303811	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
303811	Dishevelled specific domain. This domain is specific to the signaling protein dishevelled. The domain is found adjacent to the PDZ domain pfam00595, often in conjunction with DEP (pfam00610) and DIX (pfam00778)
303811	DIX domain. The DIX domain is present in Dishevelled and axin. This domain is involved in homo- and hetero-oligomerisation. It is involved in the homo- oligomerisation of mouse axin. The axin DIX domain also interacts with the disheveled DIX domain. The D
303814	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
303815	Ribosomal protein L11, RNA binding domain
303815	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
303816	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
303816	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
303818	NUDIX domain
303823	Protein kinase domain
303824	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
303826	Ribosomal protein L15
303827	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi
303827	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
303828	Ets-domain
303829	HMG14 and HMG17
303829	ENV polyprotein (coat polyprotein)
303831	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
303831	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
303832	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
303834	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
303836	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
303844	Ribosomal L10
303848	Mab-21 protein
303859	Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in
303860	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
303861	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
303861	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
303862	Ribosomal protein S26e
303872	Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability
303874	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
303876	WD domain, G-beta repeat
303877	Zinc finger, C3HC4 type (RING finger)
303878	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
303879	Domain of unknown function
303882	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
303887	von Willebrand factor type D domain
303887	AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges
303888	Oxysterol-binding protein
303888	PH domain. PH stands for pleckstrin homology
303889	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
303889	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
303891	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
303892	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
303904	Zinc finger, C3HC4 type (RING finger)
303905	Appr-1"-p processing enzyme family. This domain is found in a number of otherwise unrelated proteins. This domain is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA vi
303907	WD domain, G-beta repeat
303908	RanBP1 domain
303909	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
303911	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
303912	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
303916	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
303918	TFIIE alpha subunit. The general transcription factor TFIIE has an essential role in eukaryotic transcription initiation together with RNA polymerase II and other general factors. Human TFIIE consists of two subunits TFIIE-alpha and TFIIE-beta and joins t
303919	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
303923	Ribosomal protein L21e
303923	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
303924	Tetraspanin family
303930	Ribosomal protein L6
303935	Ribosomal L39 protein
303952	TRAF-type zinc finger
303952	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
303953	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
303955	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
303956	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
303959	Core histone H2A/H2B/H3/H4
303961	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
303965	Myosin head (motor domain)
303965	Intermediate filament protein
303965	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
303965	Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown
303965	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
303965	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
303972	HMG (high mobility group) box
303977	CBL proto-oncogene N-terminal domain 1. Cbl is an adaptor protein that binds EGF receptors (or other tyrosine kinases) and SH3 domains, functioning as a negative regulator of many signaling pathways. The N-terminal domain is evolutionarily conserved, and
303979	Hydratase/decarboxylase. This family consist of various hydratases and 4-oxalocrotonate decarboxylases which are involved in the bacterial meta-cleavage pathways for degradation of aromatic compounds. 2-hydroxypentadienoic acid hydratase encoded by mhpD i
303990	Homeobox domain
303991	Homeobox domain
303992	Prothymosin/parathymosin family. Prothymosin alpha and parathymosin are two ubiquitous small acidic nuclear proteins that are thought to be involved in cell cycle progression, proliferation, and cell differentiation
303993	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
303993	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
304005	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
304008	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
304010	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
304012	Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue
304013	Fibronectin type III domain
304014	7 transmembrane receptor (Secretin family)
304014	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
304015	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
304015	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
304017	TPR Domain
304021	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
304023	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
304024	Coproporphyrinogen III oxidase
304026	7 transmembrane receptor (rhodopsin family)
304028	Protein kinase domain
304029	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
304031	NOL1/NOP2/sun family
304033	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
304034	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
304035	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
304037	ADP-ribosylation factor family
304038	Syntaxin
304039	ADP-ribosylation factor family
304039	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
304039	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
304040	Aminotransferase class-III
304041	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304042	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304044	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304045	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304046	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304047	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304048	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304052	Ornithine decarboxylase antizyme
304053	Protein kinase domain
304053	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
304056	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
304061	WD domain, G-beta repeat
304067	ribosomal L5P family C-terminus. This region is found associated with pfam00281
304068	Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps
304070	Biotin protein ligase C terminal domain. The function of this structural domain is unknown. It is found to the C terminus of the biotin protein ligase catalytic domain pfam01317
304070	Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to for
304071	Helix-loop-helix DNA-binding domain
304071	PAC motif. PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold
304071	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
304073	PMP-22/EMP/MP20/Claudin family
304075	TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s
304078	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
304079	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
304084	C2 domain
304085	Ribosomal protein S6
304085	Pentapeptide repeats (8 copies). These repeats are found in many cyanobacterial proteins. The repeats were first identified in hglK. The function of these repeats is unknown. The structure of this repeat has been predicted to be a beta-helix. The repeat c
304085	Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibers in association with hair keratin intermediate filaments. This family has been divided up int
304087	Sodium:solute symporter family
304088	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
304089	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
304092	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
304092	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea
304093	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
304102	Galactose binding lectin domain
304103	Helix-loop-helix DNA-binding domain
304109	Raf-like Ras-binding domain
304109	PH domain. PH stands for pleckstrin homology
304109	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
304112	7 transmembrane receptor (rhodopsin family)
304116	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
304124	PMP-22/EMP/MP20/Claudin family
304125	PMP-22/EMP/MP20/Claudin family
304127	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
304127	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
304135	Thrombospondin type 1 domain
304135	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
304135	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
304143	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
304150	Ubiquitin carboxyl-terminal hydrolase family 2
304168	Eukaryotic initiation factor 4E
304174	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
304181	Protein kinase domain
304182	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304183	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
304185	ATP synthase alpha/beta chain, C terminal domain
304185	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
304191	Helix-loop-helix DNA-binding domain
304192	Ribosomal protein S7e
304193	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
304196	RanBP1 domain
304197	Lectin C-type domain. This family includes both long and short form C-type
304199	Lectin C-type domain. This family includes both long and short form C-type
304200	Lectin C-type domain. This family includes both long and short form C-type
304201	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
304208	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
304209	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
304210	7 transmembrane receptor (metabotropic glutamate family)
304211	'chromo' (CHRromatin Organization MOdifier) domain
304212	'chromo' (CHRromatin Organization MOdifier) domain
304215	START domain
304215	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
304216	ENV polyprotein (coat polyprotein)
304224	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
304228	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
304229	7 transmembrane receptor (metabotropic glutamate family)
304229	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
304231	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304232	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304238	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
304239	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
304240	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
304242	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304243	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
304244	'chromo' (CHRromatin Organization MOdifier) domain
304255	Ribosomal protein S19e
304264	Ribosomal L18ae protein family
304265	Protein kinase domain
304265	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
304266	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
304266	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
304266	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
304267	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
304267	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
304268	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
304270	Protein kinase domain
304271	Zinc finger, C3HC4 type (RING finger)
304273	ENV polyprotein (coat polyprotein)
304273	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
304273	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
304273	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
304274	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
304275	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
304278	PPR repeat. This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilisation. Thi
304279	Ribosomal protein S8
304280	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
304285	PH domain. PH stands for pleckstrin homology
304287	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
304287	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
304289	Lipase (class 3)
304290	ER lumen protein retaining receptor
304291	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
304293	Transposase. Transposase proteins are necessary for efficient DNA transposition. Contains transposases for IS204, IS1001, IS1096 and IS1165
304293	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304293	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
304297	Fibronectin type III domain
304299	DIL domain. The DIL domain has no known function
304299	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
304300	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to
304301	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
304301	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
304305	Luciferase-like monooxygenase
304305	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
304307	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
304307	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
304310	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
304310	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
304314	Uncharacterized ACR, COG1579
304314	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
304314	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
304319	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
304322	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
304323	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in comple
304325	Domain of unknown function
304328	7 transmembrane receptor (rhodopsin family)
304333	Cyclic nucleotide-binding domain
304336	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304337	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304338	Ribosomal protein S8e
304340	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304340	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
304341	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
304341	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304341	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
304342	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
304342	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
304343	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
304344	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
304346	Metallo-beta-lactamase superfamily
304347	7 transmembrane receptor (metabotropic glutamate family)
304347	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
304348	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
304358	O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine
304361	O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine
304368	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
304369	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
304370	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
304371	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
304372	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
304373	F-box domain
304374	F-box domain
304375	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
304378	GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein
304379	MAM domain. An extracellular domain found in many receptors
304380	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
304385	Clathrin adaptor complex small chain
304387	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
304388	PMP-22/EMP/MP20/Claudin family
304396	WD domain, G-beta repeat
304399	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
304401	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
304402	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
304403	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
304406	ubiE/COQ5 methyltransferase family
304407	PMP-22/EMP/MP20/Claudin family
304423	Flavodoxin
304429	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
304436	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
304438	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
304445	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
304448	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
304461	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
304466	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
304467	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
304468	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
304468	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
304472	Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons
304477	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
304479	Helix-loop-helix DNA-binding domain
304486	Prismane/CO dehydrogenase family. This family includes both hybrid-cluster proteins and the beta chain of carbon monoxide dehydrogenase. The hybrid-cluster proteins contain two Fe/S centers - a [4Fe-4S] cubane cluster, and a hybrid [4Fe-2S-2O] cluster. Th
304494	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
304495	F-box domain
304495	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
304497	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
304497	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
304500	Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle
304502	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
304503	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
304506	Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates
304510	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells
304512	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
304514	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
304521	Zinc finger, C3HC4 type (RING finger)
304524	Protein kinase domain
304525	Protein kinase domain
304527	Acyl CoA binding protein
304528	Rad17 cell cycle checkpoint protein
304528	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
304529	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
304531	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
304532	Protein kinase domain
304539	Sir2 family
304540	Cytochrome c oxidase subunit VIa
304541	Glu-tRNAGln amidotransferase C subunit. This is a family of Glu-tRNAGln amidotransferase C subunits. The Glu-tRNA Gln amidotransferase enzyme itself is an important translational fidelity mechanism replacing incorrectly charged Glu-tRNAGln with the correc
304542	ubiE/COQ5 methyltransferase family
304545	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
304546	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
304547	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
304550	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
304554	CUB domain
304554	Sushi domain (SCR repeat)
304560	TPR Domain
304567	tRNA pseudouridine synthase. Involved in the formation of pseudouridine at the anticodon stem and loop of transfer-RNAs Pseudouridine is an isomer of uridine (5-(beta-D-ribofuranosyl) uracil, and id the most abundant modified nucleoside found in all cellu
304568	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
304569	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
304569	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
304570	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
304570	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
304571	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
304573	Ribosomal L10
304573	DNA polymerase family B, exonuclease domain. This domain has 3' to 5' exonuclease activity and adopts a ribonuclease H type fold
304573	DNA polymerase family B. This region of DNA polymerase B appears to consist of more than one structural domain, possibly including elongation, DNA-binding and dNTP binding activities
304574	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304575	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
304576	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304576	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
304577	Uracil DNA glycosylase superfamily
304579	Ubiquitin carboxyl-terminal hydrolase family 2
304579	Ubiquitin carboxyl-terminal hydrolases family 2
304579	Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this f
304580	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
304582	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
304588	Synaptobrevin
304590	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
304590	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
304591	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
304593	Ribosomal protein S6e
304599	DNA primase small subunit. DNA primase synthesizes the RNA primers for the Okazaki fragments in lagging strand DNA synthesis. DNA primase is a heterodimer of large and small subunits
304601	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
304601	DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors)
304601	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
304601	Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has
304605	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
304608	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
304608	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
304608	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases
304610	7 transmembrane receptor (rhodopsin family)
304612	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
304614	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
304615	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304616	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
304617	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304619	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
304620	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304622	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
304624	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
304626	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304627	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304628	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304630	Ribosomal protein L21e
304632	7 transmembrane receptor (rhodopsin family)
304633	7 transmembrane receptor (rhodopsin family)
304634	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
304638	Ribosomal L22e protein family
304639	S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure
304642	Zinc finger, C3HC4 type (RING finger)
304645	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
304645	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
304646	Calreticulin family
304649	Sterile alpha motif (SAM)/Pointed domain
304649	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
304657	Ribonucleotide reductase, small chain
304658	Ribonucleotide reductase, small chain
304663	Helix-loop-helix DNA-binding domain
304667	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anch
304670	Importin-beta N-terminal domain
304671	SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop
304672	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
304673	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
304676	S-adenosylmethionine synthetase, central domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold
304676	S-adenosylmethionine synthetase, C-terminal domain. The three domains of S-adenosylmethionine synthetase have the same alpha+beta fold
304687	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
304688	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
304689	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
304690	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
304692	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
304693	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
304694	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
304697	Cadherin domain
304697	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
304698	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
304699	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
304701	Cadherin domain
304705	Protein kinase domain
304705	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
304707	ENV polyprotein (coat polyprotein)
304709	Myosin head (motor domain)
304709	Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown
304709	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
304710	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
304711	Ribosomal protein L13
304712	Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is
304715	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
304715	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
304716	Protein kinase domain
304717	Glutamine amidotransferase class-I
304718	ERCC4 domain. This domain is predicted to be a nuclease domain
304720	Pyridoxal-dependent decarboxylase conserved domain
304721	Pyridoxal-dependent decarboxylase conserved domain
304725	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
304727	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
304733	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
304734	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
304735	ENV polyprotein (coat polyprotein)
304737	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304739	Hsp90 protein
304742	Homeobox domain
304749	Prolyl oligopeptidase family
304749	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
304754	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi
304754	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
304756	Ribosomal protein S5, N-terminal domain
304756	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
304756	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
304760	HNH endonuclease
304761	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
304761	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
304765	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
304766	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
304769	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
304769	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
304770	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
304775	Protein kinase domain
304776	Protein kinase domain
304778	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
304778	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
304779	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
304783	ADP-ribosylation factor family
304783	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
304783	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
304784	Sulfate transporter family. Mutations may lead to several human diseases
304784	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
304784	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
304786	Ets-domain
304791	Protein kinase domain
304794	Protein kinase domain
304798	p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding dom
304801	pfam02892, zf-BED, BED zinc finger
304805	Oxidoreductase FAD-binding domain
304805	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
304808	Ribosomal protein S7e
304812	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
304813	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
304815	Ets-domain
304815	Sterile alpha motif (SAM)/Pointed domain
304816	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
304817	Importin-beta N-terminal domain
304818	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
304822	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
304823	Fibronectin type III domain
304825	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
304829	Ribosomal protein L6
304831	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
304838	Uncharacterised protein family (UPF0123). This family of proteins are uncharacterised, they contain five CXXC motifs
304838	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
304846	Ribosomal protein L23
304847	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
304849	ENV polyprotein (coat polyprotein)
304850	Zinc finger, C3HC4 type (RING finger)
304851	Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat
304851	N2,N2-dimethylguanosine tRNA methyltransferase. This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA. The TRM1 gene of Saccharomyces cerevisiae is necessary for the N2,N2-dimethylguanosine modification of both mitochondrial and cytoplasmic tRNAs. The e
304852	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
304853	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
304855	Ribosomal protein S5, C-terminal domain
304858	Laminin N-terminal (Domain VI)
304860	Dihydrodipicolinate synthetase family. This family has a TIM barrel structure
304868	Ribosomal L29e protein family
304871	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
304871	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
304872	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
304873	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
304877	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
304878	Homeobox domain
304878	LIM domain. This family represents two copies of the LIM structural domain
304879	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
304880	LIM domain. This family represents two copies of the LIM structural domain
304886	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
304888	Fibrinogen beta and gamma chains, C-terminal globular domain
304889	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
304891	SH2 domain
304892	Lectin C-type domain. This family includes both long and short form C-type
304893	C2 domain
304893	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
304893	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
304894	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
304897	ATP synthase subunit C
304899	Ribosomal protein S5, C-terminal domain
304905	Zinc finger, C3HC4 type (RING finger)
304913	Fibronectin type III domain
304913	Fibrinogen beta and gamma chains, C-terminal globular domain
304914	Phosphotyrosine interaction domain (PTB/PID)
304915	BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems
304916	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
304917	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
304918	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
304918	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
304919	pfam02938, GAD, GAD domain. This domain is found in some members of the GatB and aspartyl tRNA synthetases
304919	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
304919	tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt
304919	OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl -tRNA synthetases cata
304920	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
304920	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
304922	Hsp90 protein
304922	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
304922	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
304922	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
304935	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
304936	TIP41-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 interacts with TAP42 and negatively regulates the TOR signaling pathway
304938	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
304941	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
304941	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
304944	Phosphoribulokinase / Uridine kinase family
304947	Homeobox domain
304947	PBX domain. The PBX domain is a bipartite acidic domain
304950	PBX domain. The PBX domain is a bipartite acidic domain
304951	Nuf2 family. Members of this family are components of the mitotic spindle. It has been shown that Nuf2 from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle. An arabidopsis protein ha
304954	UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate urid
304956	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
304957	SH2 domain
304958	SH2 domain
304960	Olfactomedin-like domain
304963	HSF-type DNA-binding
304963	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
304964	C-5 cytosine-specific DNA methylase
304971	Eukaryotic porin
304972	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
304973	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
304974	Macrophage migration inhibitory factor (MIF)
304978	WD domain, G-beta repeat
304978	Coatomer WD associated domain
304980	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
304981	Ribosomal protein S2
304983	Calponin family repeat
304983	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
304985	7 transmembrane receptor (rhodopsin family)
304987	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
304987	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
304988	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
304988	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
304989	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
304989	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
304990	7 transmembrane receptor (rhodopsin family)
304991	7 transmembrane receptor (rhodopsin family)
304993	Actin
304998	7 transmembrane receptor (rhodopsin family)
304999	Repeat in ubiquitin-activating (UBA) protein
304999	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
305000	XPG I-region
305000	XPG N-terminal domain
305008	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
305009	NAC domain
305013	Ribosomal protein L31e
305019	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
305023	Fatty acid desaturase
305025	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
305025	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
305025	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
305031	BRO1-like domain. This functionally uncharacterised domain is found in a number of signal transduction proteins, including Rhophilin and BRO1
305034	MIT domain
305034	Protein kinase domain
305034	PX domain. PX domains bind to phosphoinositides
305035	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
305039	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
305045	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
305047	Ribosomal protein L21e
305050	Ribosomal protein L31e
305055	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
305057	Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised
305058	Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised
305063	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
305064	Protein-tyrosine phosphatase
305064	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
305071	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
305072	14-3-3 protein
305073	WD domain, G-beta repeat
305074	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
305078	Laminin N-terminal (Domain VI)
305078	Giardia variant-specific surface protein
305078	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
305078	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
305083	Kinesin motor domain
305085	ubiE/COQ5 methyltransferase family
305087	Ribosomal protein L6
305091	Protein kinase domain
305092	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
305096	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
305099	HMG (high mobility group) box
305104	Thrombospondin N-terminal -like domain
305104	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
305106	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
305107	7 transmembrane receptor (metabotropic glutamate family)
305108	Ribosomal protein L13
305109	7 transmembrane receptor (metabotropic glutamate family)
305109	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
305111	7 transmembrane receptor (metabotropic glutamate family)
305111	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
305111	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
305112	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
305120	Domain of Unknown Function (DUF408)
305123	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
305124	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
305124	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
305138	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
305138	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
305139	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
305139	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
305139	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
305142	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
305149	NUDIX domain
305150	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
305161	ENV polyprotein (coat polyprotein)
305162	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
305162	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
305166	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
305167	UbiA prenyltransferase family
305169	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
305171	Tesmin/TSO1-like CXC domain. This family includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family is called a CXC domain in
305178	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
305180	Ribosomal protein L21e
305181	Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold
305181	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
305181	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
305185	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
305188	Ribosomal S3Ae family
305194	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
305194	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
305195	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
305195	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
305198	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
305202	Ribosomal protein S19e
305203	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
305211	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
305216	ENV polyprotein (coat polyprotein)
305217	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
305224	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
305227	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
305232	Ribosomal protein L11, RNA binding domain
305232	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
305233	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
305234	Starch binding domain
305235	NAD:arginine ADP-ribosyltransferase
305236	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
305236	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
305237	Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in
305239	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
305240	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
305240	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
305241	Protein kinase domain
305241	SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop
305246	Choloylglycine hydrolase. This family of choloylglycine hydrolases includes conjugated bile acid hydrolase (CBAH) EC:3.5.1.24, penicillin acylase EC:3.5.1.11 and acid ceramidase EC:3.5.1.23 which cleave carbon-nitrogen bonds, other than peptide bonds, in
305246	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
305246	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
305248	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain
305248	pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
305251	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
305253	Thrombospondin type 1 domain
305253	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
305256	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
305258	Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown
305259	MAPEG family. This family is has been called MAPEG (Membrane Associated Proteins in Eicosanoid and Glutathione metabolism)
305262	Cyclophilin type peptidyl-prolyl cis-trans isomerase
305263	Hsp90 protein
305264	UDP-glucoronosyl and UDP-glucosyl transferase
305264	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
305265	Trypsin
305266	Ribosomal protein L23
305267	Trypsin
305268	Repeat in ubiquitin-activating (UBA) protein
305268	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
305269	PH domain. PH stands for pleckstrin homology
305272	Actin
305273	Ribosomal S17
305274	Cytochrome C and Quinol oxidase polypeptide I
305275	Ribosomal protein L6
305276	Ribosomal protein S19e
305277	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-stero
305279	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
305280	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
305282	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
305284	NAC domain
305290	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
305291	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-stero
305293	Homeobox domain
305302	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
305309	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa
305311	SH2 domain
305311	Protein kinase domain
305311	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
305314	E1-E2 ATPase
305317	Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold
305317	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
305317	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
305317	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
305317	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
305325	E1-E2 ATPase
305331	Homeobox domain
305332	LIM domain. This family represents two copies of the LIM structural domain
305338	Phosphotyrosine interaction domain (PTB/PID)
305338	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
305340	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
305341	ADP-ribosylation factor family
305341	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
305342	Smr domain. This family includes the Smr (Small MutS Related) proteins, and the C-terminal region of the MutS2 protein. It has been suggested that this domain interacts with the MutS1 protein in the case of Smr proteins and with the N-terminal MutS relate
305344	Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14
305348	Radical SAM superfamily
305350	Mitochondrial carrier protein
305351	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
305351	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
305353	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
305353	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
305354	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
305354	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
305355	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
305358	Ribosomal protein L24e
305358	Ribosomal L29e protein family
305365	Mitochondrial carrier protein
305367	PH domain. PH stands for pleckstrin homology
305367	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
305367	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
305367	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
305371	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
305373	HMG (high mobility group) box
305375	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
305384	7 transmembrane receptor (rhodopsin family)
305390	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
305392	Chromosome condensation protein 3, C-terminal region. Cnd3 is a Member of the five subunit condensin complex. Each subunit is essential for mitotic condensation
305399	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
305400	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
305400	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
305400	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
305405	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
305408	7 transmembrane receptor (Secretin family)
305408	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
305408	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
305417	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
305417	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, b
305422	HesB-like domain. This family includes HesB which may be involved in nitrogen fixation
305423	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
305424	F-box domain
305426	Sugar (and other) transporter
305427	Protein of unknown function, DUF270
305427	Protein of unknown function, DUF270
305428	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
305428	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
305431	Protein kinase domain
305434	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
305434	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
305438	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
305440	Villin headpiece domain
305441	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
305445	Homeobox domain
305449	Sugar (and other) transporter
305450	SH2 domain
305450	PH domain. PH stands for pleckstrin homology
305451	Ribosomal protein L21e
305454	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
305456	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
305456	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
305456	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
305462	Proteasome A-type and B-type
305464	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
305466	Phosphatidylserine decarboxylase. This is a family of phosphatidylserine decarboxylases, EC:4.1.1.65. These enzymes catalyse the reaction: Phosphatidyl-L-serine <=> phosphatidylethanolamine + CO2. Phosphatidylserine decarboxylase plays a central role in t
305470	GTP1/OBG family
305470	GTPase of unknown function
305470	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
305470	TGS domain. The TGS domain is named after ThrRS, GTPase, and SpoT. Interestingly, TGS domain was detected also at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochae
305471	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
305472	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
305475	Oxysterol-binding protein
305477	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
305478	Zinc finger, C3HC4 type (RING finger)
305479	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
305479	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
305481	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
305482	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
305484	CUE domain. Domain that may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two protein of the IL-1 signal transduction pathway, tollip and TAB2
305487	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
305491	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
305492	Ribosomal protein S24e
305494	Zinc carboxypeptidase
305494	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
305500	SH2 domain
305500	PH domain. PH stands for pleckstrin homology
305502	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
305506	eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues
305509	Adenylate and Guanylate cyclase catalytic domain
305510	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
305514	Ribosomal protein S8
305517	Ribosomal protein L6
305522	SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop
305539	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl
305540	Sodium:dicarboxylate symporter family
305541	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
305545	ENV polyprotein (coat polyprotein)
305547	Galactoside-binding lectin
305550	Ribosomal protein L21e
305556	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
305571	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
305574	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
305578	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
305581	Peroxin 13, N-terminal domain. Both termini of the Peroxin-13 are oriented to the cytosol. Peroxin-13 is required for peroxisomal association of peroxin-14
305581	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
305584	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
305584	Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has
305586	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to
305605	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
305605	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
305606	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
305606	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
305610	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes.
305613	PH domain. PH stands for pleckstrin homology
305615	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
305615	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
305619	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
305621	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
305622	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
305622	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
305622	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
305622	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
305626	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
305627	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
305627	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
305629	7 transmembrane receptor (metabotropic glutamate family)
305630	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
305630	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
305631	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
305632	Ribosomal S3Ae family
305635	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
305639	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
305640	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
305641	Myosin head (motor domain)
305642	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
305644	WD domain, G-beta repeat
305668	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
305669	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
305670	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
305675	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
305676	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24
305679	Vinculin family
305681	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
305683	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
305684	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
305685	ubiE/COQ5 methyltransferase family
305685	O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production
305692	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
305692	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
305694	Glutaredoxin
305696	Glutaredoxin
305705	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
305706	ENV polyprotein (coat polyprotein)
305709	Transcription factor S-II (TFIIS)
305710	Protein kinase domain
305711	7 transmembrane receptor (rhodopsin family)
305712	Enolase, N-terminal domain
305712	Enolase, C-terminal TIM barrel domain
305722	Eukaryotic-type carbonic anhydrase
305733	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
305734	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
305735	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
305750	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
305750	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
305751	ADP-ribosylation factor family
305751	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
305755	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
305756	Enolase, C-terminal TIM barrel domain
305760	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
305761	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
305764	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
305764	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
305766	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
305766	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
305767	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
305767	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
305767	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
305772	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
305775	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
305776	ENV polyprotein (coat polyprotein)
305777	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
305778	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
305779	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
305781	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
305787	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
305792	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
305793	Protein kinase domain
305793	PX domain. PX domains bind to phosphoinositides
305794	Uncharacterized protein PaaI, COG2050
305794	MaoC like domain. The MaoC protein is found to share similarity with a wide variety of enzymes
305795	Lipase
305795	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
305795	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to
305797	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
305798	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
305799	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
305799	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
305800	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
305800	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
305801	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
305805	7 transmembrane receptor (rhodopsin family)
305806	Glutaredoxin
305811	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
305811	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL
305813	Polyprenyl synthetase
305814	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
305816	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera
305820	Core histone H2A/H2B/H3/H4
305824	Enolase, N-terminal domain
305824	Enolase, C-terminal TIM barrel domain
305826	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
305827	WD domain, G-beta repeat
305827	HMG (high mobility group) box
305828	SH2 domain
305830	F-box domain
305839	Ribosomal protein L23
305839	Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a
305840	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
305841	7 transmembrane receptor (rhodopsin family)
305841	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
305842	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
305843	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
305843	ATP synthase (F/14-kDa) subunit. This family includes 14-kDa subunit from vATPases, which is in the peripheral catalytic part of the complex. The family also includes archaebacterial ATP synthase subunit F
305844	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
305846	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
305850	C2 domain
305851	metallopeptidase family M24
305853	ADP-ribosylation factor family
305853	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
305854	7 transmembrane receptor (rhodopsin family)
305858	Homeobox domain
305858	Otx1 transcription factor
305860	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
305861	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
305863	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
305866	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
305867	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
305868	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
305869	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
305870	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
305873	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
305874	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
305875	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
305880	CUB domain
305886	Sugar (and other) transporter
305886	Sugar (and other) transporter
305888	Homeobox domain
305889	Adenylate cyclase. One member was found to complement mutants of E. coli. cya. This protein has been shown to be an adenylate kinase
305892	Ribosomal S17
305896	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved trypt
305898	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
305899	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
305902	Trypsin
305903	Trypsin
305904	eRF1 domain 1. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
305904	eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
305904	eRF1 domain 2. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
305906	Trypsin
305908	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
305910	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
305915	Gag polyprotein, inner coat protein p12. The retroviral p12 is a virion structural protein. p12 is proline rich. The function carried out by p12 in assembly and replication is unknown. p12 is associated with pathogenicity of the virus
305916	Ribosomal protein L23
305917	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
305922	Protein kinase domain
305923	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
305930	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
305931	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
305932	Cytidine and deoxycytidylate deaminase zinc-binding region
305934	Ribosomal protein L23
305938	PH domain. PH stands for pleckstrin homology
305938	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
305938	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
305938	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
305938	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
305943	ADP-ribosylation factor family
305943	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
305947	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
305950	Ribosomal protein L23
305951	WD domain, G-beta repeat
305951	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
305956	WD domain, G-beta repeat
305956	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
305957	Formamidopyrimidine-DNA glycosylase
305958	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
305964	Peptide methionine sulfoxide reductase. This enzyme repairs damaged proteins. Methionine sulfoxide in proteins is reduced to methionine
305967	Kinesin motor domain
305967	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
305968	Homeobox domain
305974	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
305977	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
305979	Ribosomal L29e protein family
305980	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
305985	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
305985	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
305986	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
305986	Pentapeptide repeats (8 copies). These repeats are found in many cyanobacterial proteins. The repeats were first identified in hglK. The function of these repeats is unknown. The structure of this repeat has been predicted to be a beta-helix. The repeat c
305986	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
305988	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
305990	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
305993	ENV polyprotein (coat polyprotein)
305993	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
305996	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
305998	Ribosomal S17
305999	Homeobox domain
306000	Mitochondrial carrier protein
306001	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
306004	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
306004	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
306005	Phosphatidylethanolamine-binding protein
306006	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
306008	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
306009	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
306009	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
306010	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
306011	Ribosomal L10
306011	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
306011	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
306013	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
306013	EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. This presumed domain has no known function, b
306014	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
306016	Ribosomal protein L31e
306020	Ribosomal protein L21e
306022	Fibronectin type III domain
306026	Mitochondrial carrier protein
306038	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
306044	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
306044	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
306055	Cadherin domain
306066	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
306071	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
306071	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
306072	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
306073	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
306073	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
306079	Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14
306081	Cadherin domain
306085	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
306091	Cadherin domain
306094	ENV polyprotein (coat polyprotein)
306097	HMG14 and HMG17
306100	Elongation factor 1 gamma, conserved domain
306100	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
306103	RNB-like protein. The function of this region of similarity is uncertain
306104	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
306104	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
306114	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
306115	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
306115	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
306117	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
306118	Phosphotyrosine interaction domain (PTB/PID)
306126	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
306127	MmgE/PrpD family. This family includes 2-methylcitrate dehydratase EC:4.2.1.79 (PrpD) that is required for propionate catabolism. It catalyses the third step of the 2-methylcitric acid cycle
306129	F-box domain
306138	Ribosomal protein S15
306138	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
306147	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
306152	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
306157	Glypican
306160	Ribosomal L10
306163	Glypican
306164	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
306164	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
306164	Polysaccharide biosynthesis protein. This is a family of diverse bacterial polysaccharide biosynthesis proteins including the CapD protein, WalL protein mannosyl-transferase and several putative epimerases (e.g. WbiI)
306164	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
306166	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
306166	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
306168	HMG (high mobility group) box
306169	PMP-22/EMP/MP20/Claudin family
306179	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
306181	Ribosomal protein L21e
306183	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
306184	PH domain. PH stands for pleckstrin homology
306184	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
306185	Protein kinase domain
306186	PH domain. PH stands for pleckstrin homology
306188	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
306190	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
306191	7 transmembrane receptor (rhodopsin family)
306192	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
306193	7 transmembrane receptor (rhodopsin family)
306194	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
306195	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
306196	Endomembrane protein 70
306197	Endomembrane protein 70
306201	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
306203	PX domain. PX domains bind to phosphoinositides
306204	Filamin/ABP280 repeat
306205	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
306206	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
306207	Ribosomal L29e protein family
306208	Protein kinase domain
306210	Glutaredoxin
306212	Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins
306214	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
306222	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
306223	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
306226	Homeobox domain
306231	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
306232	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
306243	Cache domain
306244	Cache domain
306248	Ribosomal protein L34e
306249	Polyprenyl synthetase
306252	Protein kinase domain
306253	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
306257	Ubiquitin carboxyl-terminal hydrolase, family 1
306259	Ribosomal protein S24e
306260	MIT domain
306260	Calpain family cysteine protease
306260	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by
306262	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
306264	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
306268	QXW lectin repeat
306268	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
306270	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
306270	Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mous
306271	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
306274	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
306274	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
306279	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
306280	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
306280	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
306281	Protein-tyrosine phosphatase
306281	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
306283	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
306285	C2 domain
306285	C2 domain
306288	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
306291	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
306292	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
306307	7 transmembrane receptor (rhodopsin family)
306312	Ribosomal protein L6e
306324	Tetraspanin family
306328	Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region
306328	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
306330	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
306333	SH2 domain
306334	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306335	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306338	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
306347	Occludin/ELL family
306351	Transforming growth factor beta like domain
306353	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
306357	7 transmembrane receptor (rhodopsin family)
306357	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
306357	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
306358	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306359	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306360	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306361	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306363	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306364	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306376	SH2 domain
306384	PH domain. PH stands for pleckstrin homology
306388	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
306398	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
306399	Zinc carboxypeptidase
306400	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
306402	HMG (high mobility group) box
306403	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
306404	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an
306405	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
306406	Ribosomal S3Ae family
306409	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
306409	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
306412	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
306415	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
306417	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
306419	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
306421	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
306421	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
306421	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
306421	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
306423	Cyclophilin type peptidyl-prolyl cis-trans isomerase
306425	Eukaryotic ribosomal protein L18
306428	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
306431	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
306431	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
306432	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
306436	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
306439	Myelin proteolipid protein (PLP or lipophilin)
306439	Myelin proteolipid protein (PLP or lipophilin)
306441	WD domain, G-beta repeat
306441	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
306444	Poly-adenylate binding protein, unique domain
306444	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
306447	Ribosomal L10
306451	Macrophage migration inhibitory factor (MIF)
306454	PMP-22/EMP/MP20/Claudin family
306456	Uncharacterized ACR, COG1565
306456	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
306460	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
306462	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
306467	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
306467	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
306471	PX domain. PX domains bind to phosphoinositides
306471	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
306474	Ribosomal protein S26e
306476	Ribosomal L29e protein family
306477	Domain of unknown function UPF0099. This domain has no known function
306487	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
306496	Macrophage scavenger receptor
306496	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
306500	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
306504	ubiE/COQ5 methyltransferase family
306505	Core histone H2A/H2B/H3/H4
306505	Zinc finger, C3HC4 type (RING finger)
306505	ATP-dependent protease La (LON) domain
306506	Protein kinase domain
306509	Ribosomal protein L21e
306511	Ribosomal protein L6
306512	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
306515	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
306516	TFIIE beta subunit core domain. General transcription factor TFIIE consists of two subunits, TFIIE alpha pfam02002 and TFIIE beta. TFIIE beta has been found to bind to the region where the promoter starts to open to be single-stranded upon transcription i
306527	Protein-tyrosine phosphatase
306527	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
306535	Nucleoside diphosphate kinase
306540	Aminotransferase class I and II
306541	C2 domain
306544	Uncharacterized protein family UPF0001
306544	Prismane/CO dehydrogenase family. This family includes both hybrid-cluster proteins and the beta chain of carbon monoxide dehydrogenase. The hybrid-cluster proteins contain two Fe/S centers - a [4Fe-4S] cubane cluster, and a hybrid [4Fe-2S-2O] cluster. Th
306548	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
306549	Protein kinase domain
306551	Calcium-activated BK potassium channel alpha subunit
306551	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
306552	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
306554	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera
306555	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
306562	Intermediate filament protein
306562	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
306562	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
306562	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
306564	Trypsin
306564	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
306567	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
306571	MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases
306571	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
306574	Mitochondrial carrier protein
306576	Protein kinase domain
306577	Protein kinase domain
306579	Mammalian defensin
306579	Defensin propeptide
306581	NAC domain
306582	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
306583	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
306586	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
306589	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
306592	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
306594	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
306597	HMG (high mobility group) box
306600	E1-E2 ATPase
306605	C2 domain
306606	PH domain. PH stands for pleckstrin homology
306606	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
306606	BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region
306607	Uncharacterized ACR, COG1579
306608	Trypsin
306608	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
306608	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy
306609	CUB domain
306613	CUB domain
306613	Sushi domain (SCR repeat)
306624	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
306626	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
306627	ADP-ribosylation factor family
306627	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
306630	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
306636	Ephrin
306640	Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase
306643	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
306655	Homeobox domain
306657	Homeobox domain
306659	Homeobox domain
306663	Thrombospondin type 1 domain
306663	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
306666	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
306667	ENV polyprotein (coat polyprotein)
306669	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
306672	PAP/25A associated domain
306672	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to
306674	Cyclophilin type peptidyl-prolyl cis-trans isomerase
306675	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306677	ENV polyprotein (coat polyprotein)
306678	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306679	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306680	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306681	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306682	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
306683	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
306685	Trypsin
306690	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
306692	Ribosomal protein L31e
306694	Protein-tyrosine phosphatase
306694	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
306698	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
306698	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
306698	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
306705	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306706	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306708	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306709	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306710	Ribosomal protein L24e
306711	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306714	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306715	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306717	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306722	Death domain
306729	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
306736	Kinesin motor domain
306739	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
306749	Mitochondrial carrier protein
306750	F-box domain
306759	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an
306760	PTB domain (IRS-1 type)
306761	Trypsin
306761	Fibronectin type II domain
306761	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
306761	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
306762	Profilin
306764	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
306764	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
306764	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
306767	ADP-ribosylation factor family
306771	Tetraspanin family
306782	Protein kinase domain
306782	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
306782	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
306782	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
306786	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
306790	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
306790	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
306792	Actin
306792	7 transmembrane receptor (rhodopsin family)
306792	C. elegans Sre G protein-coupled chemoreceptor. Caenorhabditis elegans Sre proteins are candidate chemosensory receptors. There are four main recognized groups of such receptors: Odr-10, Sra, Sro, and Srg. Sre (this family), Sra pfam02117 and Srb pfam0217
306795	ENV polyprotein (coat polyprotein)
306798	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
306800	Transcription initiation factor IIF, beta subunit. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIF (TFIIF) is a tetramer of two beta subun
306802	Protein kinase domain
306803	Protein kinase domain
306803	HMG (high mobility group) box
306804	tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included
306806	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
306807	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
306811	Protein kinase domain
306812	Kinesin motor domain
306812	Protein kinase C terminal domain
306814	jmjC domain
306816	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306818	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
306819	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
306825	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
306827	Core histone H2A/H2B/H3/H4
306840	Sir2 family
306840	NAD(P) transhydrogenase beta subunit. This family corresponds to the beta subunit of NADP transhydrogenase in prokaryotes, and either the protein N- or C terminal in eukaryotes. The domain is often found in conjunction with pfam01262. Pyridine nucleotide
306842	Translation initiation factor SUI1
306842	Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilise NADP or NAD
306842	pfam02894, GFO_IDH_MocA_C, Oxidoreductase family, C-terminal alpha/beta domain. This family of enzymes utilise NADP or NAD. This family is called the GFO/IDH/MOCA family
306859	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
306860	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
306861	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
306862	Transcription factor AP-2
306869	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
306870	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
306872	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
306873	Protein of unknown function (DUF390). This family of long proteins are currently only found in the rice genome. They have no known function. However they may be some kind of transposable element
306878	Ferredoxin-fold anticodon binding domain. This is the anticodon binding domain found in some phenylalanyl tRNA synthetases. The domain has a ferredoxin fold
306879	tRNA synthetases class II core domain (F). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only phenylalanyl-tRNA synthetases. This is the core catalytic domain
306885	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
306886	Death domain
306886	Protein kinase domain
306886	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
306889	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
306890	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
306891	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
306892	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
306894	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
306895	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
306896	Protein kinase domain
306900	Fork head domain
306913	Uncharacterized protein family UPF0004. This family is the N terminal half of the family. The C-terminal half has been shown to be related to MiaB proteins
306929	Somatotropin hormone family
306930	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
306936	Ribosomal protein S19e
306945	Core histone H2A/H2B/H3/H4
306945	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
306947	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
306948	Core histone H2A/H2B/H3/H4
306948	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
306949	Core histone H2A/H2B/H3/H4
306954	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
306955	Core histone H2A/H2B/H3/H4
306956	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
306957	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
306958	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
306961	WD domain, G-beta repeat
306962	Core histone H2A/H2B/H3/H4
306963	Core histone H2A/H2B/H3/H4
306964	Core histone H2A/H2B/H3/H4
306964	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
306966	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306967	Core histone H2A/H2B/H3/H4
306967	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
306968	Core histone H2A/H2B/H3/H4
306969	Core histone H2A/H2B/H3/H4
306969	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
306970	Core histone H2A/H2B/H3/H4
306971	Core histone H2A/H2B/H3/H4
306972	Core histone H2A/H2B/H3/H4
306973	Core histone H2A/H2B/H3/H4
306974	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
306974	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306974	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
306975	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
306977	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
306977	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
306977	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
306978	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
306979	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
306982	Intermediate filament protein
306990	Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain
306993	Homeobox domain
306993	Pou domain - N-terminal to homeobox domain
306997	Ribosomal protein L21e
306998	Protein kinase domain
307006	Ribosomal protein S6e
307007	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
307010	WD domain, G-beta repeat
307011	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
307011	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
307013	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
307016	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
307016	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
307016	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
307023	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
307028	ENV polyprotein (coat polyprotein)
307035	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
307035	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
307036	Guanylate kinase
307036	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi
307036	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
307039	ADP-ribosylation factor family
307039	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
307043	Protein kinase domain
307047	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
307049	Ribosomal protein L15
307050	PAP/25A associated domain
307051	RyR domain. This domain is called RyR for Ryanodine receptor. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown
307051	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
307052	Ribosomal protein L36e
307056	PH domain. PH stands for pleckstrin homology
307056	Daxx Family. The Daxx protein (also known as the Fas-binding protein) is thought to play a role in apoptosis, but precise role played by Daxx remians to be determined
307057	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
307058	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
307059	Ribosomal protein L6
307067	AMP-binding enzyme
307068	AMP-binding enzyme
307073	NUDIX domain
307074	NUDIX domain
307075	WD domain, G-beta repeat
307079	Ribosomal protein L21e
307080	Chaperonin 10 Kd subunit
307081	Insulinase (Peptidase family M16)
307082	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
307084	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
307091	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
307092	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
307093	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
307095	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
307096	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
307097	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
307098	PH domain. PH stands for pleckstrin homology
307098	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
307099	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
307100	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
307101	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
307102	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
307106	Sterile alpha motif (SAM)/Pointed domain
307106	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
307107	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
307108	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
307109	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
307114	eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues
307123	Protein kinase domain
307123	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
307124	Protein kinase domain
307132	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
307133	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
307134	ENV polyprotein (coat polyprotein)
307134	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
307135	Ribosomal protein L34e
307136	Acyl CoA binding protein
307137	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
307144	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
307148	NOL1/NOP2/sun family
307148	Conserved hypothetical protein 95
307148	ubiE/COQ5 methyltransferase family
307148	FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in comple
307151	Zinc finger, C3HC4 type (RING finger)
307151	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
307154	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
307156	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
307157	SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four decad
307159	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
307167	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
307167	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
307168	Protein kinase domain
307169	Protein kinase domain
307170	Acyl CoA binding protein
307171	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
307173	Ribosomal protein S19e
307177	Shikimate kinase
307177	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
307177	Pyridoxal-phosphate dependent enzyme. Members of this family are all pyridoxal-phosphate dependent enzymes. This family includes: serine dehydratase EC:4.2.1.13 P20132, threonine dehydratase EC:4.2.1.16, tryptophan synthase beta chain EC:4.2.1.20, threoni
307178	PH domain. PH stands for pleckstrin homology
307178	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
307178	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
307180	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
307189	Nebulin repeat
307189	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
307192	Actin
307192	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
307196	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
307200	SH2 domain
307202	Core histone H2A/H2B/H3/H4
307203	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
307204	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
307204	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
307205	CUB domain
307206	Enolase, N-terminal domain
307206	Enolase, C-terminal TIM barrel domain
307209	F-box domain
307212	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
307221	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
307234	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
307234	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
307235	Ribosome-binding factor A
307237	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
307238	SKIP/SNW domain. This domain is found in chromatin proteins
307239	HMG (high mobility group) box
307248	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
307248	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
307251	Zinc finger, C3HC4 type (RING finger)
307253	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
307255	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
307263	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
307263	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
307266	Protein kinase domain
307267	Protein kinase domain
307269	Ribosomal protein L31e
307270	Malic enzyme, N-terminal domain
307270	Malic enzyme, NAD binding domain
307288	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
307289	Eukaryotic porin
307298	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
307298	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
307304	Macrophage migration inhibitory factor (MIF)
307309	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
307314	Ribosomal protein S5, N-terminal domain
307315	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
307317	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
307317	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
307318	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
307319	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
307323	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
307326	Ribosomal protein L19e
307327	Ribosomal protein L19e
307327	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
307328	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
307329	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
307333	Ribosomal protein L19e
307334	Cupin. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nit
307343	Late embryogenesis abundant protein. Different types of LEA proteins are expressed at different stages of late embryogenesis in higher plant seed embryos and under conditions of dehydration stress. The function of these proteins is unknown
307350	Peptidase family M41
307350	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
307351	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
307351	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
307352	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
307353	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
307365	Ribosomal protein L13
307366	Death domain
307370	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b
307371	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
307376	Homeobox domain
307376	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
307384	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
307391	RasGAP C-terminus
307393	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
307394	LIM domain. This family represents two copies of the LIM structural domain
307395	Villin headpiece domain
307395	LIM domain. This family represents two copies of the LIM structural domain
307398	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
307398	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
307398	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
307400	3'5'-cyclic nucleotide phosphodiesterase
307401	3'5'-cyclic nucleotide phosphodiesterase
307401	GAF domain. Domain present in phytochromes and cGMP-specific phosphodiesterases
307402	HMG (high mobility group) box
307403	Protein kinase domain
307403	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
307404	Sulfatase
307410	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
307415	GTPase of unknown function
307415	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
307416	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
307420	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
307421	Isochorismatase family. This family are hydrolase enzymes
307427	WD domain, G-beta repeat
307431	HMG (high mobility group) box
307432	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
307433	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
307438	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
307440	Ribosomal L29e protein family
307441	HMG (high mobility group) box
307444	Translation initiation factor SUI1
307444	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
307444	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
307448	Skp1 family, dimerisation domain
307450	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
307451	High molecular weight glutenin subunit. Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin
307451	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
307453	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
307459	PH domain. PH stands for pleckstrin homology
307465	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
307465	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
307471	Zinc finger, C3HC4 type (RING finger)
307472	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
307474	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
307474	FF domain. This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions
307475	7 transmembrane receptor (rhodopsin family)
307475	7 transmembrane receptor (rhodopsin family)
307476	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
307476	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
307481	Cadherin domain
307482	PH domain. PH stands for pleckstrin homology
307482	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
307482	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
307482	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
307482	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
307482	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
307484	Ribosomal protein L21e
307486	Cadherin domain
307487	Cadherin domain
307488	Cadherin domain
307489	Cadherin domain
307490	Cadherin domain
307491	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
307491	tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt
307492	WHEP-TRS domain
307492	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
307492	tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt
307496	KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif
307500	PH domain. PH stands for pleckstrin homology
307500	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
307503	eRF1 domain 1. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
307503	eRF1 domain 2. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
307503	eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
307505	Vinculin family
307506	Vinculin family
307508	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
307511	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
307512	SRP19 protein. The signal recognition particle (SRP) binds to the signal peptide of proteins as they are being translated. The binding of the SRP halts translation and the complex is then transported to the endoplasmic reticulum's cytoplasmic surface. The
307522	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
307523	Myosin head (motor domain)
307523	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
307524	WD domain, G-beta repeat
307525	RNA polymerase Rpb4
307530	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
307540	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
307554	Ribosomal protein S27
307556	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
307557	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
307561	Zinc finger, C3HC4 type (RING finger)
307562	Cadherin domain
307563	Cadherin domain
307563	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
307582	Laminin B (Domain IV)
307582	Intermediate filament protein
307582	Laminin N-terminal (Domain VI)
307582	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
307582	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
307582	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
307585	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
307587	Mitochondrial carrier protein
307594	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
307602	Elongation factor 1 gamma, conserved domain
307602	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
307603	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
307607	Ribosomal protein L19e
307607	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
307611	Synaptogyrin. This family of proteins is distantly related to pfam01284
307612	Synaptogyrin. This family of proteins is distantly related to pfam01284
307614	Cadherin domain
307618	Cadherin domain
307618	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
307623	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
307623	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
307631	Cadherin domain
307631	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
307641	Protein kinase domain
307650	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
307653	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
307660	Carboxylesterase
307662	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
307665	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
307667	HMG (high mobility group) box
307668	Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses
307669	Sybindin-like family. Sybindin is a physiological syndecan-2 ligand on dendritic spines, the small protrusions on the surface of dendrites that receive the vast majority of excitatory synapses
307672	Myosin head (motor domain)
307674	Myosin head (motor domain)
307675	Myosin head (motor domain)
307684	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
307688	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
307689	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
307690	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
307690	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
307694	Myosin head (motor domain)
307696	Myosin head (motor domain)
307700	Myosin head (motor domain)
307707	Myosin head (motor domain)
307710	Carboxylesterase
307711	Ribosomal protein S26e
307711	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
307713	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
307715	Homeobox domain
307724	C2 domain
307727	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
307727	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
307727	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
307731	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
307731	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
307738	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
307738	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
307740	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
307742	PX domain. PX domains bind to phosphoinositides
307744	Adenylate and Guanylate cyclase catalytic domain
307745	PAP/25A associated domain
307745	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p
307745	Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to
307749	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
307750	Ribosomal protein L19e
307751	Ribosomal protein S8
307752	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
307757	CUB domain
307758	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
307759	Aminotransferase class I and II
307760	Ribosomal protein S17
307761	Protein kinase domain
307762	Ribosomal protein S19
307763	Ribosomal L29e protein family
307763	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
307764	Ubiquitin carboxyl-terminal hydrolase family 2
307764	Ubiquitin carboxyl-terminal hydrolases family 2
307767	Calx-beta domain
307771	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
307772	Anaphase-promoting complex, subunit 10 (APC10)
307772	Poxvirus D5 protein. This protein is necessary for viral DNA replication, and is a nucleic acid independent nucleoside triphosphatase
307775	Ribosomal protein S26e
307778	Ribosomal L28e protein family
307779	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
307782	Clathrin adaptor complex small chain
307784	Domain of unknown function
307790	Mitochondrial carrier protein
307791	Carboxylesterase
307794	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
307796	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
307797	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
307799	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
307801	Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilise NADP or NAD
307802	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
307803	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
307805	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
307807	Ribosomal protein L23
307808	Protein kinase domain
307809	Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrour
307810	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
307810	Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrour
307811	Amino acid permease
307812	Conserved hypothetical protein 95
307815	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
307821	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
307829	Homeobox domain
307833	Eukaryotic protein of unknown function, DUF292
307836	PH domain. PH stands for pleckstrin homology
307836	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
307839	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
307849	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
307849	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
307850	3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
307851	SCP-like extracellular protein. This domain is also found in prokaryotes
307851	Lectin C-type domain. This family includes both long and short form C-type
307852	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
307852	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
307853	Fatty acid hydroxylase
307853	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
307859	Sulfotransferase protein
307860	Adenosine-deaminase (editase) domain
307862	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
307868	SAND family protein. Members of this family are uncharacterised proteins that have been called SAND proteins. These protein do not contain a SAND domain. The members of this family are 500-600 amino acids long and contain several conserved motifs
307872	Lectin C-type domain. This family includes both long and short form C-type
307873	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
307882	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
307893	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
307893	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
307897	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
307897	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
307900	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
307900	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
307902	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
307903	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
307903	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
307905	Ubiquitin carboxyl-terminal hydrolase family 2
307905	Ubiquitin carboxyl-terminal hydrolases family 2
307915	Uncharacterised protein family (UPF0172)
307916	MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000
307920	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
307922	AMP-binding enzyme
307923	Vacuolar sorting protein 9 (VPS9) domain
307925	Ribosomal protein L19e
307927	WD domain, G-beta repeat
307935	B-box zinc finger
307938	Fibronectin type III domain
307938	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
307947	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
307948	Protein of unknown function, DUF265
307950	Protein kinase domain
307950	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
307952	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
307956	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
307960	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
307968	C2 domain
307968	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
307973	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
307979	HMG (high mobility group) box
307982	Nebulin repeat
307983	Nebulin repeat
307983	LIM domain. This family represents two copies of the LIM structural domain
307988	von Willebrand factor type A domain
307988	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
307989	Villin headpiece domain
307989	LIM domain. This family represents two copies of the LIM structural domain
307990	TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s
307992	Delta serrate ligand
307992	Lectin C-type domain. This family includes both long and short form C-type
307992	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
307997	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
307999	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
308000	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
308000	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
308001	Homeobox domain
308003	C2 domain
308005	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
308007	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
308011	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308012	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308013	Carboxylesterase
308017	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
308021	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
308026	Protein kinase domain
308028	Sugar (and other) transporter
308032	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308036	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
308036	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
308037	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
308039	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
308044	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
308045	HIT family
308045	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
308047	DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds
308048	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
308051	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
308052	Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase
308053	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
308061	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
308062	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
308064	Intermediate filament protein
308066	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
308067	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
308074	Protein kinase domain
308076	Protein kinase domain
308078	Protein kinase domain
308078	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
308079	Protein kinase domain
308080	Protein kinase domain
308081	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
308086	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
308097	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
308099	Fibronectin type III domain
308100	Intermediate filament protein
308100	Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
308100	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
308100	Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains.
308100	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
308101	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
308101	Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
308102	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
308102	Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
308103	7 transmembrane receptor (rhodopsin family)
308106	Protein kinase domain
308108	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL
308109	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
308113	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
308113	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
308118	ATP synthase
308118	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
308119	Peptide hormone. This family contains glucagon, GIP, secretin and VIP
308121	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
308126	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
308127	Formate--tetrahydrofolate ligase
308127	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain
308127	pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
308127	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
308129	Nitroreductase family. Members of this family utilise FMN as a cofactor
308136	ADP-ribosylation factor family
308140	Ribosomal RNA adenine dimethylase
308141	PMP-22/EMP/MP20/Claudin family
308142	PH domain. PH stands for pleckstrin homology
308142	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
308142	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
308163	7 transmembrane receptor (rhodopsin family)
308164	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
308170	Protein kinase domain
308172	Protein kinase domain
308173	Kinesin motor domain
308174	CAP protein
308176	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
308177	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
308178	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
308182	Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family
308183	Protein kinase domain
308183	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
308184	Protein kinase domain
308185	Protein kinase domain
308185	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
308186	Protein kinase domain
308186	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
308188	Protein kinase domain
308190	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
308191	Exocyst complex subunit Sec15-like
308191	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
308191	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
308193	Protein kinase domain
308194	Protein kinase domain
308195	Protein kinase domain
308196	Protein kinase domain
308197	Protein kinase domain
308197	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
308198	Protein kinase domain
308199	Protein kinase domain
308199	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
308200	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
308200	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
308201	RIO1/ZK632.3/MJ0444 family
308203	Protein kinase domain
308203	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
308204	Protein kinase domain
308205	Protein kinase domain
308205	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
308209	Kinesin motor domain
308209	Protein kinase domain
308209	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
308210	HMG (high mobility group) box
308210	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
308211	Protein kinase domain
308213	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
308213	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an
308214	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an
308224	WD domain, G-beta repeat
308231	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308232	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308233	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308234	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308236	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308237	Ribosomal L18ae protein family
308238	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308239	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308239	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
308240	Protein kinase domain
308240	7 transmembrane receptor (metabotropic glutamate family)
308240	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308241	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308242	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308243	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308244	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308245	7 transmembrane receptor (metabotropic glutamate family)
308245	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308246	7 transmembrane receptor (metabotropic glutamate family)
308246	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308247	Protein kinase domain
308248	7 transmembrane receptor (metabotropic glutamate family)
308248	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308250	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
308251	7 transmembrane receptor (metabotropic glutamate family)
308251	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308252	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308253	7 transmembrane receptor (metabotropic glutamate family)
308253	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308254	ENV polyprotein (coat polyprotein)
308254	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
308254	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308254	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
308254	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
308255	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308256	7 transmembrane receptor (metabotropic glutamate family)
308256	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308257	E1-E2 ATPase
308257	7 transmembrane receptor (metabotropic glutamate family)
308257	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308257	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
308257	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
308263	Ribosomal protein L23
308265	Protein kinase domain
308265	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
308267	Uncharacterized ACR, COG1579
308269	Fibrillarin
308282	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
308282	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
308282	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
308282	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
308283	F-box domain
308286	Mitochondrial carrier protein
308287	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
308287	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
308287	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
308287	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
308290	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
308290	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
308291	Low temperature viability protein
308294	Cytidine and deoxycytidylate deaminase zinc-binding region
308296	7 transmembrane receptor (metabotropic glutamate family)
308297	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308297	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308298	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308299	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308300	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308301	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
308301	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
308302	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc
308303	7 transmembrane receptor (metabotropic glutamate family)
308303	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308304	7 transmembrane receptor (metabotropic glutamate family)
308304	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308305	7 transmembrane receptor (metabotropic glutamate family)
308305	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308306	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
308306	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
308307	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308308	7 transmembrane receptor (metabotropic glutamate family)
308309	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
308311	Not1 N-terminal domain, CCR4-Not complex component
308311	NOT2 / NOT3 / NOT5 family. NOT1, NOT2, NOT3, NOT4 and NOT5 form a nuclear complex that negatively regulates the basal and activated transcription of many genes. This family includes NOT2, NOT3 and NOT5
308312	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
308313	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
308313	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
308314	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
308315	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
308316	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308318	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308320	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308322	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308324	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308325	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308326	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308330	7 transmembrane receptor (rhodopsin family)
308330	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
308330	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
308330	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
308331	Homeobox domain
308332	Protein kinase domain
308333	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
308334	7 transmembrane receptor (metabotropic glutamate family)
308334	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308335	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
308336	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308340	Protein kinase domain
308341	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
308341	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308344	Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase
308347	Protein kinase domain
308349	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
308353	Ribosomal protein L44
308354	7 transmembrane receptor (rhodopsin family)
308354	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
308355	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
308355	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
308355	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308356	Ribosomal protein S21e
308356	7 transmembrane receptor (metabotropic glutamate family)
308358	7 transmembrane receptor (metabotropic glutamate family)
308358	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308359	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
308362	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
308362	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308362	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308363	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308363	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308364	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308365	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
308365	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308365	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308367	Ribosomal protein L6
308368	Ribosomal protein S19e
308372	HMG (high mobility group) box
308376	7 transmembrane receptor (Secretin family)
308376	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
308377	Connexin
308378	7 transmembrane receptor (rhodopsin family)
308379	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
308380	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
308383	ENV polyprotein (coat polyprotein)
308384	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
308386	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
308388	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
308389	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
308392	WD domain, G-beta repeat
308397	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
308400	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
308401	WD domain, G-beta repeat
308402	Class I Histocompatibility antigen, domains alpha 1 and 2
308402	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
308405	Protein kinase domain
308407	HMG (high mobility group) box
308408	7 transmembrane receptor (rhodopsin family)
308410	Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues
308420	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
308420	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308421	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308422	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308423	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308424	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308424	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308429	Phospholipase A2 inhibitor
308431	PH domain. PH stands for pleckstrin homology
308435	HMG (high mobility group) box
308440	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
308441	3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
308444	Protein kinase domain
308444	Fibronectin type III domain
308444	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
308445	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
308446	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
308446	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
308449	Protein kinase domain
308451	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
308451	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
308454	Phosphotyrosine interaction domain (PTB/PID)
308454	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
308455	PH domain. PH stands for pleckstrin homology
308455	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
308457	2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2
308457	2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2
308458	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
308458	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
308459	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
308461	Protein kinase domain
308463	Protein kinase domain
308463	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
308465	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308466	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308467	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308468	Protein kinase domain
308472	KOW motif. This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG
308473	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
308476	Calpain family cysteine protease
308476	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by
308478	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl
308482	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308483	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
308483	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
308483	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308484	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
308485	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308486	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308487	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308487	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308488	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308489	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308490	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
308490	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308490	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
308491	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308492	WD domain, G-beta repeat
308492	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308493	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
308493	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
308495	Fibronectin type III domain
308495	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
308495	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
308496	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
308497	Amyloid A4 extracellular domain
308498	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
308498	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
308501	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
308503	Glycosyltransferase family 6
308503	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
308505	Hsp90 protein
308506	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
308507	LIM domain. This family represents two copies of the LIM structural domain
308508	Repeat in ubiquitin-activating (UBA) protein
308508	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
308510	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
308511	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
308525	Ribosomal protein L15
308532	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308533	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308534	SH2 domain
308535	ENV polyprotein (coat polyprotein)
308535	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308536	Phosphoglycerate kinase
308537	Putative transcriptional regulator. Members of this family range in length from 140-180 amino acids. One member interacts with c-Myc and represses its transcriptional activity. Suggesting that other members of this family may also regulate transcription b
308540	Ribosomal protein L13
308541	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308542	PH domain. PH stands for pleckstrin homology
308542	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
308543	PH domain. PH stands for pleckstrin homology
308543	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
308550	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
308550	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308551	7 transmembrane receptor (metabotropic glutamate family)
308551	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308553	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
308554	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308559	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308560	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308561	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308561	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308562	Trypsin
308564	Trypsin
308565	Trypsin
308566	Trypsin
308569	Histidine acid phosphatase
308569	7 transmembrane receptor (rhodopsin family)
308570	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
308571	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
308575	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
308578	Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site
308578	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
308578	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
308579	Choline/Carnitine o-acyltransferase
308582	TEA/ATTS domain family
308583	Oxidoreductase family, NAD-binding Rossmann fold. This family of enzymes utilise NADP or NAD
308583	pfam02894, GFO_IDH_MocA_C, Oxidoreductase family, C-terminal alpha/beta domain. This family of enzymes utilise NADP or NAD. This family is called the GFO/IDH/MOCA family
308584	PH domain. PH stands for pleckstrin homology
308585	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
308586	Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes
308587	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
308587	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
308588	Eukaryotic-type carbonic anhydrase
308588	Eukaryotic-type carbonic anhydrase
308589	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
308592	WD domain, G-beta repeat
308597	Serum amyloid A protein
308600	Homeobox domain
308600	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
308600	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
308601	Ribosomal L38e protein family
308601	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
308603	7 transmembrane receptor (rhodopsin family)
308605	7 transmembrane receptor (rhodopsin family)
308606	7 transmembrane receptor (rhodopsin family)
308607	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
308608	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
308623	Hsp90 protein
308623	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
308636	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
308638	Ribosomal L18ae protein family
308638	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
308638	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
308641	Growth-Arrest-Specific Protein 2 Domain
308645	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
308649	Protein kinase domain
308650	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
308652	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
308653	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
308653	TAP C-terminal domain. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for nuclear export of mRNA. Tap has a modular structure, and its most C-terminal domain is important for binding to FG repeat-containing nucle
308661	Mitochondrial carrier protein
308662	Ribosomal protein L34e
308662	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
308663	Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin
308665	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
308669	Regulator of chromosome condensation (RCC1)
308669	Anaphase-promoting complex, subunit 10 (APC10)
308669	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
308669	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
308669	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
308670	C-5 cytosine-specific DNA methylase
308678	Mitochondrial carrier protein
308690	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
308691	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
308694	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
308694	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
308700	tRNA synthetases class II (A). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only alanyl-tRNA synthetases
308700	TGS domain. The TGS domain is named after ThrRS, GTPase, and SpoT. Interestingly, TGS domain was detected also at the amino terminus of the uridine kinase from the spirochaete Treponema pallidum (but not any other organism, including the related spirochae
308701	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
308701	tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt
308702	Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be
308703	Protein kinase domain
308703	ADP-ribosylation factor family
308705	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
308706	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
308706	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
308709	Intermediate filament protein
308709	Intermediate filament protein
308710	Pyroglutamyl peptidase
308710	Cyclophilin type peptidyl-prolyl cis-trans isomerase
308711	Ribosomal protein S18
308713	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
308715	Tissue factor
308716	Tissue factor
308716	Ribosomal protein S15
308718	WD domain, G-beta repeat
308725	HMG (high mobility group) box
308727	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
308732	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
308738	'chromo' (CHRromatin Organization MOdifier) domain
308738	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
308738	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
308742	C2 domain
308744	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
308752	Ribosomal L10
308755	HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease
308757	Glycosyl hydrolases family 38. Glycosyl hydrolases are key enzymes of carbohydrate metabolism
308761	Microtubule associated protein (MAP65/ASE1 family)
308762	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
308765	Clathrin adaptor complex small chain
308766	Helix-loop-helix DNA-binding domain
308767	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
308770	Ubiquinol-cytochrome C reductase complex 14kD subunit. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 14kD (or VI) subunit of the complex which is not directly in
308773	Extracellular link domain
308773	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
308776	3'5'-cyclic nucleotide phosphodiesterase
308777	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
308779	Fibronectin type III domain
308779	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
308787	Thrombospondin type 1 domain
308787	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
308792	START domain
308799	7 transmembrane receptor (metabotropic glutamate family)
308801	ADP-ribosylation factor family
308801	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
308802	Phosphoglycerate kinase
308807	Trypsin
308808	Malic enzyme, N-terminal domain
308811	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
308813	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
308815	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
308821	ADP-ribosylation factor family
308821	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
308821	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
308826	Ribosomal protein L21e
308833	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
308835	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
308835	ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum m
308836	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
308837	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
308839	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
308848	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
308850	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
308850	Glycerophosphoryl diester phosphodiesterase family. E. coli has two sequence related isozymes of glycerophosphoryl diester phosphodiesterase (GDPD) - periplasmic and cytosolic. This family also includes agrocinopine synthase, the similarity to GDPD has be
308850	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
308851	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
308853	7 transmembrane receptor (rhodopsin family)
308854	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
308855	Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to for
308857	DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh
308857	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
308864	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
308865	WD domain, G-beta repeat
308865	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
308865	ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits a
308865	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
308867	Ribosomal protein L11, RNA binding domain
308868	O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production
308872	NAD:arginine ADP-ribosyltransferase
308873	NAD:arginine ADP-ribosyltransferase
308875	ADP-ribosylation factor family
308875	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
308879	Poly A polymerase family. This family includes nucleic acid independent RNA polymerases, such as Poly(A) polymerase, which adds the poly (A) tail to mRNA EC:2.7.7.19. This family also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of
308880	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
308887	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
308887	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
308890	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
308896	Ubiquitin carboxyl-terminal hydrolase family 2
308896	Ubiquitin carboxyl-terminal hydrolases family 2
308898	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
308900	Zinc finger, C3HC4 type (RING finger)
308901	B-box zinc finger
308901	Zinc finger, C3HC4 type (RING finger)
308901	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
308902	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
308904	7 transmembrane receptor (rhodopsin family)
308905	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
308905	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
308906	B-box zinc finger
308906	Zinc finger, C3HC4 type (RING finger)
308906	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
308907	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
308909	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
308917	Olfactomedin-like domain
308919	Trypsin
308919	CUB domain
308919	Oxidoreductase FAD-binding domain
308919	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
308920	ENV polyprotein (coat polyprotein)
308922	Ribosomal L38e protein family
308923	WD domain, G-beta repeat
308925	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
308927	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
308928	HMG (high mobility group) box
308931	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
308932	Peptidase family C54
308933	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
308937	Protein kinase domain
308941	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
308941	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
308942	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
308942	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
308942	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
308942	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
308944	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
308944	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
308944	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
308945	Ribosomal protein L21e
308946	Protein kinase domain
308949	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
308950	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
308951	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
308955	Ribosomal protein L10
308956	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
308956	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
308959	Ubiquitin carboxyl-terminal hydrolase family 2
308965	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
308966	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
308968	Zinc finger, C3HC4 type (RING finger)
308968	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
308972	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
308976	jmjC domain
308985	Ribosomal Proteins L2, C-terminal domain
308985	Ribosomal Proteins L2, RNA binding domain
308988	CUB domain
308988	Sushi domain (SCR repeat)
308990	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
308993	AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt
308993	AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the d
308995	von Willebrand factor type A domain
308995	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
308996	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308997	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
308998	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
308999	F-box domain
308999	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
309001	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
309001	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
309006	Ribosomal protein L21e
309007	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
309008	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
309010	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
309010	DDHD domain. The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoestera
309014	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
309027	Zona pellucida-like domain
309027	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
309032	WD domain, G-beta repeat
309035	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
309036	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
309036	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
309038	AMP-binding enzyme
309039	ENV polyprotein (coat polyprotein)
309041	pfam02926, THUMP, THUMP domain. The THUMP domain is named after after thiouridine synthases, methylases and PSUSs. The THUMP domain consists of about 110 amino acid residues. It is predicted that this domain is an RNA-binding domain that adopts an alpha/b
309044	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
309047	HMG (high mobility group) box
309055	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
309056	Sulfotransferase protein
309063	Ribosomal protein S12
309064	ENV polyprotein (coat polyprotein)
309066	Ribosomal protein S21e
309070	Uroporphyrinogen-III synthase HemD. This family consists of uroporphyrinogen-III synthase HemD EC:4.2.1.75 also known as Hydroxymethylbilane hydrolyase (cyclizing) from eukaryotes, bacteria and archaea. This enzyme catalyses the reaction: Hydroxymethylbil
309071	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
309077	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
309081	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
309095	Homeobox domain
309097	7 transmembrane receptor (Secretin family)
309098	Disintegrin
309098	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
309098	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
309099	Ribosomal protein L6
309103	Glycosyl hydrolase family 65 central catalytic domain. This family of glycosyl hydrolases contains vacuolar acid trehalase and maltose phosphorylase.Maltose phosphorylase (MP) is a dimeric enzyme that catalyzes the conversion of maltose and inorganic phos
309106	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
309108	Intermediate filament tail domain
309110	DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117
309111	Mitochondrial carrier protein
309113	RNA polymerases N / 8 kDa subunit
309115	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc
309118	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
309126	Importin-beta N-terminal domain
309127	Cyclin-dependent kinase regulatory subunit
309129	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
309139	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
309140	Ribosomal prokaryotic L21 protein
309142	Tesmin/TSO1-like CXC domain. This family includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin and TSO1. This family is called a CXC domain in
309146	ENV polyprotein (coat polyprotein)
309151	Rad9. Rad9 is required for transient cell-cycle arrests and transcriptional induction of DNA repair in response to DNA damage
309153	CAAX amino terminal protease family. Members of this family are probably proteases. the family contains CAAX prenyl protease. The proteins contain a highly conserved Glu-Glu motif at the amino end of the alignment. The alignment also contains two histidin
309155	Microtubule associated protein (MAP65/ASE1 family)
309163	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
309169	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
309172	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
309173	Mitochondrial carrier protein
309174	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di
309174	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
309179	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzy
309179	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
309185	Platelet-derived growth factor (PDGF)
309186	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
309191	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
309194	TATA box binding protein associated factor (TAF). TAF proteins adopt a histone-like fold
309195	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
309197	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
309198	Fibronectin type III domain
309201	Tetraspanin family
309202	Ribosomal protein L36e
309202	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
309203	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
309204	Ribosomal protein L36e
309207	Lipase (class 3)
309210	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
309212	7 transmembrane receptor (rhodopsin family)
309214	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
309217	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
309218	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
309219	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
309221	7 transmembrane receptor (rhodopsin family)
309222	7 transmembrane receptor (rhodopsin family)
309223	Ribosomal protein L15
309224	DNA directed RNA polymerase, 7 kDa subunit
309225	7 transmembrane receptor (rhodopsin family)
309230	7 transmembrane receptor (rhodopsin family)
309230	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
309231	7 transmembrane receptor (rhodopsin family)
309236	Ribosomal protein S2
309238	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
309240	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
309242	ADP-ribosylation factor family
309242	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
309246	Fork head domain
309248	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
309249	Ribosomal protein L36e
309250	HMG14 and HMG17
309251	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
309259	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
309261	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
309262	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
309262	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
309264	LIM domain. This family represents two copies of the LIM structural domain
309275	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
309276	ENV polyprotein (coat polyprotein)
309276	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
309279	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
309280	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
309280	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
309280	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
309281	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa
309283	Enolase, N-terminal domain
309283	Enolase, C-terminal TIM barrel domain
309288	metallopeptidase family M24
309288	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
309291	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
309292	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
309300	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
309301	RNA 3'-terminal phosphate cyclase
309306	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
309311	Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a
309312	Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a
309314	Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a
309315	Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a
309317	Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a
309318	Glycine cleavage system P-protein. This family consists of Glycine cleavage system P-proteins EC:1.4.4.2 from bacterial, mammalian and plant sources. The P protein is part of the glycine decarboxylase multienzyme complex EC:2.1.2.10 (GDC) also annotated a
309320	Core histone H2A/H2B/H3/H4
309321	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
309324	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
309325	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
309327	Eukaryotic initiation factor 3, gamma subunit. eIF-3 is a multisubunit complex that stimulates translation initiation in vitro at several different steps. This family corresponds to the gamma subunit if eIF3
309328	Tumor protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction a
309329	EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains
309331	YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named
309331	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
309331	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
309334	Cyclic nucleotide-binding domain
309338	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
309345	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
309345	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
309348	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
309349	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
309356	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
309356	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
309358	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
309358	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
309359	Ubiquitin carboxyl-terminal hydrolase family 2
309360	Ubiquitin carboxyl-terminal hydrolase family 2
309360	Ubiquitin carboxyl-terminal hydrolases family 2
309361	Protein kinase domain
309362	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
309362	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
309362	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
309363	Ribosomal protein L19e
309368	Laminin N-terminal (Domain VI)
309368	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
309369	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
309373	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
309374	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
309376	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
309377	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
309381	Thi4 family. This family includes a putative thiamine biosynthetic enzyme
309381	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
309381	Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. Th
309382	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
309383	Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular m
309384	Protein kinase domain
309386	Glycosyl hydrolase family 79, N-terminal domain. Family of endo-beta-N-glucuronidase, or heparanase. Heparan sulfate proteoglycans (HSPGs) play a key role in the self- assembly, insolubility and barrier properties of basement membranes and extracellular m
309389	Homeobox domain
309391	Cytochrome oxidase assembly protein. This is a family of integral membrane proteins. CtaA is required for cytochrome aa3 oxidase assembly in Bacillus subtilis. COX15 is required for cytochrome c oxidase assembly in yeast
309392	CutC family. Copper transport in Escherichia coli is mediated by the products of at least six genes, cutA, cutB, cutC, cutD, cutE, and cutF. A mutation in one or more of these genes results in an increased copper sensitivity. Members of this family are be
309393	NAC domain
309393	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
309396	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
309396	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
309399	Dienelactone hydrolase family
309399	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
309406	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
309408	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
309410	MAM domain. An extracellular domain found in many receptors
309416	Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain i
309419	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal
309430	DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai
309432	Fatty acid desaturase
309433	WD domain, G-beta repeat
309434	jmjC domain
309437	'Paired box' domain
309441	DnaB-like helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the Escherichia coli chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the dup
309447	LIM-domain binding protein. The LIM-domain binding protein, binds to the LIM domain pfam00412 of LIM homeodomain proteins which are transcriptional regulators of development. Nuclear LIM interactor (NLI) / LIM domain-binding protein 1 (LDB1) is located in
309448	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
309449	GNS1/SUR4 family
309451	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
309451	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
309452	Death domain
309452	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
309457	Zinc finger, C3HC4 type (RING finger)
309458	S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure
309460	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
309462	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
309465	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
309465	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
309475	Endomembrane protein 70
309476	CUB domain
309482	Ribosomal protein L24e
309483	Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosyn
309494	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
309495	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
309499	C2 domain
309500	C2 domain
309501	C2 domain
309504	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
309508	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
309509	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
309511	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
309511	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
309512	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
309512	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
309512	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
309513	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
309514	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
309518	RNase P subunit p30. This protein is part of the RNase P complex that is involved in tRNA maturation
309522	tRNA synthetases class I (E and Q), catalytic domain. Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only glutamyl and glutaminyl tRNA synthetases. In some organisms, a single glutamyl-tRNA synthetase aminoacyla
309523	Kinesin motor domain
309523	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
309523	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
309525	Ribosomal protein L44
309527	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. M
309528	ab-hydrolase associated lipase region
309528	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
309529	Ribosomal family S4e
309532	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
309533	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
309541	Ribosomal Proteins L2, C-terminal domain
309541	Ribosomal Proteins L2, RNA binding domain
309543	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
309544	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
309550	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
309552	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
309558	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
309558	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
309562	S-adenosyl-L-homocysteine hydrolase
309562	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
309567	Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
309567	Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold
309574	7 transmembrane receptor (rhodopsin family)
309575	7 transmembrane receptor (rhodopsin family)
309576	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
309577	7 transmembrane receptor (rhodopsin family)
309578	7 transmembrane receptor (rhodopsin family)
309579	7 transmembrane receptor (rhodopsin family)
309583	7 transmembrane receptor (rhodopsin family)
309583	LIM domain. This family represents two copies of the LIM structural domain
309584	Class I Histocompatibility antigen, domains alpha 1 and 2
309584	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
309585	B-box zinc finger
309586	B-box zinc finger
309586	Zinc finger, C3HC4 type (RING finger)
309586	Class I Histocompatibility antigen, domains alpha 1 and 2
309586	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
309586	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
309587	Class I Histocompatibility antigen, domains alpha 1 and 2
309588	Class I Histocompatibility antigen, domains alpha 1 and 2
309588	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
309589	Class I Histocompatibility antigen, domains alpha 1 and 2
309590	Class I Histocompatibility antigen, domains alpha 1 and 2
309591	B-box zinc finger
309591	Zinc finger, C3HC4 type (RING finger)
309591	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
309592	Class I Histocompatibility antigen, domains alpha 1 and 2
309593	GTPase of unknown function
309595	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
309595	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
309596	tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included
309598	Class I Histocompatibility antigen, domains alpha 1 and 2
309598	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
309599	Class I Histocompatibility antigen, domains alpha 1 and 2
309599	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
309600	Class I Histocompatibility antigen, domains alpha 1 and 2
309601	TNF(Tumor Necrosis Factor) family
309603	Class I Histocompatibility antigen, domains alpha 1 and 2
309605	Class I Histocompatibility antigen, domains alpha 1 and 2
309606	Class I Histocompatibility antigen, domains alpha 1 and 2
309606	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
309607	Class I Histocompatibility antigen, domains alpha 1 and 2
309612	Ribosomal protein L13e
309613	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
309613	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
309615	TPR Domain
309618	Homeobox domain
309618	PBX domain. The PBX domain is a bipartite acidic domain
309619	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
309620	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
309621	Class II histocompatibility antigen, alpha domain
309621	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
309622	Class II histocompatibility antigen, beta domain
309622	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
309623	Class II histocompatibility antigen, beta domain
309625	Class II histocompatibility antigen, beta domain
309626	Zinc finger, C3HC4 type (RING finger)
309627	Class I Histocompatibility antigen, domains alpha 1 and 2
309629	KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif
309630	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
309630	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
309639	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
309640	Phosphotyrosine interaction domain (PTB/PID)
309640	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
309642	PH domain. PH stands for pleckstrin homology
309644	Tub family
309647	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
309647	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
309647	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
309649	Intermediate filament protein
309649	ADP-ribosylation factor family
309649	7 transmembrane receptor (rhodopsin family)
309649	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
309649	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
309649	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
309649	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
309650	C2 domain
309650	von Willebrand factor type A domain
309651	Cyclophilin type peptidyl-prolyl cis-trans isomerase
309653	PH domain. PH stands for pleckstrin homology
309653	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
309653	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
309657	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
309658	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
309665	Trypsin
309665	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
309666	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
309666	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
309667	MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000
309684	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
309688	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
309691	Uncharacterized protein family UPF0054
309694	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
309695	Sodium:solute symporter family
309697	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
309713	Ribosomal protein S26e
309715	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
309722	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
309722	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
309723	CTF/NF-I family
309728	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
309733	jmjC domain
309738	jmjC domain
309743	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
309745	Vinculin family
309747	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
309749	Vinculin family
309751	Vinculin family
309753	Vinculin family
309754	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
309757	Sir2 family
309758	Regulator of chromosome condensation (RCC1)
309758	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
309760	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
309763	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
309763	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
309766	Ribosomal protein L21e
309767	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
309771	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
309771	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
309773	CUB domain
309773	LCCL domain
309774	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
309778	Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region
309786	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
309788	TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme
309794	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
309796	Homeobox domain
309799	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
309800	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
309804	Protein kinase domain
309805	Protein kinase domain
309811	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
309812	DNA polymerase family B, exonuclease domain. This domain has 3' to 5' exonuclease activity and adopts a ribonuclease H type fold
309812	DNA polymerase family B. This region of DNA polymerase B appears to consist of more than one structural domain, possibly including elongation, DNA-binding and dNTP binding activities
309821	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
309822	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
309828	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
309830	Homeobox domain
309831	Homeobox domain
309833	Core histone H2A/H2B/H3/H4
309834	LIM domain. This family represents two copies of the LIM structural domain
309837	Thrombospondin type 1 domain
309837	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
309837	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
309840	Ribosomal protein S7e
309846	AFG1-like ATPase. This family of proteins contains a P-loop motif and are predicted to be ATPases
309848	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
309848	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
309852	Cyclophilin type peptidyl-prolyl cis-trans isomerase
309854	WD domain, G-beta repeat
309855	SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin
309858	pfam02889, Sec63, Sec63 domain
309859	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
309868	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
309869	Autophagy protein Apg5. Apg5p is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway
309871	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
309871	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
309883	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
309884	Proteasome A-type and B-type
309887	pfam02889, Sec63, Sec63 domain
309887	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
309887	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
309891	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
309892	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
309893	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
309895	Class I Histocompatibility antigen, domains alpha 1 and 2
309896	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
309897	Protein kinase domain
309898	Protein kinase domain
309900	Tub family
309901	Fibronectin type III domain
309902	CXXC zinc finger. This domain contains eight conserved cysteine residues that bind to zinc. The CXXC domain is found in proteins that methylate cytosine, proteins that bind to methyl cytosine and HRX related proteins
309904	Homeobox domain
309905	Homeobox domain
309906	Homeobox domain
309909	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
309911	Amidase
309912	Zinc-binding dehydrogenase
309915	ENV polyprotein (coat polyprotein)
309918	Occludin/ELL family
309919	2-hydroxychromene-2-carboxylate isomerase family. 2-hydroxychromene-2-carboxylate (HCCA) isomerase enzymes catalyse one step in prokaryotic polyaromatic hydrocarbon (PAH) catabolic pathways . This family also contains members with functions other than HCC
309922	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
309922	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
309923	Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines
309927	Ribosomal protein L24e
309928	C2 domain
309940	Nucleoside diphosphate kinase
309941	Mitochondrial carrier protein
309946	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
309949	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
309957	SRF-type transcription factor (DNA-binding and dimerisation domain)
309959	GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins
309962	3'5'-cyclic nucleotide phosphodiesterase
309962	PAS domain. CAUTION. This family does not currently match all known examples of PAS domains. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels
309969	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
309975	Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13,
309977	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
309981	TMS membrane protein/tumour differentially expressed protein (TDE)
309985	DNA mismatch repair proteins, mutS family
309992	Autophagocytosis associated protein, C-terminal domain. Autophagocytosis is a starvation-induced process responsible for transport of cytoplasmic proteins to the vacuole
309998	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
309999	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
309999	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
310000	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
310004	Succinate dehydrogenase cytochrome b subunit
310005	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
310011	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
310011	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
310019	WD domain, G-beta repeat
310029	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
310030	Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction
310033	Occludin/ELL family
310034	Rad17 cell cycle checkpoint protein
310034	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
310053	Cyclophilin type peptidyl-prolyl cis-trans isomerase
310056	Importin-beta N-terminal domain
310058	Ribosomal protein L23
310071	WD domain, G-beta repeat
310074	GNS1/SUR4 family
310074	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
310086	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
310086	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
310086	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL
310090	Fibronectin type III domain
310090	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
310090	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
310091	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310091	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
310092	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
310096	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
310097	PX domain. PX domains bind to phosphoinositides
310097	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
310098	Translation initiation factor SUI1
310106	ADP-ribosylation factor family
310126	Sushi domain (SCR repeat)
310126	Low-density lipoprotein receptor domain class A
310126	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl
310132	Fibronectin type III domain
310139	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
310141	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
310144	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310151	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
310160	RNase3 domain
310169	HMG (high mobility group) box
310173	Cadherin domain
310174	Cadherin domain
310174	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
310178	PH domain. PH stands for pleckstrin homology
310178	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
310179	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
310180	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
310182	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310183	Ribosomal protein L6
310192	PH domain. PH stands for pleckstrin homology
310192	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
310192	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
310192	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
310193	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
310194	MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases
310200	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
310201	Dienelactone hydrolase family
310209	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
310210	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
310212	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
310213	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
310214	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
310215	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
310216	Clathrin adaptor complex small chain
310216	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
310218	Eukaryotic-type carbonic anhydrase
310223	Ribosomal protein L34e
310228	Ribosomal protein L31e
310228	Eukaryotic protein of unknown function, DUF279
310230	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa
310236	HMG (high mobility group) box
310242	Ribosomal protein L34e
310242	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
310243	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
310245	Ribosomal protein L36e
310247	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
310250	Homeobox domain
310256	Eukaryotic initiation factor 4E
310256	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
310257	Glutathione peroxidase
310257	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
310261	Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold
310273	Fibronectin type III domain
310277	Carboxylesterase
310281	Carboxylesterase
310284	Ribosomal protein L6
310285	Ribosomal protein L6
310298	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
310302	Calcium-activated potassium channel, beta subunit
310303	Calcium-activated potassium channel, beta subunit
310305	Ubiquitin carboxyl-terminal hydrolase family 2
310306	Ubiquitin carboxyl-terminal hydrolases family 2
310306	Zn-finger in ubiquitin-hydrolases and other protein
310306	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
310312	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
310317	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
310318	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310322	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310324	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
310325	Core histone H2A/H2B/H3/H4
310325	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
310326	Sulfatase
310326	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
310326	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
310327	7 transmembrane receptor (rhodopsin family)
310334	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
310334	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
310339	ENV polyprotein (coat polyprotein)
310341	Cadherin domain
310341	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
310344	Protein kinase domain
310352	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
310353	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
310354	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
310358	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
310360	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
310360	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
310360	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
310361	Coenzyme A transferase
310363	F-box domain
310365	Ribosomal protein L21e
310366	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
310366	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
310369	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
310369	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
310375	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310376	Protein kinase domain
310378	Alanine dehydrogenase/pyridine nucleotide transhydrogenase. This family now also contains the lysine 2-oxoglutarate reductases
310383	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
310385	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
310388	Ribosomal protein L31e
310389	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
310390	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
310392	Amino acid permease
310395	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
310401	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
310410	Ribosomal protein L21e
310426	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
310427	WD domain, G-beta repeat
310429	MCM2/3/5 family
310433	Ribosomal protein L11, RNA binding domain
310433	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
310434	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310436	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
310437	Ribosomal protein S15
310437	TSC-22/dip/bun family
310440	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
310442	7 transmembrane receptor (rhodopsin family)
310443	7 transmembrane receptor (rhodopsin family)
310444	7 transmembrane receptor (rhodopsin family)
310448	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
310448	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
310449	Transcriptional Coactivator p15 (PC4). p15 has a bipartite structure composed of an amino-terminal regulatory domain and a carboxy-terminal cryptic DNA-binding domain. The DNA-binding activity of the carboxy-terminal is disguised by the amino-terminal p15
310450	7 transmembrane receptor (rhodopsin family)
310451	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
310454	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
310460	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
310461	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
310461	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
310462	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
310463	C2 domain
310463	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
310463	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
310464	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
310465	7 transmembrane receptor (metabotropic glutamate family)
310465	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
310467	WWE domain. The WWE domain is named after three of its conserved residues and is predicted to mediate specific protein- protein interactions in ubiquitin and ADP ribose conjugation systems
310467	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
310471	PH domain. PH stands for pleckstrin homology
310480	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
310480	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
310487	7 transmembrane receptor (rhodopsin family)
310499	Ribosomal protein L21e
310506	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
310508	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
310509	ATP synthase alpha/beta chain, C terminal domain
310509	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
310509	pfam02874, ATP-synt_ab_N, ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
310510	Aminotransferase class-V
310511	WD domain, G-beta repeat
310514	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
310514	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
310515	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
310516	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
310516	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
310517	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
310519	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
310521	Enolase, N-terminal domain
310521	Enolase, C-terminal TIM barrel domain
310529	Eukaryotic porin
310531	Hsp90 protein
310531	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
310533	Cyclic nucleotide-binding domain
310533	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
310533	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
310533	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
310542	DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117
310545	HMG (high mobility group) box
310545	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310548	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310552	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
310552	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
310553	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
310553	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
310563	Polyprenyl synthetase
310565	Trypsin
310574	Actinobacillus constitutively-expressed outer membrane lipoprotein A
310575	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
310576	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
310579	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
310580	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
310585	Enolase, N-terminal domain
310585	Enolase, C-terminal TIM barrel domain
310585	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
310585	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
310587	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
310589	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
310589	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310590	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310591	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
310591	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310592	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
310592	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
310592	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
310592	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310593	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310594	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310595	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
310595	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310596	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
310596	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
310596	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310597	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
310597	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310598	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
310598	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310600	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310601	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
310601	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310601	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
310602	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
310603	Dihydrofolate reductase
310606	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
310606	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
310607	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
310609	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
310611	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
310611	N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere
310615	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
310615	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
310615	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
310617	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
310617	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
310625	Ammonium Transporter Family
310628	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
310629	ADP-ribosylation factor family
310629	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
310630	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
310630	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
310632	GTPase of unknown function
310632	ADP-ribosylation factor family
310632	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
310633	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
310635	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
310636	7 transmembrane receptor (rhodopsin family)
310637	ADP-ribosylation factor family
310637	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
310638	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
310639	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
310639	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
310641	B-box zinc finger
310641	Fibronectin type III domain
310641	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
310642	Ephrin
310643	Ephrin
310645	Phosphomevalonate kinase. Phosphomevalonate kinase (EC:2.7.4.2) catalyzes the phosphorylation of 5-phosphomevalonate into 5-diphosphomevalonate, an essential step in isoprenoid biosynthesis via the mevalonate pathway. This family represents the animal typ
310647	SH2 domain
310647	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
310648	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
310648	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
310650	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
310650	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
310650	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
310651	C2 domain
310651	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
310651	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
310652	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
310652	Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein. Slight ambiguities in the alignment
310653	Ribosomal protein L9, N-terminal domain
310654	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310657	HMG (high mobility group) box
310658	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
310659	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
310664	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
310664	DHH family. It is predicted that this family of proteins all perform a phosphoesterase function. It included the single stranded DNA exonuclease RecJ
310664	DHHA2 domain. This domain is often found adjacent to the DHH domain pfam01368 and is called DHHA2 for DHH associated domain. This domain is diagnostic of DHH subfamily 2 members. The domain is about 120 residues long and contains a conserved DXK motif at
310666	Annexin. This family of annexins also includes giardin that has been shown to function as an annexin
310667	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
310668	Methyl-CpG binding domain. The Methyl-CpG binding domain (MBD) binds to DNA that contains one or more symmetrically methylated CpGs. DNA methylation in animals is associated with alterations in chromatin structure and silencing of gene expression. MBD has
310668	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
310672	Eukaryotic-type carbonic anhydrase
310672	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
310672	tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt
310673	Ribosomal protein L6
310674	PH domain. PH stands for pleckstrin homology
310675	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
310678	Core histone H2A/H2B/H3/H4
310679	Core histone H2A/H2B/H3/H4
310680	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
310683	von Willebrand factor type A domain
310683	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
310684	NUDIX domain
310688	SH2 domain
310691	Ets-domain
310693	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
310694	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
310697	Doublecortin
310698	Protein kinase domain
310700	Beige/BEACH domain
310701	Mab-21 protein
310706	Ribosomal L38e protein family
310707	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
310707	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
310710	Synaptobrevin
310713	Core histone H2A/H2B/H3/H4
310715	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
310715	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
310716	tRNA synthetases class I (W and Y)
310716	Ribosomal Proteins L2, C-terminal domain
310716	Ribosomal Proteins L2, RNA binding domain
310718	tRNA synthetases class I (W and Y)
310720	WD domain, G-beta repeat
310720	Dip2/Utp12 Family. This domain is found at the C-terminus of proteins containing WD40 repeats. These proteins are part of the U3 ribonucleoprotein the yeast protein is called Utp12 or DIP2
310724	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
310727	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
310728	Ribosomal protein L21e
310729	Cyclophilin type peptidyl-prolyl cis-trans isomerase
310732	Sugar (and other) transporter
310738	Nerve growth factor family
310740	'Cold-shock' DNA-binding domain
310741	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
310742	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
310743	Olfactomedin-like domain
310745	Metallo-beta-lactamase superfamily
310746	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
310747	Ribosomal protein L44
310753	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
310753	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
310756	UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near t
310758	Ribosomal protein L34e
310758	F-actin capping protein alpha subunit
310760	Ribosomal protein L36e
310760	Uncharacterized ACR, COG1579
310760	Intermediate filament protein
310760	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
310760	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
310760	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
310760	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
310761	Ribosomal protein S12
310765	Ribosomal protein L21e
310769	WD domain, G-beta repeat
310771	Glycosyl hydrolases family 18
310772	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
310772	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
310772	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
310773	CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond
310778	Homeobox domain
310779	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
310779	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
310780	7 transmembrane receptor (rhodopsin family)
310781	Cytochrome b561
310781	Synaptophysin / synaptoporin
310781	Synaptogyrin. This family of proteins is distantly related to pfam01284
310782	Fibronectin type III domain
310784	Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone
310786	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
310786	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
310787	Fibronectin type III domain
310787	PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is about
310791	Mitochondrial carrier protein
310791	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
310797	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
310797	Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta str
310801	Cation efflux family. Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are thought to be efflux pumps that remove these ions from cells
310803	Ribosomal protein L13e
310806	Protein-tyrosine phosphatase
310806	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
310807	Uncharacterized ACR, COG1579
310807	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
310807	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
310808	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
310810	Reeler domain
310810	DOMON domain. The DOMON (named after dopamine beta-monooxygenase N-terminal) domain is 110-125 residues long. It is predicted to form an all beta fold with 7-8 strands. The domain may mediate extracellular adhesive interactions
310811	Paralemmin
310812	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
310815	PX domain. PX domains bind to phosphoinositides
310819	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310820	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310825	Ribosomal protein S19e
310825	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
310826	HMG (high mobility group) box
310827	Cyclophilin type peptidyl-prolyl cis-trans isomerase
310833	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
310833	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
310838	SH2 domain
310838	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
310839	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
310839	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
310842	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
310845	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
310846	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
310848	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
310848	Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti
310855	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
310856	Inorganic pyrophosphatase
310861	Ribosomal L10
310863	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
310864	Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
310864	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
310869	7 transmembrane receptor (rhodopsin family)
310870	ENV polyprotein (coat polyprotein)
310871	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
310877	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
310879	MHCK/EF2 kinase domain family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a b
310886	Sulfatase
310888	Hsp90 protein
310891	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
310891	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
310893	Ribosomal protein S24e
310894	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
310894	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
310898	SH2 domain
310899	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
310899	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
310900	Thymosin beta-4 family
310900	Lipoprotein amino terminal region. This family contains regions from: Vitellogenin, Microsomal triglyceride transfer protein and apolipoprotein B-100. These proteins are all involved in lipid transport. This family contains the LV1n chain from lipovitelli
310902	Zinc-binding dehydrogenase
310903	Zinc-binding dehydrogenase
310908	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
310909	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
310914	Protein kinase domain
310914	Activin types I and II receptor domain. This Pfam entry consists of both TGF-beta receptor types. This is an alignment of the hydrophilic cysteine-rich ligand-binding domains, Both receptor types, (type I and II) posses a 9 amino acid cysteine box, with t
310917	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
310917	Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
310922	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310923	WD domain, G-beta repeat
310928	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
310932	LIM domain. This family represents two copies of the LIM structural domain
310933	Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised
310945	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
310954	Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this f
310955	Adenylate kinase
310956	Adenylate kinase
310957	Adenylate kinase
310958	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
310960	Ubiquitin carboxyl-terminal hydrolase family 2
310962	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
310964	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
310975	Ribosomal protein L21e
310982	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
310986	Protein kinase domain
310987	Protein kinase C terminal domain
310988	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
310991	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
310996	Helix-loop-helix DNA-binding domain
310997	7 transmembrane receptor (rhodopsin family)
310999	Death domain
310999	PH domain. PH stands for pleckstrin homology
310999	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
311003	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
311003	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
311005	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
311005	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
311006	ENV polyprotein (coat polyprotein)
311006	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
311006	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
311006	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
311009	Zinc finger, C3HC4 type (RING finger)
311013	HIT family
311013	Sulfatase
311014	L1 transposable element
311014	Ribosomal protein L13e
311015	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
311019	Ribosomal L28e protein family
311022	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
311022	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
311028	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311030	VHS domain. Domain present in VPS-27, Hrs and STAM
311030	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
311038	Ribosomal protein S5, C-terminal domain
311038	Ribosomal protein S5, N-terminal domain
311039	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
311041	L1 transposable element
311043	ENV polyprotein (coat polyprotein)
311046	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
311046	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
311049	Occludin/ELL family
311050	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
311050	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
311055	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
311060	Fibronectin type II domain
311060	Lectin C-type domain. This family includes both long and short form C-type
311061	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
311065	Ribosomal protein S8
311080	Ribosomal protein L6
311082	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
311082	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
311091	Ribosomal protein L15
311091	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
311095	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311096	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
311105	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
311110	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
311111	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
311114	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
311114	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
311115	Putative zinc finger in N-recognin
311120	Ribosomal L15
311121	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
311121	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
311122	Mitochondrial carrier protein
311130	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea
311144	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
311147	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
311151	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
311159	ENV polyprotein (coat polyprotein)
311164	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
311166	Conserved hypothetical ATP binding protein. Members of this family are found in a range of archaea and eukaryotes and have hypothesised ATP binding activity
311167	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
311169	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
311171	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
311172	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
311173	7 transmembrane receptor (rhodopsin family)
311175	7 transmembrane receptor (rhodopsin family)
311176	7 transmembrane receptor (rhodopsin family)
311176	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
311179	7 transmembrane receptor (rhodopsin family)
311180	7 transmembrane receptor (rhodopsin family)
311183	Ribosomal protein S26e
311184	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
311185	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
311187	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
311187	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
311190	Low molecular weight phosphotyrosine protein phosphatase
311193	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
311197	HMG (high mobility group) box
311202	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
311210	Tetraspanin family
311211	Tetraspanin family
311215	Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mos
311218	Aminotransferase class I and II
311220	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
311228	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311236	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
311240	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311241	Riboflavin kinase / FAD synthetase. This family consists part of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activitys. They catalyse the 5'-phos
311245	TRAF-type zinc finger
311245	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
311245	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
311250	B-box zinc finger
311252	Trypsin
311252	CUB domain
311252	Sushi domain (SCR repeat)
311253	Ribosomal protein S8e
311254	e3 binding domain. This family represents a small domain of the E2 subunit of 2-oxo-acid dehydrogenases responsible for the binding of the E3 subunit
311254	2-oxo acid dehydrogenases acyltransferase (catalytic domain). These proteins contain one to three copies of a lipoyl binding domain followed by the catalytic domain
311255	Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins
311256	Ribosomal protein S11
311257	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
311258	Ribosomal protein S6e
311259	pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium
311260	LIM domain. This family represents two copies of the LIM structural domain
311265	Ribosomal protein S21e
311265	Ribosomal protein L21e
311271	Signal peptidase I
311276	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
311278	HIT family
311281	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311289	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
311295	Proteasome A-type and B-type
311297	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311307	Domain of unknown function DUF71. This family of proteins have no known function. This domain is about 200 amino acids long with a strongly conserved motif SGGKD at the N terminus.In some members of this family, this domain is associated with pfam01042
311322	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
311324	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
311326	RNA pseudouridylate synthase. Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes RluD, a pseudouridylate synthase that converts specific uracils to
311329	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
311330	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
311331	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
311332	Delta serrate ligand
311334	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termin
311336	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
311337	Protein kinase domain
311341	C2 domain
311341	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop
311342	C2 domain
311342	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
311342	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop
311343	C2 domain
311343	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop
311344	S25 ribosomal protein
311345	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
311352	Adenosine/AMP deaminase
311352	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
311353	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
311354	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
311358	TMS membrane protein/tumour differentially expressed protein (TDE)
311362	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
311368	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
311373	Zinc finger, C3HC4 type (RING finger)
311373	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
311375	SH2 domain
311382	Ribosomal protein S5, C-terminal domain
311384	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
311387	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells
311388	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311390	Ribosomal protein S11
311391	SH2 domain
311391	Intermediate filament protein
311391	Phosphotyrosine interaction domain (PTB/PID)
311391	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
311391	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
311396	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
311397	AMP-binding enzyme
311399	Ubiquitin carboxyl-terminal hydrolase family 2
311399	Ubiquitin carboxyl-terminal hydrolases family 2
311401	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
311401	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
311402	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
311404	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
311406	Protein-tyrosine phosphatase
311406	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
311416	Vinculin family
311418	Ribosomal L22e protein family
311419	Protein kinase domain
311421	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311422	Ham1 family. This family consists of the HAM1 protein and hypothetical archaeal bacterial and C. elegans proteins. HAM1 controls 6-N-hydroxylaminopurine (HAP) sensitivity and mutagenesis in S. cerevisiae. The HAM1 protein protects the cell from HAP, eithe
311424	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
311425	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
311425	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
311426	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
311427	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
311431	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
311437	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
311441	Eukaryotic initiation factor 3, gamma subunit. eIF-3 is a multisubunit complex that stimulates translation initiation in vitro at several different steps. This family corresponds to the gamma subunit if eIF3
311442	CDP-alcohol phosphatidyltransferase. All of these members have the ability to catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond
311443	Fibronectin type III domain
311446	FMN-dependent dehydrogenase
311446	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
311446	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
311446	Conserved region in glutamate synthase. This family represents a region of the glutamate synthase protein. This region is expressed as a separate subunit in the glutamate synthase alpha subunit from archaebacteria, or part of a large multidomain enzyme in
311450	Protein kinase domain
311451	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
311456	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
311462	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
311463	ENV polyprotein (coat polyprotein)
311464	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
311468	Asparaginase
311469	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
311469	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
311471	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
311474	Appr-1"-p processing enzyme family. This domain is found in a number of otherwise unrelated proteins. This domain is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA vi
311475	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
311475	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
311477	CTF/NF-I family
311478	PX domain. PX domains bind to phosphoinositides
311480	Kinesin motor domain
311483	Spumavirus gag protein
311483	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
311483	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
311491	Papain family cysteine protease
311491	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
311498	Rap/ran-GAP
311503	Homeobox domain
311504	Enolase, N-terminal domain
311504	Enolase, C-terminal TIM barrel domain
311505	'Paired box' domain
311508	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
311510	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
311511	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
311512	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
311513	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
311514	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
311515	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
311517	Core histone H2A/H2B/H3/H4
311517	Utp11 protein. This protein is found to be part of a large ribonucleoprotein complex containing the U3 snoRNA. Depletion of the Utp proteins impedes production of the 18S rRNA, indicating that they are part of the active pre-rRNA processing complex. This
311521	Ribonuclease T2 family
311523	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
311525	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
311526	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
311527	Protein kinase domain
311527	Apoptosis regulator proteins, Bcl-2 family
311528	Uncharacterised protein family (UPF0080)
311529	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
311529	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
311530	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
311532	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
311538	HMG (high mobility group) box
311542	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
311544	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
311544	PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi
311545	Membrane protein of unknown function (DUF435). The members of this family are membrane proteins of unknown function. In some proteins this region is found associated with pfam02225
311547	Fork head domain
311547	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
311547	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved trypt
311548	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
311557	C-5 cytosine-specific DNA methylase
311557	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
311558	LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
311558	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
311560	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
311561	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
311564	Eukaryotic protein of unknown function, DUF279
311565	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311566	Ribosomal protein L11, RNA binding domain
311566	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
311567	C2 domain
311567	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
311569	AMP-binding enzyme
311570	Myosin head (motor domain)
311570	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
311570	Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown
311570	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
311571	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
311571	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
311574	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
311574	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
311575	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
311577	SH2 domain
311577	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
311577	Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to
311578	Intermediate filament protein
311578	Ribosomal protein S5, C-terminal domain
311578	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
311580	HD domain. HD domains are metal dependent phosphohydrolases
311582	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
311587	Uncharacterised protein family (UPF0041)
311604	Homeobox domain
311605	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
311607	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
311607	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
311607	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
311609	Fibronectin type III domain
311612	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
311613	Zinc finger, C2HC type. This is a DNA binding zinc finger domain
311613	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
311614	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
311615	HMG (high mobility group) box
311616	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
311618	SCP-like extracellular protein. This domain is also found in prokaryotes
311621	TPR Domain
311622	Protein kinase domain
311626	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
311627	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
311628	Ribosomal protein L21e
311629	PX domain. PX domains bind to phosphoinositides
311639	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
311641	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
311642	Sulfatase
311642	Protein of unknown function (DUF229). Members of this family are uncharacterised. They are 500-1200 amino acids in length and share a long region conservation that probably corresponds to several domains
311647	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
311647	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
311655	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
311655	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
311667	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
311667	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311670	WD domain, G-beta repeat
311671	Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase
311672	Syntaxin
311674	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
311674	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
311675	ADP-ribosylation factor family
311675	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
311678	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
311681	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
311685	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
311687	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311688	ENV polyprotein (coat polyprotein)
311688	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
311689	Endothelin family
311691	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
311693	Cadherin domain
311701	GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do
311715	Ribosomal protein L21e
311715	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
311716	Fibronectin type III domain
311716	von Willebrand factor type A domain
311716	Thrombospondin N-terminal -like domain
311716	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
311720	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
311721	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
311723	HMG (high mobility group) box
311724	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311726	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
311727	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
311727	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
311730	Auxin Efflux Carrier
311730	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
311731	Peptidase family U34
311735	GTP1/OBG family
311735	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
311738	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
311740	Ribosomal L39 protein
311741	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
311745	Cyclic nucleotide-binding domain
311748	metallopeptidase family M24
311751	Adenylate kinase
311756	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
311758	Uncharacterized ACR, COG1579
311758	KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif
311758	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
311759	Thrombospondin type 1 domain
311761	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
311765	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
311767	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
311768	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
311775	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
311776	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
311777	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
311781	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
311782	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
311783	Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18
311784	Interleukin-1 / 18. This family includes interleukin-1 and interleukin-18
311785	PH domain. PH stands for pleckstrin homology
311785	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
311786	TRAF-type zinc finger
311787	TRAF-type zinc finger
311788	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
311789	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
311791	MYND finger
311792	GDA1/CD39 (nucleoside phosphatase) family
311793	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
311794	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
311795	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
311799	Flavodoxin
311799	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
311799	FAD binding domain. This domain is found in sulfite reductase, NADPH cytochrome P450 reductase, Nitric oxide synthase and methionine synthase reductase
311802	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
311803	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
311807	Myc amino-terminal region. The myc family belongs to the basic helix-loop-helix leucine zipper class of transcription factors, see pfam00010. Myc forms a heterodimer with Max, and this complex regulates cell growth through direct activation of genes invol
311809	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
311810	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
311811	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
311812	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
311816	Ribosomal Proteins L2, C-terminal domain
311816	Ribosomal Proteins L2, RNA binding domain
311821	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
311822	Ribosomal protein L6
311825	Protein kinase domain
311826	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
311827	Thrombospondin type 1 domain
311833	Adenylate kinase
311839	AMP-binding enzyme
311843	Uncharacterized ACR, COG2106
311844	Aminotransferase class I and II
311844	DegT/DnrJ/EryC1/StrS aminotransferase family. The members of this family are probably all pyridoxal-phosphate-dependent aminotransferase enzymes with a variety of molecular functions. The family includes StsA, StsC and StsS. The aminotransferase activity
311844	Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OAH
311845	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
311845	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
311853	ubiE/COQ5 methyltransferase family
311853	O-methyltransferase. This family includes a range of O-methyltransferases. These enzymes utilise S-adenosyl methionine
311856	Ubiquitin carboxyl-terminal hydrolase family 2
311856	Ubiquitin carboxyl-terminal hydrolases family 2
311856	Zn-finger in ubiquitin-hydrolases and other protein
311858	Arginosuccinate synthase. This family contains a PP-loop motif
311860	SH2 domain
311860	Protein kinase domain
311860	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
311861	Fibrinogen beta and gamma chains, C-terminal globular domain
311862	Laminin B (Domain IV)
311862	Intermediate filament protein
311862	Laminin N-terminal (Domain VI)
311862	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
311862	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
311862	ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits a
311864	Phosphoribulokinase / Uridine kinase family
311868	Rap/ran-GAP
311870	PH domain. PH stands for pleckstrin homology
311872	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
311876	Homeobox domain
311876	PBX domain. The PBX domain is a bipartite acidic domain
311878	PBX domain. The PBX domain is a bipartite acidic domain
311880	Vacuolar sorting protein 9 (VPS9) domain
311880	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
311881	WD domain, G-beta repeat
311882	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
311883	CutA1 divalent ion tolerance protein. Several gene loci with a possible involvement in cellular tolerance to copper have been identified. One such locus in eubacteria and archaebacteria, cutA, is thought to be involved in cellular tolerance to a wide vari
311885	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
311885	Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fo
311886	Protein of unknown function (DUF390). This family of long proteins are currently only found in the rice genome. They have no known function. However they may be some kind of transposable element
311889	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
311898	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
311901	Homeobox domain
311901	LIM domain. This family represents two copies of the LIM structural domain
311902	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
311903	Ribosome recycling factor. The ribosome recycling factor (RRF / ribosome release factor) dissociates the ribosome from the mRNA after termination of translation, and is essential bacterial growth. Thus ribosomes are "recycled" and ready for another round
311905	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
311906	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
311907	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
311909	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
311910	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
311910	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
311911	7 transmembrane receptor (rhodopsin family)
311911	Phosphotyrosine interaction domain (PTB/PID)
311914	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
311914	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
311914	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
311917	Protein kinase domain
311918	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
311919	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
311923	Cyclophilin type peptidyl-prolyl cis-trans isomerase
311923	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
311926	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
311934	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
311936	IQ calmodulin-binding motif. Calmodulin-binding motif
311942	Homeobox domain
311944	HMG (high mobility group) box
311944	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
311950	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
311952	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
311955	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh
311956	WD domain, G-beta repeat
311957	NAD-dependent glycerol-3-phosphate dehydrogenase
311959	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
311965	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
311968	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
311968	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
311973	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
311976	Zinc-binding dehydrogenase
311977	ADP-ribosylation factor family
311977	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
311980	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
311996	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
312000	Myosin head (motor domain)
312003	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
312007	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
312008	Myosin head (motor domain)
312009	Myosin head (motor domain)
312011	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
312015	Myosin head (motor domain)
312020	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
312020	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
312021	Myosin head (motor domain)
312024	Myosin head (motor domain)
312032	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
312034	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
312035	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
312036	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
312049	Uncharacterized protein family UPF0036
312049	NADP oxidoreductase coenzyme F420-dependent
312051	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312052	NADP oxidoreductase coenzyme F420-dependent
312053	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
312053	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
312054	GTP1/OBG family
312054	GTPase of unknown function
312054	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
312054	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
312067	mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein
312068	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
312069	Protein kinase domain
312071	Protein kinase domain
312075	Protein kinase domain
312076	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312077	Intermediate filament protein
312078	Apoptosis regulator proteins, Bcl-2 family
312086	Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine
312086	Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. H
312096	Homeobox domain
312100	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
312102	HSF-type DNA-binding
312103	Hsp90 protein
312105	eRF1 domain 2. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
312105	eRF1 domain 3. The release factor eRF1 terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase center. The crystal structure of human eRF1 is known.
312108	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
312113	START domain
312113	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
312115	von Willebrand factor type A domain
312115	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
312115	Ku70/Ku80 N-terminal alpha/beta domain. The Ku heterodimer (composed of Ku70 and Ku80) contributes to genomic integrity through its ability to bind DNA double-strand breaks and facilitate repair by the non-homologous end-joining pathway. This is the amino
312115	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
312120	Ribosomal family S4e
312127	Thrombospondin type 1 domain
312130	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
312136	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312140	Fork head domain
312142	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312148	wnt family
312149	wnt family
312150	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
312150	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
312152	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
312157	wnt family
312162	Ribosomal protein S5, C-terminal domain
312167	Zinc finger, C3HC4 type (RING finger)
312167	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
312168	Ribosomal protein L36e
312170	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
312171	Hyaluronidase
312174	Telomere-binding protein alpha subunit, N-terminal domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. Al
312178	Core histone H2A/H2B/H3/H4
312179	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
312182	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312186	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
312189	Tetraspanin family
312191	Ribosomal protein L19e
312192	S-adenosyl-L-homocysteine hydrolase
312192	TrkA-N domain. This domain is found in a wide variety of proteins. These protein include potassium channels, phosphoesterases, and various other transporters. This domain binds to NAD
312192	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
312199	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
312201	Zinc carboxypeptidase
312201	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol
312202	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
312207	Translation initiation factor SUI1
312210	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
312210	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
312213	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
312217	HMG (high mobility group) box
312221	O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production
312222	Ribosomal protein L21e
312225	WD domain, G-beta repeat
312226	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predict
312227	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312228	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
312240	HMG (high mobility group) box
312242	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
312243	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
312245	Sugar (and other) transporter
312248	Ribosomal protein L14p/L23e
312250	Protein of unknown function, DUF259
312251	Protein of unknown function, DUF259
312255	Sugar (and other) transporter
312256	ADP-ribosylation factor family
312256	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
312257	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
312257	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
312257	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
312258	ABC1 family. This family includes ABC1 from yeast and AarF from E. coli. These proteins have a nuclear or mitochondrial subcellular location in eukaryotes. The exact molecular functions of these proteins is not clear, however yeast ABC1 suppresses a cytoc
312260	Protein kinase domain
312261	Thymosin beta-4 family
312262	Protein kinase domain
312263	Raf-like Ras-binding domain
312263	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
312263	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
312269	Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues
312273	Trypsin
312275	Protein kinase domain
312275	Fibronectin type III domain
312275	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
312275	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
312279	Protein kinase domain
312279	Fibronectin type III domain
312279	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
312279	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
312281	7 transmembrane receptor (rhodopsin family)
312282	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
312282	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
312288	ENV polyprotein (coat polyprotein)
312299	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
312301	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
312301	ATP synthase, Delta/Epsilon chain, beta-sandwich domain. Part of the ATP synthase CF(1). These subunits are part of the head unit of the ATP synthase. The subunits are called delta and epsilon in human and metazoan species but in bacterial species the del
312302	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
312302	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
312303	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
312303	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
312303	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
312312	GTPase of unknown function
312314	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
312315	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
312316	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
312316	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge
312317	Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
312317	Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous
312317	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge
312319	Ribosomal L29e protein family
312320	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
312320	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312321	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312325	Guanylate kinase
312327	Ribosomal protein L15
312331	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
312331	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
312333	Ribosomal protein L21e
312335	Homeobox domain
312336	Homeobox domain
312337	Homeobox domain
312338	Homeobox domain
312339	Homeobox domain
312343	Ribosomal L15
312346	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312350	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
312351	Serine carboxypeptidase
312357	SH2 domain
312359	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
312360	FKBP-type peptidyl-prolyl cis-trans isomerase
312361	PH domain. PH stands for pleckstrin homology
312363	Ribosomal protein L11, RNA binding domain
312363	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
312370	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
312372	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
312374	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312376	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
312377	7 transmembrane receptor (rhodopsin family)
312377	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
312379	7 transmembrane receptor (rhodopsin family)
312379	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
312380	Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest
312381	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
312382	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
312384	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
312385	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
312389	Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid p
312391	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
312391	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
312393	HIT family
312398	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
312398	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
312400	7 transmembrane receptor (rhodopsin family)
312403	HMG (high mobility group) box
312407	Fibronectin type III domain
312408	Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 an
312409	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312411	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312413	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312415	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312416	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312418	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312419	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312420	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312421	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312423	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312424	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312425	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
312426	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312429	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312430	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312430	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
312431	Fibrillarin
312431	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
312436	Low molecular weight phosphotyrosine protein phosphatase
312438	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
312438	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
312440	jmjC domain
312440	jmjC domain
312442	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
312451	HMG (high mobility group) box
312465	Vinculin family
312477	PTB domain (IRS-1 type)
312478	Lysyl oxidase
312478	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
312479	Homeobox domain
312480	Zinc finger, C3HC4 type (RING finger)
312483	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain
312483	pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
312488	ATP synthase alpha/beta chain, C terminal domain
312488	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
312488	pfam02874, ATP-synt_ab_N, ATP synthase alpha/beta family, beta-barrel domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
312489	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
312492	C2 domain
312495	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
312495	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
312498	Exocyst complex subunit Sec15-like
312498	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
312501	Protein kinase domain
312502	C2 domain
312505	'Cold-shock' DNA-binding domain
312505	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
312507	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
312510	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312514	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
312516	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
312519	Protein kinase domain
312522	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
312526	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
312527	Uncharacterized ACR, COG2135
312529	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
312530	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
312532	Ribosomal protein L21e
312533	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
312535	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
312535	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
312536	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
312536	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
312537	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
312538	MCM2/3/5 family
312542	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
312544	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
312544	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
312545	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
312548	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
312549	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
312552	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
312558	wnt family
312560	DNA repair protein Rad4
312563	LIM domain. This family represents two copies of the LIM structural domain
312566	Thrombospondin type 1 domain
312566	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
312566	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
312568	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
312574	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312575	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312585	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
312589	Helix-loop-helix DNA-binding domain
312600	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
312601	Macrophage migration inhibitory factor (MIF)
312607	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
312607	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
312611	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
312612	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
312616	Poly A polymerase family. This family includes nucleic acid independent RNA polymerases, such as Poly(A) polymerase, which adds the poly (A) tail to mRNA EC:2.7.7.19. This family also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of
312617	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312617	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
312622	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
312626	Ribosomal protein S6e
312630	DNA polymerase epsilon subunit B. DNA polymerase epsilon is essential for cell viability and chromosomal DNA replication in budding yeast. In addition, DNA polymerase epsilon may be involved in DNA repair and cell-cycle checkpoint control. The enzyme cons
312636	CIDE-N domain. This domain is found in CAD nuclease, and ICAD, the inhibitor of CAD nuclease. The two proteins interact through this domain
312638	Giardia variant-specific surface protein
312647	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
312649	tRNA intron endonuclease, catalytic C-terminal domain. Members of this family cleave pre tRNA at the 5' and 3' splice sites to release the intron EC:3.1.27.9
312651	WD domain, G-beta repeat
312652	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
312652	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
312652	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
312657	7 transmembrane receptor (rhodopsin family)
312659	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
312663	Core histone H2A/H2B/H3/H4
312664	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
312666	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
312667	Intermediate filament protein
312667	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
312667	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
312667	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
312667	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
312667	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
312667	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
312668	Cache domain
312670	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
312674	wnt family
312676	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
312678	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
312681	Mitochondrial carrier protein
312681	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
312682	Apoptosis regulator proteins, Bcl-2 family
312686	Ubiquitin carboxyl-terminal hydrolase family 2
312687	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
312689	Cation-dependent mannose-6-phosphate receptor
312690	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
312695	DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds
312696	Lectin C-type domain. This family includes both long and short form C-type
312698	Lectin C-type domain. This family includes both long and short form C-type
312700	Trypsin
312700	CUB domain
312700	Sushi domain (SCR repeat)
312701	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
312704	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
312706	Suppressor Mra1. The suppressor Mra1 is found in high-copy-number when Ras1 is mutated, that recovers the mating deficiency caused by the decrease of Ras1 activity. Mutational analysis in yeast suggests that the suppressor Mra1 is essential for cell growt
312710	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
312714	von Willebrand factor type D domain
312721	Protein kinase domain
312723	Ribosomal protein S7e
312727	Tetraspanin family
312728	Tetraspanin family
312736	TEA/ATTS domain family
312738	TEA/ATTS domain family
312739	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
312740	Tub family
312742	Lectin C-type domain. This family includes both long and short form C-type
312744	Lectin C-type domain. This family includes both long and short form C-type
312745	Lectin C-type domain. This family includes both long and short form C-type
312748	Lectin C-type domain. This family includes both long and short form C-type
312750	'chromo' (CHRromatin Organization MOdifier) domain
312750	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
312751	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
312752	Lectin C-type domain. This family includes both long and short form C-type
312754	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
312756	Lectin C-type domain. This family includes both long and short form C-type
312758	Lectin C-type domain. This family includes both long and short form C-type
312759	Lectin C-type domain. This family includes both long and short form C-type
312760	Lectin C-type domain. This family includes both long and short form C-type
312762	Lectin C-type domain. This family includes both long and short form C-type
312763	Lectin C-type domain. This family includes both long and short form C-type
312777	Ets-domain
312777	Sterile alpha motif (SAM)/Pointed domain
312782	Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure
312787	7 transmembrane receptor (rhodopsin family)
312790	7 transmembrane receptor (metabotropic glutamate family)
312791	7 transmembrane receptor (metabotropic glutamate family)
312792	Ribosomal S17
312803	Core histone H2A/H2B/H3/H4
312804	Core histone H2A/H2B/H3/H4
312806	NAD:arginine ADP-ribosyltransferase
312807	RHO protein GDP dissociation inhibitor
312808	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
312811	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
312812	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
312816	Pentapeptide repeats (8 copies). These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknow
312818	LIM domain. This family represents two copies of the LIM structural domain
312824	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
312824	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
312826	Cytidylyltransferase. This family consists of two main Cytidylyltransferase activities: 1) 3-deoxy-manno-octulosonate cytidylyltransferase,, EC:2.7.7.38 catalysing the reaction:- CTP + 3-deoxy-D-manno-octulosonate <=> diphosphate + CMP-3-deoxy-D-manno-oct
312827	Sulfotransferase protein
312828	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
312828	Domain of unknown function (DUF227). This family includes a large number of drosophila proteins of unknown function. The family also includes several C. elegans proteins. The alignment contains many histidines and aspartates that are conserved, suggesting
312828	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
312839	Intermediate filament tail domain
312843	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
312843	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
312850	Protein kinase domain
312851	Helix-loop-helix DNA-binding domain
312855	Uncharacterized ACR, COG1579
312855	Intermediate filament protein
312855	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
312855	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
312855	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
312858	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
312858	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
312860	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
312866	Ribosomal family S4e
312869	Ribosomal L15
312872	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
312873	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
312874	Myosin head (motor domain)
312875	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
312879	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
312882	N-acetylmuramoyl-L-alanine amidase. This family includes zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity EC:3.5.1.28. This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (prefere
312882	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
312883	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
312884	Homeobox domain
312885	Lectin C-type domain. This family includes both long and short form C-type
312887	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
312890	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
312890	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
312893	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
312896	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
312896	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
312897	Helix-loop-helix DNA-binding domain
312903	Prothymosin/parathymosin family. Prothymosin alpha and parathymosin are two ubiquitous small acidic nuclear proteins that are thought to be involved in cell cycle progression, proliferation, and cell differentiation
312903	Longevity-assurance protein (LAG1). Members of this family are involved in determining life span. The molecular mechanisms by which LAG1 determines longevity are unclear, although some evidence suggest a participation in ceramide synthesis
312906	ENV polyprotein (coat polyprotein)
312906	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
312907	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
312907	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
312912	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
312912	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
312913	Zinc carboxypeptidase
312913	Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of fol
312915	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
312916	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
312917	Helix-loop-helix DNA-binding domain
312919	Iron-containing alcohol dehydrogenase
312921	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312923	PH domain. PH stands for pleckstrin homology
312923	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
312924	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
312925	TIP41-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 interacts with TAP42 and negatively regulates the TOR signaling pathway
312930	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
312932	Transcription factor S-II (TFIIS)
312933	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
312936	HMG (high mobility group) box
312937	Doublecortin
312938	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
312938	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
312939	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
312940	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
312946	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
312946	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
312947	Zinc-binding dehydrogenase
312947	Conserved hypothetical protein 95
312947	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
312947	O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production
312947	Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat
312948	Ribosomal protein L19e
312949	Fork head domain
312950	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
312952	Inositol monophosphatase family
312959	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
312959	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea
312964	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
312965	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
312967	Beige/BEACH domain
312967	WD domain, G-beta repeat
312967	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
312974	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
312974	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
312974	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
312979	CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650
312980	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
312987	Ribosomal protein L31e
312990	Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons
312993	Ribosomal protein L24e
312994	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
312996	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
312999	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
313000	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
313000	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
313001	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
313008	Ribosomal protein L44
313011	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
313011	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
313013	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
313013	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
313015	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
313017	14-3-3 protein
313017	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
313020	WD domain, G-beta repeat
313030	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
313034	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
313035	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
313038	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
313039	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
313039	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
313039	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
313041	HMG (high mobility group) box
313045	Intermediate filament protein
313045	KE2 family protein. The function of members of this family is unknown, although they have been suggested to contain a DNA binding leucine zipper motif
313045	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
313045	PspA/IM30 family. This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins
313045	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
313047	tRNA synthetases class I (W and Y)
313047	Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti
313048	WD domain, G-beta repeat
313049	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
313050	SH2 domain
313050	Protein kinase domain
313050	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
313051	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
313052	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
313056	WD domain, G-beta repeat
313057	TMS membrane protein/tumour differentially expressed protein (TDE)
313058	Thrombospondin type 1 domain
313058	7 transmembrane receptor (rhodopsin family)
313058	7 transmembrane receptor (Secretin family)
313058	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
313058	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
313058	Paramyxovirus P/V phosphoprotein. Paramyxoviral P genes are able to generate more than one product, using alternative reading frames and RNA editing. The P gene encodes the structural phosphoprotein P. In addition, it encodes several non-structural protei
313063	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
313063	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
313063	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
313069	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
313070	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
313074	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
313074	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
313076	MYND finger
313087	C2 domain
313089	Amino acid permease
313090	ATP synthase (C/AC39) subunit. This family includes the AC39 subunit from vacuolar ATP synthase, and the C subunit from archaebacterial ATP synthase. The family also includes subunit C from the Sodium transporting ATP synthase from Enterococcus hirae
313093	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
313098	Ubiquitin carboxyl-terminal hydrolase family 2
313099	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
313101	Ribosomal protein L34e
313109	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
313110	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
313111	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
313112	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
313114	Ribosomal S3Ae family
313121	Protein kinase domain
313125	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
313126	Connexin
313129	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
313130	Peptidase family M20/M25/M40. This family includes a range of zinc metallopeptidases belonging to several families in the peptidase classification. Family M20 are Glutamate carboxypeptidases. Peptidase family M25 contains X-His dipeptidases
313131	mRNA capping enzyme, catalytic domain. This family represents the ATP binding catalytic domain of the mRNA capping enzyme
313131	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
313132	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
313133	mRNA capping enzyme, catalytic domain. This family represents the ATP binding catalytic domain of the mRNA capping enzyme
313134	Ribosomal protein L31e
313139	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
313141	L1 transposable element
313142	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
313149	ENV polyprotein (coat polyprotein)
313149	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
313149	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
313149	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
313152	Interferon alpha/beta domain
313153	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
313156	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
313163	ADP-ribosylation factor family
313163	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
313166	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
313167	Nrap protein. Members of this family are nucleolar RNA-associated proteins (Nrap) which are highly conserved from yeast (Saccharomyces cerevisiae) to human. In the mouse, Nrap is ubiquitously expressed and is specifically localized in the nucleolus. Nrap
313168	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
313169	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
313169	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
313170	Kinesin motor domain
313171	WD domain, G-beta repeat
313177	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
313182	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24
313183	Actin
313184	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
313191	Sushi domain (SCR repeat)
313191	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
313192	von Willebrand factor type A domain
313197	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
313200	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
313200	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
313206	HIT family
313208	Fibronectin type III domain
313208	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
313210	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
313212	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
313213	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
313213	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
313216	Zinc finger, C3HC4 type (RING finger)
313219	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
313228	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
313229	Calsequestrin
313229	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
313231	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
313233	QXW lectin repeat
313233	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
313234	PH domain. PH stands for pleckstrin homology
313234	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
313235	WD domain, G-beta repeat
313236	B-box zinc finger
313236	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
313240	SH2 domain
313240	Ribosomal protein L31e
313241	Mitochondrial carrier protein
313242	WD domain, G-beta repeat
313244	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
313247	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
313249	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
313254	Kinesin motor domain
313255	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
313256	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
313258	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
313259	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
313264	Zinc finger, C3HC4 type (RING finger)
313267	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
313270	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
313270	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
313273	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
313274	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
313276	jmjC domain
313276	Protein kinase C terminal domain
313277	jmjC domain
313278	Fibronectin type III domain
313278	Protein-tyrosine phosphatase
313283	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
313284	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
313296	WD domain, G-beta repeat
313297	HMG (high mobility group) box
313303	SH2 domain
313304	Stromal antigen (SA/STAG) protein
313310	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
313312	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
313313	HMG (high mobility group) box
313313	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
313317	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
313318	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
313318	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
313322	Calx-beta domain
313322	DAN domain. This domain contains 9 conserved cysteines and is extracellular. Therefore the cysteines may form disulphide bridges. This family of proteins has been termed the DAN family after the first member to be reported. This family includes DAN, Cerbe
313323	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
313323	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
313331	Skp1 family, tetramerisation domain
313331	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
313332	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
313333	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
313339	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
313340	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
313340	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
313340	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
313341	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
313343	YEATS family. We have named this family the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit'. This family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity
313348	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
313348	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
313350	Phosphorylase family 2
313352	DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown
313352	DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai
313353	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
313355	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
313357	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
313362	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
313364	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
313368	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
313372	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
313373	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
313373	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
313374	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
313375	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
313375	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
313379	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
313382	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
313383	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
313385	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
313386	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
313387	Ubiquitin carboxyl-terminal hydrolase family 2
313387	Ubiquitin carboxyl-terminal hydrolases family 2
313389	Fibrinogen beta and gamma chains, C-terminal globular domain
313391	Peptidase family C54
313393	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
313399	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
313409	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
313410	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain
313410	pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
313415	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
313416	WD domain, G-beta repeat
313418	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
313418	ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. EL
313421	Thrombospondin type 1 domain
313421	Low-density lipoprotein receptor domain class A
313421	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl
313426	Ribosomal protein L21e
313429	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
313429	tRNA synthetase class II core domain (G, H, P, S and T). Other tRNA synthetase sub-families are too dissimilar to be included. This domain is the core catalytic domain of tRNA synthetases and includes glycyl, histidyl, prolyl, seryl and threonyl tRNA synt
313438	PH domain. PH stands for pleckstrin homology
313439	Ribosomal protein S19
313443	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
313443	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
313444	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
313445	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
313447	Ribosomal protein S26e
313449	Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3
313450	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
313451	Homeobox domain
313452	Homeobox domain
313454	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
313455	Homeobox domain
313467	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
313471	DMRTA motif. This region is found to the C-terminus of the pfam00751. DM-domain proteins with this motif are known as DMRTA proteins. The function of this region is unknown
313471	DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai
313472	Enolase, N-terminal domain
313472	Enolase, C-terminal TIM barrel domain
313474	Uncharacterized ACR, COG1579
313474	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
313474	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
313476	Calreticulin family
313477	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
313478	C2 domain
313479	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
313479	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
313480	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
313484	DM DNA binding domain. The DM domain is named after dsx and mab-3. dsx contains a single amino-terminal DM domain, whereas mab-3 contains two amino-terminal domains. The DM domain has a pattern of conserved zinc chelating residues C2H2C4. The dsx DM domai
313487	SCP-like extracellular protein. This domain is also found in prokaryotes
313491	HIT family
313492	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
313495	Eukaryotic-type carbonic anhydrase
313499	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
313504	Fork head domain
313505	Fork head domain
313507	Helix-loop-helix DNA-binding domain
313508	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
313509	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
313511	Protein kinase domain
313511	Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known struc
313512	Homeobox domain
313513	Ribosomal protein S5, C-terminal domain
313518	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
313522	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
313524	Eukaryotic ribosomal protein L18
313525	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
313526	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
313527	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
313533	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
313535	HesB-like domain. This family includes HesB which may be involved in nitrogen fixation
313535	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
313538	Transforming growth factor beta like domain
313539	jmjN domain
313539	jmjC domain
313539	Ribosomal protein L19e
313540	Protein kinase domain
313543	Protein kinase domain
313546	7 transmembrane receptor (rhodopsin family)
313547	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
313548	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
313553	Carbamoyl-phosphate synthase L chain, ATP binding domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosy
313558	Fork head domain
313560	Glutamine amidotransferase class-I
313564	Peptidase family M48
313567	Palmitoyl protein thioesterase
313568	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
313569	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
313572	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
313573	Coenzyme A transferase
313575	Helix-loop-helix DNA-binding domain
313576	Coenzyme A transferase
313576	Transforming growth factor beta like domain
313576	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
313577	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
313581	Utp11 protein. This protein is found to be part of a large ribonucleoprotein complex containing the U3 snoRNA. Depletion of the Utp proteins impedes production of the 18S rRNA, indicating that they are part of the active pre-rRNA processing complex. This
313582	LIM domain. This family represents two copies of the LIM structural domain
313588	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
313592	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
313594	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
313596	Transcription factor AP-2
313599	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
313600	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
313600	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
313601	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
313602	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
313608	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
313609	Protein kinase domain
313610	Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mos
313611	Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl
313611	Platelet-activating factor acetylhydrolase, plasma/intracellular isoform II. Platelet-activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl
313614	Ribosomal L10
313615	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
313617	Glycosyl hydrolase family 47. Members of this family are alpha-mannosidases that catalyse the hydrolysis of the terminal 1,2-linked alpha-D-mannose residues in the oligo-mannose oligosaccharide Man(9)(GlcNAc)(2)
313618	Intermediate filament protein
313618	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
313618	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
313618	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
313625	Tissue factor
313625	Fibronectin type III domain
313625	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
313626	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
313629	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
313632	Putative tRNA binding domain. This domain is found in prokaryotic methionyl-tRNA synthetases, prokaryotic phenylalanyl tRNA synthetases the yeast GU4 nucleic-binding protein (G4p1 or p42, ARC1), human tyrosyl-tRNA synthetase, and endothelial-monocyte acti
313633	Protein kinase domain
313633	Fibronectin type III domain
313633	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
313635	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
313636	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
313637	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
313641	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
313641	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
313644	Rap/ran-GAP
313647	linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types
313648	Dolichyl-diphosphooligosaccharide-protein glycosyltransferase 48kD subunit. Members of this family are involved in asparagine-linked protein glycosylation. In particular, dolichyl-diphosphooligosaccharide-protein glycosyltransferase (DDOST), also known as
313649	Cytidine and deoxycytidylate deaminase zinc-binding region
313652	Ribosomal protein L11, RNA binding domain
313652	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
313653	von Willebrand factor type A domain
313654	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
313658	Putative zinc finger in N-recognin
313659	Homeobox domain
313659	'Paired box' domain
313665	Hsp20/alpha crystallin family
313666	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
313667	PH domain. PH stands for pleckstrin homology
313667	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
313668	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
313670	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
313672	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
313676	ribosomal L5P family C-terminus. This region is found associated with pfam00281
313679	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
313680	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
313682	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
313685	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
313687	Ribosomal family S4e
313687	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
313687	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
313688	Carboxylesterase
313689	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
313692	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
313693	Ribosomal protein L6
313694	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
313697	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
313698	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
313702	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
313703	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated
313705	AcrB/AcrD/AcrF family. Members of this family are integral membrane proteins. Some are involved in drug resistance
313705	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
313706	UbiA prenyltransferase family
313707	HRDC domain. The HRDC (Helicase and RNase D C-terminal) domain has a putative role in nucleic acid binding. Mutations in the HRDC domain cause human disease
313707	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termin
313708	Ribosomal protein S5, C-terminal domain
313714	Ribosomal L29e protein family
313721	Ribosomal protein S5, C-terminal domain
313721	Ribosomal protein S5, N-terminal domain
313722	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
313723	Ribosomal protein S5, N-terminal domain
313725	7 transmembrane receptor (Secretin family)
313726	Sugar (and other) transporter
313734	CG-1 domain. CG-1 domains are highly conserved domains of about 130 amino-acid residues containing a predicted bipartite NLS and named after a partial cDNA clone isolated from parsley encoding a sequence-specific DNA-binding protein. CG-1 domains are asso
313735	Disulfide bond formation protein DsbB. This family consists of disulfide bond formation protein DsbB from bacteria. The DsbB protein oxidizes the periplasmic protein DsbA which in turn oxidizes cysteines in other periplasmic proteins in order to make disu
313741	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
313741	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
313742	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
313743	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
313743	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
313745	Death domain
313745	PH domain. PH stands for pleckstrin homology
313745	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
313748	Isoprenylcysteine carboxyl methyltransferase (ICMT) family. The isoprenylcysteine o-methyltransferase (EC:2.1.1.100) family carry out carboyxl methylation of cleaved eukaryotic proteins that terminate in a CaaX motif. In Saccharomyces cerevisiae this meth
313755	Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be
313756	C2 domain
313756	PH domain. PH stands for pleckstrin homology
313756	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
313756	Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit
313756	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
313760	Caldesmon
313760	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
313767	Elongation factor 1 gamma, conserved domain
313767	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
313767	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
313772	PH domain. PH stands for pleckstrin homology
313772	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
313772	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
313777	Uncharacterised protein family (UPF0120)
313779	Ribosomal protein S2
313780	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
313782	F-box domain
313783	Olfactomedin-like domain
313784	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
313786	2Fe-2S iron-sulfur cluster binding domain
313787	HMG (high mobility group) box
313790	Uncharacterized ACR, YfiH family COG1496
313799	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
313800	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
313804	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
313804	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
313806	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
313811	DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh
313811	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
313813	ENV polyprotein (coat polyprotein)
313813	Gag polyprotein, inner coat protein p12. The retroviral p12 is a virion structural protein. p12 is proline rich. The function carried out by p12 in assembly and replication is unknown. p12 is associated with pathogenicity of the virus
313815	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
313817	MCM2/3/5 family
313817	Magnesium chelatase, subunit ChlI. Magnesium-chelatase is a three-component enzyme that catalyses the insertion of Mg2+ into protoporphyrin IX. This is the first unique step in the synthesis of (bacterio)chlorophyll. Due to this, it is thought that Mg-che
313819	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
313820	Protein kinase domain
313823	Phosphotyrosine interaction domain (PTB/PID)
313827	Core histone H2A/H2B/H3/H4
313829	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
313830	LCCL domain
313830	von Willebrand factor type A domain
313831	LCCL domain
313831	von Willebrand factor type A domain
313833	Sulfotransferase protein
313834	Protein kinase domain
313834	PH domain. PH stands for pleckstrin homology
313837	Glutaminyl cyclase. Glutaminyl cyclase catalyses the formation of the pyroglutamyl residue present at the amino terminus of numerous secretory peptides and proteins. Glutaminyl cyclase posses a zinc aminopeptidase domain in which the four functionally imp
313841	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
313842	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
313843	Aldose 1-epimerase
313844	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
313845	PH domain. PH stands for pleckstrin homology
313845	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
313846	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
313847	Protein kinase domain
313851	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
313858	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi
313858	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
313860	Protein kinase domain
313861	WD domain, G-beta repeat
313861	Translationally controlled tumor protein
313861	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes.
313862	ENV polyprotein (coat polyprotein)
313866	PH domain. PH stands for pleckstrin homology
313866	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
313866	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
313868	Ribosomal protein L31e
313868	Ribosomal protein L14p/L23e
313874	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
313874	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
313878	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
313879	Calpain family cysteine protease
313880	Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by
313881	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
313889	Enolase, C-terminal TIM barrel domain
313907	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
313910	Protein kinase domain
313911	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
313912	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
313917	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
313918	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
313921	RNA pseudouridylate synthase. Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes RluD, a pseudouridylate synthase that converts specific uracils to
313925	Ribosomal protein S6e
313925	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
313927	C-5 cytosine-specific DNA methylase
313927	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
313935	C2 domain
313935	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
313939	Sigma-54 interaction domain
313939	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
313940	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
313946	Ribosomal protein L13e
313950	PX domain. PX domains bind to phosphoinositides
313954	von Willebrand factor type A domain
313957	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
313960	Helix-loop-helix DNA-binding domain
313961	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
313961	SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily
313961	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
313970	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
313970	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
313974	Protein kinase domain
313976	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
313978	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
313983	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
313986	HMG (high mobility group) box
313987	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
313991	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
313994	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
313997	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
314004	Thymidylate kinase
314009	Ribosomal protein L21e
314010	Lectin C-type domain. This family includes both long and short form C-type
314010	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
314011	Caulimovirus viroplasmin. This family consists of various caulimovirus viroplasmin proteins. The viroplasmin protein is encoded by gene VI and is the main component of viral inclusion bodies or viroplasms. Inclusions are the site of viral assembly, DNA sy
314014	WD domain, G-beta repeat
314016	Animal haem peroxidase
314016	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
314019	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
314030	Ribosomal protein L35Ae
314035	Ribosomal L22e protein family
314037	Ribosomal protein S2
314039	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
314040	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
314041	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
314046	MYND finger
314046	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
314047	Ribosomal protein S26e
314050	Adenosine-deaminase (editase) domain
314054	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
314063	Core histone H2A/H2B/H3/H4
314071	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
314075	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of
314092	Protein kinase domain
314095	LCCL domain
314095	von Willebrand factor type A domain
314099	SH2 domain
314108	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
314110	Elongation factor 1 gamma, conserved domain
314110	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
314110	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
314111	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
314112	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
314113	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
314114	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
314117	Hsp90 protein
314117	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
314118	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
314118	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
314118	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
314121	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
314123	Aminotransferase class I and II
314124	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
314124	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
314125	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
314126	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
314126	DDT domain. This domain is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors)
314126	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
314129	Ribosomal L29e protein family
314131	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
314138	'Paired box' domain
314140	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
314144	Ribosomal protein L15
314147	Ribosomal protein S6e
314148	Lectin C-type domain. This family includes both long and short form C-type
314151	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
314151	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
314153	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
314154	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
314157	F-box domain
314160	ENV polyprotein (coat polyprotein)
314164	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
314165	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
314166	HMG (high mobility group) box
314173	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
314174	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
314176	Core histone H2A/H2B/H3/H4
314178	Ribosomal protein L21e
314179	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
314180	MAM domain. An extracellular domain found in many receptors
314181	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
314181	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
314182	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314184	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
314184	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
314191	NAC domain
314193	Protein kinase domain
314194	Protein kinase domain
314195	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
314198	Protein kinase domain
314200	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
314213	7 transmembrane receptor (rhodopsin family)
314213	7 transmembrane receptor (rhodopsin family)
314215	Ribosomal protein L6
314216	Cyclophilin type peptidyl-prolyl cis-trans isomerase
314218	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
314221	Homeobox domain
314222	Ribosomal protein L23
314223	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
314224	LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo
314225	C2 domain
314245	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
314246	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
314248	Ribosomal S17
314249	PH domain. PH stands for pleckstrin homology
314249	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
314250	PH domain. PH stands for pleckstrin homology
314250	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
314253	Ribosomal protein S7e
314254	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
314258	Adenylate kinase
314259	Guanylate kinase
314259	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
314259	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
314260	PH domain. PH stands for pleckstrin homology
314260	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
314262	PH domain. PH stands for pleckstrin homology
314262	MyTH4 domain. Domain in myosin and kinesin tails, present twice in myosin-VIIa, and also present in 3 other myosins
314264	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
314273	WD domain, G-beta repeat
314274	Ribosomal protein L44
314275	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
314283	Disintegrin
314283	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
314283	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
314293	Ribosomal L39 protein
314296	PWI domain
314297	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314302	Cyclin-dependent kinase regulatory subunit
314304	Dienelactone hydrolase family
314309	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
314309	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
314312	GDA1/CD39 (nucleoside phosphatase) family
314313	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
314317	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
314317	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
314322	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
314328	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
314332	Domain of unknown function DUF221. This family consists of hypothetical transmembrane proteins none of which have any function, the aligned region is at 538 residues at maximum length
314333	Aminotransferase class I and II
314333	Orn/Lys/Arg decarboxylase, major domain
314334	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
314336	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
314336	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
314337	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
314337	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
314345	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
314345	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
314349	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
314352	Fibronectin type II domain
314353	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
314356	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
314360	Glycosyl hydrolase family 59
314374	Fork head domain
314384	Protein kinase domain
314384	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
314386	7 transmembrane receptor (rhodopsin family)
314389	Ribosomal protein S3, C-terminal domain. This family contains a central domain pfam00013, hence the amino and carboxyl terminal domains are stored separately. This is a minimal carboxyl-terminal domain. Some are much longer
314391	C2 domain
314396	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cells
314397	Vacuolar sorting protein 9 (VPS9) domain
314400	Enolase, N-terminal domain
314400	Enolase, C-terminal TIM barrel domain
314400	Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain
314406	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
314407	Importin-beta N-terminal domain
314416	Adenylate kinase
314427	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
314430	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
314431	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes
314432	Repeat in ubiquitin-activating (UBA) protein
314432	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
314434	Ribosomal protein L6
314435	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314436	Ets-domain
314438	Fatty acid desaturase
314439	'Cold-shock' DNA-binding domain
314441	Mitochondrial carrier protein
314442	WHEP-TRS domain
314442	tRNA synthetases class I (W and Y)
314453	WD domain, G-beta repeat
314457	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
314458	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
314461	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
314462	Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT)
314464	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
314465	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
314465	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
314466	Uncharacterized ACR, YggU family COG1872
314468	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
314473	Kinesin motor domain
314480	7 transmembrane receptor (rhodopsin family)
314482	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
314483	Transcription factor TFIIB repeat
314491	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314492	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314493	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314494	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314495	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314496	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314497	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314498	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314499	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314500	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314501	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314503	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314504	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314507	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314509	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314510	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314511	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314512	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314513	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
314514	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
314515	7 transmembrane receptor (rhodopsin family)
314515	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314516	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314518	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314521	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314524	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
314527	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
314532	Ribosomal protein L19e
314537	Ribosomal protein L36e
314537	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
314537	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
314538	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
314540	Hsp90 protein
314540	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
314543	Phosphotyrosine interaction domain (PTB/PID)
314543	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
314543	Herpesvirus UL6 like. This family consists of various proteins from the herpesviridae that are similar to herpes simplex virus type I UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome
314544	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
314547	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
314548	Glycoprotease family
314548	ORMDL family. Evidence form suggests that ORMDLs are involved in protein folding in the ER
314549	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
314550	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
314551	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
314553	Phosphatidyl serine synthase. Phosphatidyl serine synthase is also known as serine exchange enzyme. This family represents eukaryotic PSS I and II which are membrane bound proteins which catalyses the replacement of the head group of a phospholipid (phosp
314555	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
314556	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
314560	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
314563	Helix-loop-helix DNA-binding domain
314566	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
314569	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
314570	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
314570	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
314574	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
314575	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
314582	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
314584	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314586	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314587	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314588	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314589	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
314590	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314591	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314592	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314593	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
314594	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
314596	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
314600	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314601	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314602	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314603	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314605	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
314605	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
314606	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
314608	7 transmembrane receptor (rhodopsin family)
314610	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314612	SH2 domain
314612	Phosphotyrosine interaction domain (PTB/PID)
314613	Zinc finger, C3HC4 type (RING finger)
314614	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
314614	PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2
314615	Trypsin
314616	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
314617	WD domain, G-beta repeat
314618	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
314618	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
314621	Protein kinase domain
314623	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
314624	Ephrin
314625	TB2/DP1, HVA22 family. This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein, which in humans is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological
314626	NTR/C345C module. We have not included the related pfam00965 family. It has been suggested that the common function of these modules is binding to metzincins. A subset of this family is known as the C345C domain because it occurs in complement C3, C4 and
314627	Protein kinase domain
314627	POLO box duplicated region
314629	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
314633	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
314634	PH domain. PH stands for pleckstrin homology
314635	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
314637	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
314638	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
314640	Guanylate kinase
314640	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
314647	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
314649	7 transmembrane receptor (rhodopsin family)
314651	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
314652	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
314653	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
314654	Myosin head (motor domain)
314654	RNA helicase. This family includes RNA helicases thought to be involved in duplex unwinding during viral RNA replication. Members of this family are found in a variety of single stranded RNA viruses
314656	Nuclear movement protein. The nuclear movement protein or NudC, was first identified as a nuclear distribution (nud) gene that regulates nuclear movement in the filamentous fungus. The mammalian homologue of NudC interacts with Lis1, a neuronal migration
314658	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
314659	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
314660	HMG (high mobility group) box
314661	ENV polyprotein (coat polyprotein)
314661	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
314663	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
314669	Protein kinase domain
314669	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
314669	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
314670	Protein kinase domain
314674	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
314675	WD domain, G-beta repeat
314675	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
314675	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
314676	DNA photolyase. This domain binds a light harvesting cofactor
314676	Antifreeze-like domain. This family contains type III antifreeze proteins as well as a variety of enzymes. This domain is presumed to be involved in sugar binding in the enzyme proteins
314676	NeuB family. NeuB is the prokaryotic N-acetylneuraminic acid (Neu5Ac) synthase. It catalyses the direct formation of Neu5Ac (the most common sialic acid) by condensation of phosphoenolpyruvate (PEP) and N-acetylmannosamine (ManNAc). This reaction has only
314678	LIM domain. This family represents two copies of the LIM structural domain
314681	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
314682	HMG (high mobility group) box
314688	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
314691	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
314692	Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples hav
314694	Sulfotransferase. Chondroitin 6-sulfotransferase catalyses the transfer of sulfate to position 6 of the N-acetylgalactosamine residue of chondroitin
314706	Insulin/IGF/Relaxin family. Superfamily includes insulins
314709	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
314709	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
314711	Ets-domain
314712	ADP-ribosylation factor family
314712	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
314712	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
314715	Ribosomal protein L6
314716	Growth-Arrest-Specific Protein 2 Domain
314716	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
314717	Protein kinase domain
314721	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
314723	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
314727	Phosphotyrosine interaction domain (PTB/PID)
314728	Phosphotyrosine interaction domain (PTB/PID)
314730	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
314733	Ribosomal protein S19e
314743	Protein kinase domain
314745	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
314746	Ubiquitin carboxyl-terminal hydrolase family 2
314746	Ubiquitin carboxyl-terminal hydrolases family 2
314746	Zn-finger in ubiquitin-hydrolases and other protein
314747	PH domain. PH stands for pleckstrin homology
314747	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
314747	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
314754	Death domain
314756	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
314764	Ribosomal protein L6e
314764	Ribosomal protein L6, N-terminal domain
314764	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
314766	Ribosomal L15
314767	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
314767	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
314771	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
314776	Cyclophilin type peptidyl-prolyl cis-trans isomerase
314777	QXW lectin repeat
314777	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
314778	WD domain, G-beta repeat
314782	Eukaryotic porin
314785	TPR Domain
314795	TPR Domain
314800	AMP-binding enzyme
314803	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
314805	C2 domain
314810	HMG (high mobility group) box
314812	Ribosomal protein L31e
314814	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
314818	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
314829	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
314834	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
314844	Ribosomal protein L11, RNA binding domain
314844	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
314845	Core histone H2A/H2B/H3/H4
314847	Putative membrane protein. This family called family 8 in, may be a transmembrane protein The specific function of this protein is unknown
314850	PTB domain (IRS-1 type)
314853	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes
314854	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes
314855	Zinc carboxypeptidase
314856	Zn-finger in Ran binding protein and others
314856	p53-associated protein (MDM2). The p53-associated protein (MDM2) is an inhibitor of the p53 tumor suppressor gene binding the transactivation domain and down regulating the ability of p53 to activate transcription. This family contains the p53 binding dom
314860	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
314862	Protein kinase domain
314870	Death domain
314870	Protein kinase domain
314870	K+ potassium transporter. This is a family of K+ potassium transporters that are conserved across phyla, having both bacterial (KUP), yeast (HAK), and plant (AtKT) sequences as members
314871	Uncharacterized protein family UPF0005
314874	SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four decad
314879	Exportin-t. Exportin-t is a specific mediator of tRNA export. RanGTP-binding, importin beta-related factor with predominantly nuclear localization. It shuttles rapidly between nucleus and between nucleus and cytoplasm and interacts with nuclear pore compl
314896	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
314902	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
314903	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
314903	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
314904	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
314907	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
314908	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
314908	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
314909	SH2 domain
314909	STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. Th
314909	STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth fac
314912	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
314912	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
314927	Ribosomal protein L31e
314940	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
314941	CUB domain
314942	CUB domain
314942	Sushi domain (SCR repeat)
314943	Sushi domain (SCR repeat)
314943	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
314944	CUB domain
314944	Thymosin beta-4 family
314946	GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do
314950	Ribosomal S3Ae family
314953	Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the mos
314954	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
314954	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
314961	PH domain. PH stands for pleckstrin homology
314961	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
314961	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
314965	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
314969	NOL1/NOP2/sun family
314969	Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat
314972	Ribosomal protein L19e
314975	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
314975	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
314977	Core histone H2A/H2B/H3/H4
314979	Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signaling proteins that contain pfam00169, rasGEF, rhoGEF, rhoGAP, pfam00615, pfam00595 domains
314981	Fibronectin type III domain
314981	von Willebrand factor type A domain
314981	Thrombospondin N-terminal -like domain
314988	Homeobox domain
314990	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
314990	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
314993	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
314993	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
314995	Ribosomal protein S27
314996	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
314996	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
314997	C2 domain
314999	Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat
315000	Zinc finger, C3HC4 type (RING finger)
315001	TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme
315005	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
315007	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
315007	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
315009	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
315023	Mitochondrial carrier protein
315027	pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast mem
315033	TPR Domain
315035	Ribosomal protein S24e
315038	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
315040	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
315042	Protein kinase domain
315046	HMG (high mobility group) box
315047	ENV polyprotein (coat polyprotein)
315047	Golgi 4-transmembrane spanning transporter
315051	Ribosomal L29e protein family
315055	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
315055	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
315065	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
315073	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di
315073	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
315074	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
315075	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
315076	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
315077	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
315078	Ribosomal protein L19e
315079	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
315081	Transposase. Transposase proteins are necessary for efficient DNA transposition. Contains transposases for IS204, IS1001, IS1096 and IS1165
315083	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
315083	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
315085	Nicotinate phosphoribosyltransferase (NAPRTase). Nicotinate phosphoribosyltransferase (EC:2.4.2.11) is the rate limiting enzyme that catalyses the first reaction in the NAD salvage synthesis
315087	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
315088	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
315089	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
315090	Protein kinase domain
315093	WD domain, G-beta repeat
315094	Domain of unknown function (DUF384). domo keywords :chromosome c26f1.12c 41.3
315094	Domain of unknown function (DUF383). domo keywords :chromosome c26f1.12c 41.3
315095	Protein kinase domain
315096	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results
315100	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
315101	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
315103	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
315108	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
315109	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
315112	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
315112	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
315114	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
315114	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
315115	PH domain. PH stands for pleckstrin homology
315115	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
315117	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
315119	Fringe-like. The drosophila protein fringe (FNG) is a glucosaminyltransferase that controls the response of the Notch receptor to specific ligands. FNG is localised to the Golgi apparatus (not secreted as previously thought). Modification of Notch occurs
315122	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
315124	PH domain. PH stands for pleckstrin homology
315125	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
315131	ER lumen protein retaining receptor
315132	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
315133	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
315133	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
315134	Josephin
315135	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
315138	'chromo' (CHRromatin Organization MOdifier) domain
315141	Mab-21 protein
315143	Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration
315144	SH2 domain
315144	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
315147	Ribosomal L29e protein family
315150	Lyase
315151	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
315151	RPEL repeat. The RPEL repeat is named after four conserved amino acids it contains. The function of the RPEL repeat is unknown however it might be a DNA binding repeat based on the observation that one member contains a pfam02037 domain that is also impli
315154	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
315155	metallopeptidase family M24
315156	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
315156	TAZ zinc finger. The TAZ2 domain of CBP binds to other transcription factors such as the p53 tumor suppressor protein, E1A oncoprotein, MyoD, and GATA-1
315156	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
315156	KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun
315159	BTG1 family. A novel family of anti-proliferative proteins
315165	Melibiase
315170	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
315173	Acyl transferase domain
315174	CUB domain
315178	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
315180	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
315183	Sulfotransferase protein
315185	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
315186	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
315188	NAC domain
315196	wnt family
315203	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
315205	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
315205	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
315206	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
315210	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
315210	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
315210	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
315211	pfam02892, zf-BED, BED zinc finger
315214	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
315214	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
315216	Uncharacterized ACR, YdiU/UPF0061 family
315219	Glycosyl transferase family, a/b domain. This family includes anthranilate phosphoribosyltransferase (TrpD), thymidine phosphorylase. All these proteins can transfer a phosphorylated ribose substrate
315219	pfam02885, Glycos_trans_3N, Glycosyl transferase family, helical bundle domain. This family includes anthranilate phosphoribosyltransferase (TrpD), thymidine phosphorylase. All these proteins can transfer a phosphorylated ribose substrate
315219	SCO1/SenC. This family is involved in biogenesis of respiratory and photosynthetic systems. SCO1 is required for a post-translational step in the accumulation of subunits COXI and COXII of cytochrome c oxidase. SenC is required for optimal cytochrome c ox
315220	Phosphotyrosine interaction domain (PTB/PID)
315222	Sulfatase
315223	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
315223	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
315230	C2 domain
315231	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
315231	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
315234	Sugar (and other) transporter
315237	Sugar (and other) transporter
315238	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
315241	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
315248	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
315250	Ribosomal protein L15
315252	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
315254	Ribosomal protein L35Ae
315255	Zn-finger in Ran binding protein and others
315255	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
315259	LIM domain. This family represents two copies of the LIM structural domain
315263	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
315263	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
315264	Eukaryotic ribosomal protein L18
315264	Uncharacterized protein family UPF0024
315265	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
315284	Somatomedin B domain
315287	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
315289	7 transmembrane receptor (rhodopsin family)
315290	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
315291	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
315292	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
315294	wnt family
315298	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
315298	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
315299	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
315299	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
315303	AMP-binding enzyme
315305	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
315305	pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB
315306	ubiE/COQ5 methyltransferase family
315310	QXW lectin repeat
315310	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
315311	Ribosomal protein L14p/L23e
315311	HCO3- transporter family. This family contains Band 3 anion exchange proteins that exchange CL-/HCO3-. This family also includes cotransporters of Na+/HCO3-
315312	LIM domain. This family represents two copies of the LIM structural domain
315318	Intermediate filament protein
315318	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
315319	Intermediate filament protein
315321	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
315323	Intermediate filament protein
315323	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
315324	Intermediate filament protein
315324	ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit
315324	Vps53-like, N-terminal domain. Vps53 complexes with Vps52 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events
315324	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
315324	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
315324	IncA protein. Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation
315324	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
315326	SH2 domain
315326	Phosphotyrosine interaction domain (PTB/PID)
315327	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
315330	Peptidase family C50
315333	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
315337	Homeobox domain
315337	Tubulin binding cofactor A
315338	Homeobox domain
315341	Homeobox domain
315344	Uracil DNA glycosylase superfamily
315344	Uracil DNA glycosylase superfamily
315345	Clathrin adaptor complex small chain
315345	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
315350	PTB domain (IRS-1 type)
315350	PH domain. PH stands for pleckstrin homology
315352	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
315358	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
315358	Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However
315360	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
315362	Protein kinase domain
315363	Ribosomal protein S19e
315364	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
315365	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
315369	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
315371	Cyclophilin type peptidyl-prolyl cis-trans isomerase
315372	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
315374	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
315378	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
315379	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
315380	HIT family
315382	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
315385	Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions
315387	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di
315387	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
315388	Trypsin
315389	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
315389	TAZ zinc finger. The TAZ2 domain of CBP binds to other transcription factors such as the p53 tumor suppressor protein, E1A oncoprotein, MyoD, and GATA-1
315389	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
315389	KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun
315394	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
315394	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
315395	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
315398	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
315399	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
315399	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
315400	Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors
315404	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
315405	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
315406	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
315406	OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl -tRNA synthetases cata
315407	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
315407	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
315407	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
315409	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
315412	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
315417	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
315429	jmjN domain
315429	jmjC domain
315430	7 transmembrane receptor (rhodopsin family)
315432	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
315432	PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the
315434	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
315435	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
315435	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
315436	Ribosomal protein S6e
315437	Virulence determinant. The UK protein is an African swine fever virus (ASFV) protein that is highly conserved amongst strains, and is an important viral virulence determinant for domestic pigs
315438	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
315439	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
315442	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
315444	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
315452	7 transmembrane receptor (rhodopsin family)
315455	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
315457	7 transmembrane receptor (rhodopsin family)
315458	7 transmembrane receptor (rhodopsin family)
315458	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
315459	7 transmembrane receptor (rhodopsin family)
315461	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
315467	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
315469	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
315470	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
315471	Ribosomal L10
315478	PX domain. PX domains bind to phosphoinositides
315478	PXA domain. This domain is associated with PX domains pfam00787
315479	Elongation factor 1 gamma, conserved domain
315479	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
315479	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
315484	Proteasome A-type and B-type
315487	Rad17 cell cycle checkpoint protein
315487	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
315492	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
315492	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
315493	Intermediate filament protein
315497	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
315504	Glycosyl hydrolases family 35
315505	Ribosomal protein L23
315506	Glycosyl hydrolases family 35
315507	Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold
315507	Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle
315508	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
315509	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
315510	Fibronectin type III domain
315510	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
315521	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
315527	Homeobox domain
315530	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
315532	Ets-domain
315533	Sterile alpha motif (SAM)/Pointed domain
315534	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
315534	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
315545	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
315546	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
315547	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
315548	SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins
315548	ArgK protein. The ArgK protein acts as an ATPase enzyme and as a kinase, and phosphorylates periplasmic binding proteins involved in the LAO (lysine, arginine, ornithine)/AO transport systems
315548	Signal recognition particle, alpha subunit, N-terminal. SRP is a complex of six distinct polypeptides and a 7S RNA that is essential for transferring nascent polypeptide chains that are destined for export from the cell to the translocation apparatus of t
315550	RNA pseudouridylate synthase. Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes RluD, a pseudouridylate synthase that converts specific uracils to
315554	tRNA pseudouridine synthase. Involved in the formation of pseudouridine at the anticodon stem and loop of transfer-RNAs Pseudouridine is an isomer of uridine (5-(beta-D-ribofuranosyl) uracil, and id the most abundant modified nucleoside found in all cellu
315558	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
315559	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
315568	7 transmembrane receptor (rhodopsin family)
315570	7 transmembrane receptor (rhodopsin family)
315571	7 transmembrane receptor (rhodopsin family)
315572	7 transmembrane receptor (rhodopsin family)
315573	7 transmembrane receptor (rhodopsin family)
315575	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
315575	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
315575	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
315576	ENV polyprotein (coat polyprotein)
315579	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
315579	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
315583	Phosphopantetheine attachment site. A 4'-phosphopantetheine prosthetic group is attached through a serine. This prosthetic group acts as a a 'swinging arm' for the attachment of activated fatty acid and amino-acid groups. This domain forms a four helix bu
315597	CUB domain
315597	Fz domain. Also known as the CRD (cysteine rich domain), the C6 box in MuSK receptor. This domain of unknown function has been independently identified by several groups. The domain contains 10 conserved cysteines
315598	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
315600	Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain
315604	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
315604	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
315606	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
315618	Protein kinase domain
315634	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
315638	Fibronectin type III domain
315639	Ubiquitin carboxyl-terminal hydrolase family 2
315639	Ubiquitin carboxyl-terminal hydrolases family 2
315642	Ribosomal protein L15
315643	HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats
315643	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
315644	Ribosomal protein S15
315644	Retinal pigment epithelial membrane protein. This family represents a retinal pigment epithelial membrane receptor which is abundantly expressed in retinal pigment epithelium, and binds plasma retinal binding protein. The family also includes the sequence
315646	Ferredoxin-fold anticodon binding domain. This is the anticodon binding domain found in some phenylalanyl tRNA synthetases. The domain has a ferredoxin fold
315647	Ribosomal protein L15
315648	HEAT repeat. The HEAT repeat family is related to armadillo/beta-catenin-like repeats (see pfam00514). CAUTION: This family does not contain all known HEAT repeats
315648	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
315650	BTG1 family. A novel family of anti-proliferative proteins
315655	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
315655	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
315656	Ribosomal protein S19e
315661	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
315661	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
315664	Filamin/ABP280 repeat
315668	ADP-ribosylation factor family
315668	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
315673	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
315675	Olfactomedin-like domain
315675	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
315675	Olfactomedin-like domain
315675	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
315681	BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems
315685	Protein kinase domain
315689	HMG (high mobility group) box
315691	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
315696	PX domain. PX domains bind to phosphoinositides
315696	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
315699	Protein-tyrosine phosphatase
315699	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
315704	Flavoprotein. This family contains diverse flavoprotein enzymes. This family includes epidermin biosynthesis protein, EpiD, which has been shown to be a flavoprotein that binds FMN. This enzyme catalyzes the removal of two reducing equivalents from the cy
315705	Ribosomal S3Ae family
315707	SH2 domain
315707	Protein kinase domain
315708	YjeF-related protein N-terminus
315710	Single-strand binding protein family
315712	C2 domain
315714	Lysyl oxidase
315716	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
315723	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
315729	Ribosomal protein S5, C-terminal domain
315731	Ribosomal protein L21e
315732	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
315732	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
315740	Kinesin motor domain
315740	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
315741	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
315744	von Willebrand factor type A domain
315745	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
315748	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
315750	Glycosyl hydrolase family 1
315755	Thymosin beta-4 family
315756	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
315756	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
315756	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
315758	Prolyl oligopeptidase family
315758	Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site
315759	Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However
315762	ADP-ribosylation factor family
315762	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
315763	Formyl transferase. This family includes the following members. Glycinamide ribonucleotide transformylase catalyses the third step in de novo purine biosynthesis, the transfer of a formyl group to 5'-phosphoribosylglycinamide. Formyltetrahydrofolate defor
315764	PH domain. PH stands for pleckstrin homology
315765	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
315768	Ribosomal protein S5, C-terminal domain
315768	Ribosomal protein S5, N-terminal domain
315771	WD domain, G-beta repeat
315771	Regulator of chromosome condensation (RCC1)
315771	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
315771	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
315773	Ubiquitin carboxyl-terminal hydrolases family 2
315776	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
315776	I/LWEQ domain. I/LWEQ domains bind to actin. It has been shown that the I/LWEQ domains from mouse talin and yeast Sla2p interact with F-actin. I/LWEQ domains can be placed into four major groups based on sequence similarity: (1) Metazoan talin
315778	Ribosomal protein L44
315781	Peptidase family C54
315786	Phosphorylase family 2
315788	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
315792	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
315793	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
315793	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
315795	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
315803	Ribosomal L38e protein family
315805	Ribosomal protein L21e
315806	Fibronectin type III domain
315806	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
315807	Plasmid Maintenance Protein. The SMP protein has been implemented in plasmid stability
315809	Ribosomal L29e protein family
315811	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
315813	C2 domain
315819	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
315822	Leo1-like protein. Members of this family are part of the Paf1/RNA polymerase II complex. The Paf1 complex probably functions during the elongation phase of transcription
315824	Transforming growth factor beta like domain
315824	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
315826	HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase
315827	HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase
315828	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
315833	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
315833	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
315836	Ribosomal protein L11, RNA binding domain
315836	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
315840	Myosin head (motor domain)
315843	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
315844	Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellul
315845	Hsp90 protein
315851	GNS1/SUR4 family
315852	Protein kinase domain
315858	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
315860	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
315866	Trypsin
315868	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
315870	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
315871	PXA domain. This domain is associated with PX domains pfam00787
315872	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
315876	PH domain. PH stands for pleckstrin homology
315877	Guanine nucleotide exchange factor for Ras-like GTPases
315877	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
315877	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
315878	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
315879	Thrombospondin type 1 domain
315879	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
315879	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
315880	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
315883	Scramblase. Scramblase is palmitoylated and contains a potential protein kinase C phosphorylation site. Scramblase exhibits Ca2+-activated phospholipid scrambling activity in vitro. There are also possible SH3 and WW binding motifs. Scramblase is involved
315887	Ribosomal protein S5, N-terminal domain
315887	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
315889	ENV polyprotein (coat polyprotein)
315889	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
315901	Sulfotransferase protein
315903	Surp module. This domain is also known as the SWAP domain. SWAP stands for Suppressor-of-White-APricot. It has been suggested that these domains may be RNA binding
315903	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
315904	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
315907	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
315908	UBX domain. Domain present in ubiquitin-regulatory proteins. Present in FAF1 and Shp1p
315909	14-3-3 protein
315910	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
315911	Zinc finger, C3HC4 type (RING finger)
315911	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
315912	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
315912	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
315914	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
315918	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
315919	Ribosomal protein S2
315919	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
315923	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
315926	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
315926	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
315932	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
315935	C2 domain
315936	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
315939	Histidine acid phosphatase
315941	Mitochondrial carrier protein
315946	Nucleoside diphosphate kinase
315947	Nucleoside diphosphate kinase
315947	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
315950	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
315953	PMP-22/EMP/MP20/Claudin family
315954	S25 ribosomal protein
315955	Tissue factor
315957	Integral membrane protein DUF6. This family includes many hypothetical membrane proteins of unknown function. Many of the proteins contain two copies of the aligned region
315958	Stromal antigen (SA/STAG) protein
315968	ADP-ribosylation factor family
315968	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
315969	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
315971	HMG (high mobility group) box
315973	Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold
315973	Acyl-CoA dehydrogenase, N-terminal domain. The N-terminal domain of Acyl-CoA dehydrogenase is an all-alpha domain
315973	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
315974	7 transmembrane receptor (rhodopsin family)
315975	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
315978	Protein kinase domain
315979	von Willebrand factor type A domain
315979	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
315980	von Willebrand factor type A domain
315981	von Willebrand factor type A domain
315982	WD domain, G-beta repeat
315988	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
315993	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
315994	Protein kinase domain
315995	Ribosomal protein S2
315996	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
315997	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
315999	SAND family protein. Members of this family are uncharacterised proteins that have been called SAND proteins. These protein do not contain a SAND domain. The members of this family are 500-600 amino acids long and contain several conserved motifs
316000	Actin
316002	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
316005	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
316006	Regulatory subunit of type II PKA R-subunit
316009	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
316009	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
316009	Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found in Plexin. Two copies of the repeat are found in mahogany protein. A related C. elegans p
316010	Cathelicidin. A novel protein family, showing a conserved proregion and a variable carboxyl-terminal antimicrobial domain. This region shows similarity to cystatins
316012	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
316012	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
316014	Beige/BEACH domain
316014	WD domain, G-beta repeat
316015	Arginyl tRNA synthetase N terminal domain. This domain is found at the amino terminus of Arginyl tRNA synthetase, also called additional domain 1 (Add-1). It is about 140 residues long and it has been suggested that this domain will be involved in tRNA re
316016	Trypsin
316017	Vacuolar sorting protein 9 (VPS9) domain
316017	MORN repeat. The MORN (Membrane Occupation and Recognition Nexus) repeat is found in multiple copies in several proteins including junctophilins (See Takeshima et al. Mol. Cell 2000
316019	7 transmembrane receptor (rhodopsin family)
316022	UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near t
316023	Doublecortin
316023	Protein kinase domain
316028	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
316033	Glycosyl hydrolases family 35
316033	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
316045	Eukaryotic porin
316048	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
316051	Intermediate filament protein
316051	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
316052	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
316053	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
316055	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
316061	Sugar (and other) transporter
316062	Sugar (and other) transporter
316064	Protein kinase domain
316065	Ribosomal protein L10
316066	Protein kinase domain
316067	FGGY family of carbohydrate kinases, C-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain
316067	FGGY family of carbohydrate kinases, N-terminal domain. This domain adopts a ribonuclease H-like fold and is structurally related to the C-terminal domain
316068	Ribosomal protein S27
316069	Ribosomal protein S27
316073	Ribosome recycling factor. The ribosome recycling factor (RRF / ribosome release factor) dissociates the ribosome from the mRNA after termination of translation, and is essential bacterial growth. Thus ribosomes are "recycled" and ready for another round
316075	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
316077	GDA1/CD39 (nucleoside phosphatase) family
316077	GDA1/CD39 (nucleoside phosphatase) family
316077	GDA1/CD39 (nucleoside phosphatase) family
316077	GDA1/CD39 (nucleoside phosphatase) family
316080	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
316086	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes
316087	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
316087	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
316088	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
316088	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
316090	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
316094	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
316095	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
316095	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
316096	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
316097	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
316098	3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
316098	3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase
316099	Lectin C-type domain. This family includes both long and short form C-type
316101	LIM domain. This family represents two copies of the LIM structural domain
316103	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
316105	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
316106	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases
316107	7 transmembrane receptor (rhodopsin family)
316107	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
316109	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
316110	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
316110	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
316113	Helix-hairpin-helix motif
316118	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
316120	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
316120	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
316122	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
316124	AMP-binding enzyme
316126	RanBP1 domain
316129	Zinc finger, C3HC4 type (RING finger)
316129	YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named
316129	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
316129	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
316130	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
316131	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
316132	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
316134	ENV polyprotein (coat polyprotein)
316137	7 transmembrane receptor (Secretin family)
316137	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
316138	7 transmembrane receptor (metabotropic glutamate family)
316142	ENV polyprotein (coat polyprotein)
316142	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
316142	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
316144	7 transmembrane receptor (metabotropic glutamate family)
316147	Ribosomal protein L15
316153	Sulfotransferase protein
316156	Sulfotransferase protein
316157	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
316158	Ribosomal protein S26e
316161	Ribosomal protein L31e
316164	CUT domain. The CUT domain is a DNA-binding motif which can bind independently or in cooperation with the homeodomain, often found downstream of the CUT domain. Multiple copies of the CUT domain can exist in one protein
316166	Homeobox domain
316168	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
316173	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
316202	MoaC family. Members of this family are involved in molybdenum cofactor biosynthesis. However their molecular function is not known
316208	Fork head domain
316210	cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of
316211	Ubiquitin carboxyl-terminal hydrolase family 2
316211	Ubiquitin carboxyl-terminal hydrolases family 2
316211	Zn-finger in ubiquitin-hydrolases and other protein
316212	LIM domain. This family represents two copies of the LIM structural domain
316213	PTB domain (IRS-1 type)
316214	Helix-loop-helix DNA-binding domain
316231	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
316231	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
316234	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
316235	impB/mucB/samB family. These proteins are involved in UV protection
316236	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
316241	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
316242	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
316249	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
316256	Death domain
316258	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
316259	7 transmembrane receptor (rhodopsin family)
316259	7 transmembrane receptor (Secretin family)
316259	7 transmembrane receptor (rhodopsin family)
316259	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
316262	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
316265	Phosphoglycerate kinase
316266	SCP-like extracellular protein. This domain is also found in prokaryotes
316269	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
316270	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
316273	MCM2/3/5 family
316275	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
316276	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
316281	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
316281	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
316282	Uncharacterised protein family (UPF0136). This family of short membrane proteins are as yet uncharacterised
316283	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
316283	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
316284	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
316284	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
316285	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
316294	Thrombospondin type 1 domain
316294	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
316296	Thrombospondin N-terminal -like domain
316296	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
316299	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
316304	Glutamine synthetase, beta-Grasp domain
316304	Glutamine synthetase, catalytic domain
316310	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
316312	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
316312	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
316313	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
316316	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
316317	Protein-tyrosine phosphatase
316317	IMP4 family. This family includes IMP4, which is involved ribosomal RNA processing
316318	Trypsin
316323	Ribosomal protein L21e
316327	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
316330	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
316338	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
316341	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
316341	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
316341	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
316342	Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to for
316343	MIT domain
316344	impB/mucB/samB family. These proteins are involved in UV protection
316344	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
316348	Phosducin
316354	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
316364	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
316369	SH2 domain
316369	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
316370	Filamin/ABP280 repeat
316372	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
316373	HMG (high mobility group) box
316376	Inositol monophosphatase family
316377	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
316378	HMG (high mobility group) box
316379	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
316379	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
316383	Ribosomal protein L14p/L23e
316385	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
316386	Ribosomal protein L21e
316390	WD domain, G-beta repeat
316393	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
316395	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
316395	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
316401	ribosomal L5P family C-terminus. This region is found associated with pfam00281
316403	tRNA synthetases class I (I, L, M and V). Other tRNA synthetase sub-families are too dissimilar to be included
316403	tRNA synthetases class I (C). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only cysteinyl tRNA synthetases
316408	Homeobox domain
316409	HSF-type DNA-binding
316419	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
316421	[2Fe-2S] binding domain
316421	CO dehydrogenase flavoprotein C-terminal domain
316421	FAD binding domain in molybdopterin dehydrogenase
316421	Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain
316421	Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain
316422	Ribosomal protein L21e
316423	Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain
316423	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
316424	[2Fe-2S] binding domain
316424	CO dehydrogenase flavoprotein C-terminal domain
316424	FAD binding domain in molybdopterin dehydrogenase
316424	Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain
316424	Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain
316434	ATP synthase subunit C
316435	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
316443	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
316444	Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin Putative zinc finger
316444	DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds
316444	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
316446	Zinc carboxypeptidase
316452	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
316455	PH domain. PH stands for pleckstrin homology
316456	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
316461	Indole-3-glycerol phosphate synthase
316461	Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate
316461	Thiamine monophosphate synthase/TENI. Thiamine monophosphate synthase (TMP) (EC:2.5.1.3) catalyzes the substitution of the pyrophosphate of 2-methyl-4-amino-5- hydroxymethylpyrimidine pyrophosphate by 4-methyl-5- (beta-hydroxyethyl)thiazole phosphate to y
316467	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
316468	WD domain, G-beta repeat
316469	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
316477	Microtubule associated protein (MAP65/ASE1 family)
316481	Acyl CoA binding protein
316482	LCCL domain
316482	SCP-like extracellular protein. This domain is also found in prokaryotes
316483	14-3-3 protein
316484	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
316495	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
316500	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
316503	Hsp90 protein
316504	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
316506	Fibronectin type III domain
316507	SH2 domain
316516	Uncharacterized protein family UPF0005
316518	Metallo-beta-lactamase superfamily
316519	Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined b
316520	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
316521	Villin headpiece domain
316522	Ubiquitin carboxyl-terminal hydrolases family 2
316523	Cell differentiation family, Rcd1-like. Two of the members in this family have been characterised as being involved in regulation of Ste11 regulated sex genes
316525	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
316526	wnt family
316527	wnt family
316530	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predict
316531	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
316531	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
316541	Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand
316543	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
316545	Clathrin adaptor complex small chain
316550	ADP-ribosylation factor family
316550	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
316556	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
316556	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
316557	Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth fa
316559	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
316559	Reduced folate carrier. The reduced folate carrier (a transmembrane glycoprotein) transports reduced folate into mammalian cells via the carrier mediated mechanism (as opposed to the receptor mediated mechanism) it also transports cytotoxic folate analogu
316577	F-box domain
316579	7 transmembrane receptor (rhodopsin family)
316580	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
316580	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
316580	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
316580	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
316583	Ribosomal protein L36e
316583	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
316587	Poly-adenylate binding protein, unique domain
316587	RNB-like protein. The function of this region of similarity is uncertain
316588	RNB-like protein. The function of this region of similarity is uncertain
316590	RNB-like protein. The function of this region of similarity is uncertain
316592	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
316595	GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein
316600	UDP-glucoronosyl and UDP-glucosyl transferase
316601	UDP-glucoronosyl and UDP-glucosyl transferase
316608	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
316610	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
316611	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
316611	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
316614	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
316615	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
316624	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
316624	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
316626	Helix-loop-helix DNA-binding domain
316628	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
316630	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
316632	Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes
316633	7 transmembrane receptor (rhodopsin family)
316633	Uncharacterised protein family (UPF0123). This family of proteins are uncharacterised, they contain five CXXC motifs
316634	7 transmembrane receptor (rhodopsin family)
316636	Intermediate filament protein
316638	Fibronectin type III domain
316638	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
316639	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
316639	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
316640	Peptidase family C54
316641	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
316645	CBF/Mak21 family
316646	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
316651	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
316653	ENV polyprotein (coat polyprotein)
316656	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
316657	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
316658	NAC domain
316658	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
316660	RanBP1 domain
316663	F-box domain
316679	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
316680	PH domain. PH stands for pleckstrin homology
316685	Histidine acid phosphatase
316687	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
316689	C2 domain
316691	S1 RNA binding domain. The S1 domain occurs in a wide range of RNA associated proteins. It is structurally similar to cold shock protein which binds nucleic acids. The S1 domain has an OB-fold structure
316700	Protein kinase domain
316700	Protein kinase C terminal domain
316708	Core histone H2A/H2B/H3/H4
316710	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
316711	Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are repr
316715	UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate urid
316717	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
316718	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
316721	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
316722	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
316723	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
316725	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
316726	Cyclophilin type peptidyl-prolyl cis-trans isomerase
316727	Mouse mammary tumor virus superantigen. The mouse mammary tumor virus (MMTV) is a milk-transmitted type B retrovirus. The superantigen (SAg) is encoded by the long terminal repeat. The SAgs are also called PR73
316733	'Cold-shock' DNA-binding domain
316734	Ribosomal protein L44
316740	Fibronectin type III domain
316745	Ribosomal protein L21e
316748	Ribosomal protein S24e
316752	Citrate transporter
316762	Fibronectin type III domain
316763	Uncharacterized ACR, COG1579
316763	ADP-ribosylation factor family
316763	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
316765	Protein kinase domain
316768	Protein kinase domain
316768	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316769	Protein kinase domain
316770	Protein kinase domain
316773	Respiratory-chain NADH dehydrogenase 24 Kd subunit
316773	SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop
316774	Respiratory-chain NADH dehydrogenase 24 Kd subunit
316777	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
316781	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
316794	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
316798	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
316816	RNB-like protein. The function of this region of similarity is uncertain
316818	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
316821	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316822	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316823	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316824	Ribosomal protein L19e
316824	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316825	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316826	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
316843	Ribosomal protein L19e
316844	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316846	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316848	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316850	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316851	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316854	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316856	Ribosomal protein L19e
316858	7 transmembrane receptor (metabotropic glutamate family)
316860	Protein kinase domain
316862	Protein kinase domain
316864	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
316865	ENV polyprotein (coat polyprotein)
316865	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
316876	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
316877	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
316884	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
316884	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316885	Ribosomal protein L19e
316886	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316891	7 transmembrane receptor (rhodopsin family)
316892	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316893	Ribosomal protein L19e
316893	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316897	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
316899	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
316900	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
316900	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
316902	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
316914	ENV polyprotein (coat polyprotein)
316917	Ribosomal protein L35Ae
316925	Myosin head (motor domain)
316928	Glutaredoxin
316932	Glutaredoxin
316933	Glutaredoxin
316937	Glutaredoxin
316939	Glutaredoxin
316940	Glutaredoxin
316941	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
316943	Myosin head (motor domain)
316946	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
316949	Myosin head (motor domain)
316950	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
316951	Uncharacterized ACR, COG1579
316951	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
316953	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
316953	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
316954	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
316956	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
316958	ENV polyprotein (coat polyprotein)
316959	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
316962	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
316965	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
316970	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
316973	Myosin head (motor domain)
316976	Ribosomal RNA adenine dimethylase
316979	Ribosomal protein S5, C-terminal domain
316980	ENV polyprotein (coat polyprotein)
316984	Ribosomal protein S8e
316987	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
316988	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
316989	7 transmembrane receptor (rhodopsin family)
316990	7 transmembrane receptor (rhodopsin family)
316991	7 transmembrane receptor (rhodopsin family)
316996	Protein kinase domain
316996	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
316997	Myosin head (motor domain)
317001	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
317002	Myosin head (motor domain)
317002	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
317006	Protein kinase domain
317010	Myosin head (motor domain)
317012	Myosin head (motor domain)
317017	Myosin head (motor domain)
317019	Myosin head (motor domain)
317023	Myosin head (motor domain)
317027	Skp1 family, tetramerisation domain
317029	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
317030	Ribosomal protein L19e
317030	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
317031	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
317033	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
317034	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
317034	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
317037	Ribosomal protein L19e
317038	Ribosomal protein L19e
317040	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
317042	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
317043	Ribosomal protein L19e
317054	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
317062	ENV polyprotein (coat polyprotein)
317064	ADP-ribosylation factor family
317064	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
317067	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
317068	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
317071	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
317081	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
317085	Myosin head (motor domain)
317086	Myosin head (motor domain)
317088	Myosin head (motor domain)
317091	Myosin head (motor domain)
317093	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
317100	Myosin head (motor domain)
317102	Myosin head (motor domain)
317112	Myosin head (motor domain)
317114	Myosin head (motor domain)
317115	Myosin head (motor domain)
317117	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
317124	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
317127	Ribosomal protein L19e
317129	7 transmembrane receptor (rhodopsin family)
317131	Protein kinase domain
317137	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
317137	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
317140	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
317142	Myosin head (motor domain)
317146	Glutaredoxin
317147	ENV polyprotein (coat polyprotein)
317147	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
317149	Protein kinase domain
317150	Protein kinase domain
317152	Myosin head (motor domain)
317153	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
317155	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
317160	Sec23/Sec24 family. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24
317161	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
317162	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
317163	Eukaryotic initiation factor 1A
317164	Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5
317164	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
317165	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
317165	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
317166	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
317166	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
317168	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
317168	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
317170	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
317178	TPR Domain
317178	jmjC domain
317181	Sushi domain (SCR repeat)
317181	HYR domain. This domain is known as the HYR (Hyalin Repeat) domain, after the protein hyalin that is composed exclusively of this repeat. This domain probably corresponds to a new superfamily in the immunoglobulin fold. The function of this domain is unce
317182	Actin
317183	Cadherin domain
317187	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
317196	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
317196	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
317198	Beige/BEACH domain
317202	Cation-dependent mannose-6-phosphate receptor
317203	Protein kinase domain
317203	Protein kinase C terminal domain
317204	Cadherin domain
317205	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
317206	ENV polyprotein (coat polyprotein)
317207	Ribosomal protein S5, N-terminal domain
317208	Kinesin motor domain
317209	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
317211	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
317213	Cyclophilin type peptidyl-prolyl cis-trans isomerase
317213	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
317214	Riboflavin kinase / FAD synthetase. This family consists part of the bifunctional enzyme riboflavin kinase / FAD synthetase. These enzymes have both ATP:riboflavin 5'-phospho transferase and ATP:FMN-adenylyltransferase activitys. They catalyse the 5'-phos
317216	Eukaryotic protein of unknown function, DUF279
317217	Core histone H2A/H2B/H3/H4
317218	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
317219	7 transmembrane receptor (rhodopsin family)
317222	Ribosomal protein L24e
317223	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
317224	Fibronectin type III domain
317226	Fibronectin type III domain
317227	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
317231	Ribosomal L10
317232	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
317233	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
317233	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
317235	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
317236	Calsequestrin
317236	Nebulin repeat
317241	Zinc finger, C3HC4 type (RING finger)
317242	Ribosomal L38e protein family
317244	Ribosomal protein L34e
317244	Ribosomal protein S5, N-terminal domain
317247	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
317250	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
317252	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
317252	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
317252	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
317253	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
317256	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
317257	ENV polyprotein (coat polyprotein)
317257	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
317259	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
317264	Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain
317264	Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain
317268	Homeobox domain
317270	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
317271	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
317273	ATP-NAD kinase. Members of this family are ATP-NAD kinases EC:2.7.1.23. Catalyses the phosphorylation of NAD to NADP utilizing ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus
317274	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
317275	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
317276	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
317279	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
317284	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
317285	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
317290	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
317293	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
317294	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
317296	pfam02893, GRAM, GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins
317297	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
317298	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
317298	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
317299	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
317300	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
317301	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
317302	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
317304	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
317306	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
317308	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
317309	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
317312	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
317313	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
317314	HMG (high mobility group) box
317316	Zona pellucida-like domain
317318	HMG (high mobility group) box
317321	HMG (high mobility group) box
317322	Glypican
317323	HMG (high mobility group) box
317324	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
317326	HMG (high mobility group) box
317332	Ribosomal L29e protein family
317332	Cystine-knot domain. The family comprises glycoprotein hormones and the C-terminal domain of various extracellular proteins. It is believed to be involved in disulfide-linked dimerisation
317335	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
317335	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
317336	Ribosomal protein S19e
317337	Ribosomal protein S19e
317339	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
317339	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
317341	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
317346	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
317352	Core histone H2A/H2B/H3/H4
317353	Core histone H2A/H2B/H3/H4
317356	Cadherin domain
317358	Core histone H2A/H2B/H3/H4
317358	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
317359	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
317361	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
317362	Zn-finger in ubiquitin-hydrolases and other protein
317362	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
317363	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1
317364	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
317365	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
317366	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
317368	MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue
317369	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
317370	WD domain, G-beta repeat
317371	P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB)
317371	WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGl
317373	Ribosomal protein S19e
317374	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1
317375	Nucleotide-sugar transporter. This family of membrane proteins transport nucleotide sugars from the cytoplasm into golgi vesicles. Individual members transport CMP-sialic acid, UDP-galactose, and UDP-GlcNAc
317376	Helix-loop-helix DNA-binding domain
317377	Ribosomal L10
317379	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
317380	LIM domain. This family represents two copies of the LIM structural domain
317382	Fork head domain
317383	Ribosomal L10
317383	Putative phosphatase regulatory subunit. This family consists of several eukaryotic proteins that are thought to be involved in the regulation of glycogen metabolism. For instance, the mouse PTG protein has been shown to interact with glycogen synthase, p
317385	Zn-finger in Ran binding protein and others
317385	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
317389	Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-termin
317389	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
317393	NUDIX domain
317394	Polyprenyl synthetase
317395	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
317396	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
317396	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
317398	Ubiquitin carboxyl-terminal hydrolase family 2
317398	Ubiquitin carboxyl-terminal hydrolases family 2
317399	Ubiquitin carboxyl-terminal hydrolase family 2
317399	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
317399	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
317401	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
317414	Insulinase (Peptidase family M16)
317415	MCM2/3/5 family
317416	von Willebrand factor type A domain
317417	GTPase of unknown function
317417	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
317417	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
317417	Ferrous iron transport protein B. Escherichia coli has an iron(II) transport system (feo) which may make an important contribution to the iron supply of the cell under anaerobic conditions. FeoB has been identified as part of this transport system. FeoB i
317417	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
317419	Ribosomal protein L21e
317420	Protein kinase domain
317422	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
317425	jmjC domain
317425	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
317426	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
317427	Protein kinase domain
317428	HMG (high mobility group) box
317433	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
317439	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
317440	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
317444	Cytochrome c/c1 heme lyase
317446	Glutaredoxin
317449	EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be m
317450	EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be m
317451	EXS family. We have named this region the EXS family after (ERD1, XPR1, and SYG1). This family includes C-terminus portions from the SYG1 G-protein associated signal transduction protein from Saccharomyces cerevisae, and sequences that are thought to be m
317454	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
317455	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
317461	Cytochrome c/c1 heme lyase
317468	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
317470	MAM domain. An extracellular domain found in many receptors
317476	Macrophage migration inhibitory factor (MIF)
317476	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
317479	CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14
317479	MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins
317481	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
317482	Uncharacterized BCR, YhhW family COG1741
317488	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
317488	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
317489	Glutamine amidotransferase class-I
317491	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
317496	Protein kinase domain
317497	Ribosomal protein L19e
317498	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
317501	Ribosomal protein L13e
317506	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
317506	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
317509	WD domain, G-beta repeat
317510	Actin
317515	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
317517	Ribosomal protein L34e
317517	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
317520	Clathrin light chain
317521	Protein kinase domain
317521	Protein kinase C terminal domain
317525	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
317525	Vitamin K-dependent carboxylation/gamma-carboxyglutamic (GLA) domain. This domain is responsible for the high-affinity binding of calcium ions. This domain contains post-translational modifications of many glutamate residues by Vitamin K-dependent carboxy
317533	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
317534	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
317537	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
317538	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
317541	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
317544	Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain
317545	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
317550	ADP-ribosylation factor family
317550	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
317550	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
317552	ENV polyprotein (coat polyprotein)
317553	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
317553	TIR domain. The TIR domain is an intracellular signaling domain found in MyD88, interleukin 1 receptor and the Toll receptor. Called TIR (by SMART?) for Toll - Interleukin - Resistance
317554	Proteasome A-type and B-type
317559	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
317561	Josephin
317566	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
317566	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
317566	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
317569	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
317569	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
317571	HMG (high mobility group) box
317572	HMG (high mobility group) box
317573	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
317573	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
317574	HMG (high mobility group) box
317575	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
317575	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
317575	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
317576	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
317576	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
317580	ADP-ribosylation factor family
317580	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
317581	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
317583	HMG (high mobility group) box
317584	HMG (high mobility group) box
317585	HMG (high mobility group) box
317588	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
317589	Protein kinase domain
317590	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
317593	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
317597	HMG (high mobility group) box
317598	PH domain. PH stands for pleckstrin homology
317599	E1-E2 ATPase
317601	HMG (high mobility group) box
317602	HMG (high mobility group) box
317604	HMG (high mobility group) box
317605	HMG (high mobility group) box
317606	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
317607	7 transmembrane receptor (rhodopsin family)
317608	7 transmembrane receptor (rhodopsin family)
317608	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
317609	HMG (high mobility group) box
317612	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
317613	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
317615	Autophagy protein Apg12. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg12 is covalently bound to Apg5
317615	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
317618	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
317625	Polyprenyl synthetase
317626	Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length
317627	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
317627	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
317628	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
317631	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
317631	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
317646	Ribosomal protein L13
317649	Eukaryotic initiation factor 4E
317674	Frizzled/Smoothened family membrane region. This family contains the membrane spanning region of frizzled and smoothened receptors. This membrane region is predicted to contain seven transmembrane alpha helices. Proteins related to drosophila frizzled ar
317675	Biopterin-dependent aromatic amino acid hydroxylase. This family includes phenylalanine-4-hydroxylase, the phenylketonuria disease protein
317677	CUB domain
317677	Sushi domain (SCR repeat)
317701	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
317702	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
317703	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
317705	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
317717	Synaptobrevin
317719	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of
317750	Sodium/calcium exchanger protein. This is a family of sodium/calcium exchanger integral membrane proteins. This family covers the integral membrane regions of the proteins. Sodium/calcium exchangers regulate intracellular Ca2+ concentrations in many cell
319100	7 transmembrane receptor (rhodopsin family)
319101	Intermediate filament protein
319104	7 transmembrane receptor (rhodopsin family)
319105	7 transmembrane receptor (rhodopsin family)
319106	7 transmembrane receptor (rhodopsin family)
319107	7 transmembrane receptor (rhodopsin family)
319146	Interferon alpha/beta domain
319164	Core histone H2A/H2B/H3/H4
319166	Core histone H2A/H2B/H3/H4
319179	Core histone H2A/H2B/H3/H4
319179	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
319189	Core histone H2A/H2B/H3/H4
319189	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
319195	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
319196	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
319196	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
319197	7 transmembrane receptor (rhodopsin family)
319197	7 transmembrane receptor (rhodopsin family)
319200	7 transmembrane receptor (rhodopsin family)
319210	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
319214	7 transmembrane receptor (rhodopsin family)
319214	7 transmembrane receptor (rhodopsin family)
319217	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
319239	7 transmembrane receptor (rhodopsin family)
319239	7 transmembrane receptor (rhodopsin family)
319249	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
319259	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
319262	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
319281	L1 transposable element
319281	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
319281	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
319322	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
319322	PSP. Proline rich domain found in numerous spliceosome associated proteins
319322	Domain of unknown function (DUF382). This domain is specific to the human splicing factor 3b subunit 2 and it's orthologs
319322	SAP domain. The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins
319358	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
319387	Olfactomedin-like domain
319387	Galactose binding lectin domain
319387	7 transmembrane receptor (Secretin family)
319387	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
319387	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
319387	Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein coup
319430	7 transmembrane receptor (rhodopsin family)
319430	7 transmembrane receptor (rhodopsin family)
319433	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
319446	Renal dipeptidase
319446	Renal dipeptidase
319448	Fibronectin type III domain
319468	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
319480	von Willebrand factor type A domain
319480	von Willebrand factor type A domain
319480	FG-GAP repeat. This family contains the extracellular repeat that is found in up to seven copies in alpha integrins. This repeat has been predicted to fold into a beta propeller structure. The repeat is called the FG-GAP repeat after two conserved motifs
319481	WD domain, G-beta repeat
319482	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
319482	von Willebrand factor type D domain
319482	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
319489	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
319504	Fibronectin type III domain
319504	Fibronectin type III domain
319504	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
319508	C2 domain
319508	C2 domain
319508	C2 domain
319518	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
319518	Glycine cleavage T-protein (aminomethyl transferase). This is a family of glycine cleavage T-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in
319520	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
319520	Protein-tyrosine phosphatase
319520	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
319520	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
319535	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
319545	ENV polyprotein (coat polyprotein)
319554	NUDIX domain
319555	WD domain, G-beta repeat
319573	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
319583	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not
319583	ATP dependent DNA ligase domain. This domain belongs to a more diverse superfamily, including pfam01331 and pfam01653
319583	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
319586	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
319601	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
319606	Importin-beta N-terminal domain
319615	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
319625	Aldose 1-epimerase
319634	GAF domain. Domain present in phytochromes and cGMP-specific phosphodiesterases
319642	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
319642	ADP-ribosylation factor family
319642	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
319642	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
319653	Mitochondrial carrier protein
319653	Mitochondrial carrier protein
319660	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. M
319679	L1 transposable element
319679	Actin interacting protein 3
319679	dUTPase. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate
319679	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
319679	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
319679	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
319679	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
319701	F-box domain
319710	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
319734	Cache domain
319740	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
319765	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
319765	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
319790	WD domain, G-beta repeat
319790	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes.
319800	Sugar (and other) transporter
319800	Sugar (and other) transporter
319801	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
319817	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
319836	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
319848	LacY proton/sugar symporter. This family is closely related to the sugar transporter family
319875	Trypsin
319888	Acyltransferase family. This family includes a range of acyltransferase enzymes. This domain is found in many as yet uncharacterised C. elegans proteins and it is approximately 300 amino acids long
319888	Acyltransferase family. This family includes a range of acyltransferase enzymes. This domain is found in many as yet uncharacterised C. elegans proteins and it is approximately 300 amino acids long
319922	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
319922	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
319924	Phosphotyrosine interaction domain (PTB/PID)
319924	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
319924	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
319933	L1 transposable element
319934	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
319934	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
319944	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
319945	Phosphoadenosine phosphosulfate reductase family. This domain is found in phosphoadenosine phosphosulfate (PAPS) reductase enzymes or PAPS sulfotransferase. PAPS reductase is part of the adenine nucleotide alpha hydrolases superfamily also including N ty
319953	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
319953	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
319965	C2 domain
319991	Kinesin motor domain
319991	Kinesin motor domain
319997	Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342
320024	Carboxylesterase
320040	Zinc finger, C3HC4 type (RING finger)
320043	7 transmembrane receptor (rhodopsin family)
320067	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
320078	Olfactomedin-like domain
320078	Olfactomedin-like domain
320100	GrpE
320106	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembr
320106	Tryptophan/tyrosine permease family
320106	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
320109	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
320119	MIT domain
320119	Protein kinase domain
320127	Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown
320129	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
320129	Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits
320139	Protein-tyrosine phosphatase
320157	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
320182	Helix-loop-helix DNA-binding domain
320182	Helix-loop-helix DNA-binding domain
320183	SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four decad
320191	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
320202	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
320204	Conserved hypothetical protein 95
320204	Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat
320204	N2,N2-dimethylguanosine tRNA methyltransferase. This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA. The TRM1 gene of Saccharomyces cerevisiae is necessary for the N2,N2-dimethylguanosine modification of both mitochondrial and cytoplasmic tRNAs. The e
320213	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
320244	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
320244	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
320267	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
320267	KH domain. KH motifs probably bind RNA directly. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
320292	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
320302	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain conta
320302	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
320332	Core histone H2A/H2B/H3/H4
320332	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
320338	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
320355	Lipase
320376	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
320404	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
320405	PH domain. PH stands for pleckstrin homology
320407	Lectin C-type domain. This family includes both long and short form C-type
320435	Vacuolar sorting protein 9 (VPS9) domain
320438	ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum
320438	ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum m
320452	2OG-Fe(II) oxygenase superfamily. This family contains members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily. This family includes the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity EC:1.14.11.2
320454	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
320454	Trypsin
320454	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
320484	GTPase-activator protein for Ras-like GTPase. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position
320502	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
320534	Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembra
320560	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
320560	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
320560	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
320560	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be si
320560	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
320560	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
320560	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
320560	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
320563	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
320571	E1-E2 ATPase
320590	Sugar (and other) transporter
320595	jmjC domain
320595	jmjC domain
320595	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
320615	Dopey, N-terminal domain. DopA is the founding member of the Dopey family and is required for correct cell morphology and spatiotemporal organization of multicellular structures in the filamentous fungus Aspergillus nidulans. DopA homologues are found in
320615	Dopey, N-terminal domain. DopA is the founding member of the Dopey family and is required for correct cell morphology and spatiotemporal organization of multicellular structures in the filamentous fungus Aspergillus nidulans. DopA homologues are found in
320634	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
320634	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
320635	Oxidoreductase FAD-binding domain
320635	Oxidoreductase FAD-binding domain
320635	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
320651	Ubiquitin carboxyl-terminal hydrolase family 2
320651	Ubiquitin carboxyl-terminal hydrolases family 2
320655	Per1-like. A member of this family has been implemented in protein processing in the endoplasmic reticulum
320664	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
320676	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
320678	Intermediate filament protein
320678	Intermediate filament protein
320679	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function a
320683	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
320685	Cytidine and deoxycytidylate deaminase zinc-binding region
320712	Fibronectin type III domain
320712	Fibronectin type III domain
320713	Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family
320718	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
320718	Sulfate transporter family. Mutations may lead to several human diseases
320718	STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP
320727	Importin-beta N-terminal domain
320727	CAS/CSE protein, C-terminus. Mammalian cellular apoptosis susceptibility (CAS) proteins are homologous to the yeast chromosome-segregation protein, CSE1. This family aligns the C-terminal halves (approximately). CAS is involved in both cellular apoptosis
320734	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
320734	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
320745	Ubiquitin carboxyl-terminal hydrolases family 2
320747	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
320747	Leucine Rich Repeat. CAUTION: This Pfam may not find all Leucine Rich Repeats in a protein. Leucine Rich Repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in
320769	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
320795	Hr1 repeat
320799	Homeobox domain
320808	WD domain, G-beta repeat
320825	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
320858	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
320864	Intermediate filament protein
320873	Cadherin domain
320878	LIM domain. This family represents two copies of the LIM structural domain
320878	FAD binding domain. This domain is involved in FAD binding in a number of enzymes
320878	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
320878	Thi4 family. This family includes a putative thiamine biosynthetic enzyme
320878	LIM domain. This family represents two copies of the LIM structural domain
320878	FAD binding domain. This domain is involved in FAD binding in a number of enzymes
320878	Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain
320878	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
320898	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
320898	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
320909	Vps52 / Sac2 family. Vps52 complexes with Vps53 and Vps54 to form a multi- subunit complex involved in regulating membrane trafficking events
320910	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
320910	Integrins, beta chain. Sequences cut off at repeats due to overlap with EGF
320910	Integrin, beta chain. Sequences cut off at repeats due to overlap with EGF
320940	E1-E2 ATPase
320941	L1 transposable element
320951	Phosphatidylserine decarboxylase. This is a family of phosphatidylserine decarboxylases, EC:4.1.1.65. These enzymes catalyse the reaction: Phosphatidyl-L-serine <=> phosphatidylethanolamine + CO2. Phosphatidylserine decarboxylase plays a central role in t
320974	Fibronectin type III domain
320981	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
320981	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
320982	ADP-ribosylation factor family
320995	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
320997	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
320997	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
321003	metallopeptidase family M24
321018	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
321019	7 transmembrane receptor (rhodopsin family)
321020	7 transmembrane receptor (rhodopsin family)
321021	7 transmembrane receptor (rhodopsin family)
326305	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
326624	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
327655	Histidine acid phosphatase
327657	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
327729	Ribosomal protein L11, RNA binding domain
327729	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
327731	7 transmembrane receptor (rhodopsin family)
327734	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
327736	Ribosomal protein L11, RNA binding domain
327736	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
327737	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
327741	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
327754	ENV polyprotein (coat polyprotein)
327761	Tetraspanin family
327762	SRP54-type protein, GTPase domain. This family includes relatives of the G-domain of the SRP54 family of proteins
327762	UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Some members have large insertions near t
327767	Proteasome A-type and B-type
327775	L1 transposable element
327793	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
327794	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
327799	Zn-finger in ubiquitin-hydrolases and other protein
327803	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
327803	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
327806	ENV polyprotein (coat polyprotein)
327807	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
327807	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
327807	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
327810	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
327814	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
327815	Sulfotransferase protein
327816	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
327816	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
327819	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is n
327819	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
327826	PTB domain (IRS-1 type)
327826	PTB domain (IRS-1 type)
327827	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
327827	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
327827	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
327827	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
327828	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
327830	Protein kinase domain
327836	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
327840	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
327842	Gelsolin repeat
327842	Villin headpiece domain
327843	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
327843	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
327844	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
327853	7 transmembrane receptor (rhodopsin family)
327855	7 transmembrane receptor (rhodopsin family)
327856	7 transmembrane receptor (rhodopsin family)
327859	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
327864	Frataxin-like domain. This family contains proteins that have a domain related to the globular C-terminus of Frataxin the protein that is mutated in Friedreich's ataxia. This domain is found in a family of bacterial proteins. The function of this domain i
327868	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
327870	Ribosomal L39 protein
327871	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
327871	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
327873	VHS domain. Domain present in VPS-27, Hrs and STAM
327873	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
327878	Ribosomal protein S5, C-terminal domain
327880	Ribosomal protein L11, RNA binding domain
327883	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
327884	Ribosomal L15
327884	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
327886	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
327889	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
327889	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
327890	ENV polyprotein (coat polyprotein)
327891	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
327891	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
327892	ENV polyprotein (coat polyprotein)
327900	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
327907	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
327907	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
327907	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
327919	7 transmembrane receptor (rhodopsin family)
327920	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
327920	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
327923	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
327923	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
327923	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
327923	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
327925	Ribosomal protein S28e
327929	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
327930	RasGEF domain. Guanine nucleotide exchange factor for Ras-like small GTPases
327941	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
327942	Uncharacterized LmbE-like protein, COG2120
327944	HMG (high mobility group) box
327951	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
327956	Vitelline membrane outer layer protein I (VOMI). VOMI binds tightly to ovomucin fibrils of the egg yolk membrane. The structure that consists of three beta-sheets forming Greek key motifs, which are related by an internal pseudo three-fold symmetry. Furth
327963	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
327965	7 transmembrane receptor (rhodopsin family)
327972	Ribosomal protein S2
327979	Ribosomal protein L11, RNA binding domain
327979	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
327981	WAP-type (Whey Acidic Protein) 'four-disulfide core'
327983	Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosyn
327991	Protein phosphatase 2C. Protein phosphatase 2C is a Mn++ or Mg++ dependent protein serine/threonine phosphatase
327995	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
328022	Protein kinase domain
328026	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328026	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328026	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328035	Fatty acid desaturase
328042	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328050	HMG (high mobility group) box
328054	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
328054	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
328059	Amino acid permease
328062	Phosphoglycerate kinase
328063	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
328076	Leucine Rich Repeat. CAUTION: This Pfam may not find all Leucine Rich Repeats in a protein. Leucine Rich Repeats are short sequence motifs present in a number of proteins with diverse functions and cellular locations. These repeats are usually involved in
328076	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
328076	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
328077	Ribosomal protein L36e
328079	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
328080	Ribosomal protein S5, N-terminal domain
328081	Ribosomal protein L24e
328083	Phosphatidylethanolamine-binding protein
328089	Ribosomal protein S26e
328090	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328092	Uncharacterized ACR, COG1490
328095	Ribosomal protein S26e
328099	Phosphoribosyl transferase domain. This family includes a range of diverse phosphoribosyl transferase enzymes. This family includes: Adenine phosphoribosyltransferase EC:2.4.2.7, Hypoxanthine-guanine-xanthine phosphoribosyltransferase, Hypoxanthine phosph
328102	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
328104	Proliferating cell nuclear antigen, C-terminal domain. N-terminal and C-terminal domains of PCNA are topologically identical. Three PCNA molecules are tightly associated to form a closed ring encircling duplex DNA
328106	Fibronectin type III domain
328106	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
328106	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
328107	FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1
328107	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
328107	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
328115	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328115	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328115	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328115	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328116	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
328119	Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes
328123	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328133	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
328140	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
328141	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328143	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
328146	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
328148	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
328149	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
328158	Macrophage migration inhibitory factor (MIF)
328167	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
328169	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328169	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328173	Protein kinase domain
328174	SMC family, C-terminal domain. This Pfam family represents a conserved domain towards the C-terminus of the SMC family proteins. A second conserved domain is found at the N-terminus (pfam02463). The SMC (structural maintenance of chromosomes) superfamily
328174	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
328176	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
328177	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328178	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
328182	Ribosomal protein S5, N-terminal domain
328188	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
328196	7 transmembrane receptor (rhodopsin family)
328197	Ribosomal L29e protein family
328198	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
328200	7 transmembrane receptor (rhodopsin family)
328201	Core histone H2A/H2B/H3/H4
328201	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
328204	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
328205	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328205	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328205	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328205	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328208	Core histone H2A/H2B/H3/H4
328208	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
328211	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328226	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328230	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
328237	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
328247	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
328248	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
328251	Core histone H2A/H2B/H3/H4
328258	Mitochondrial carrier protein
328258	Mitochondrial carrier protein
328266	ENV polyprotein (coat polyprotein)
328268	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
328269	Low molecular weight phosphotyrosine protein phosphatase
328274	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
328279	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
328280	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
328282	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-stero
328292	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
328297	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328297	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328297	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328302	Ribosomal protein S11
328306	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328306	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328311	NAC domain
328317	Ribosomal S17
328323	Protein kinase domain
328329	Protein kinase domain
328329	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
328340	Protein of unknown function (DUF229). Members of this family are uncharacterised. They are 500-1200 amino acids in length and share a long region conservation that probably corresponds to several domains
328342	Ribosomal protein L14p/L23e
328343	Ribosomal protein L34e
328344	Ribosomal protein S5, N-terminal domain
328351	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
328355	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328361	L1 transposable element
328374	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328374	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328381	SH2 domain
328386	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328386	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328386	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328395	7 transmembrane receptor (rhodopsin family)
328396	7 transmembrane receptor (rhodopsin family)
328397	7 transmembrane receptor (rhodopsin family)
328401	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
328403	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328403	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328404	ENV polyprotein (coat polyprotein)
328410	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
328413	Uncharacterized ACR, COG1579
328413	HlyD family secretion protein
328413	Myosin head (motor domain)
328413	Intermediate filament protein
328413	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
328413	Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown
328413	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
328413	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
328415	IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: Th
328417	BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not b
328417	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
328418	ATP synthase subunit C
328423	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328423	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328423	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328424	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
328432	CAP-Gly domain. CAP stands for cytoskeleton-associated proteins
328440	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
328440	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
328451	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
328457	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328459	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
328464	Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran
328467	Low molecular weight phosphotyrosine protein phosphatase
328478	7 transmembrane receptor (rhodopsin family)
328481	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
328482	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
328486	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
328491	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi
328491	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
328496	UDP-glucoronosyl and UDP-glucosyl transferase
328500	SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop
328504	Actin
328508	Ribosomal protein L13
328518	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328519	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
328523	ENV polyprotein (coat polyprotein)
328523	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
328528	Ribonucleotide reductase, small chain
328529	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
328532	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
328533	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
328537	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
328539	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
328548	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
328549	HMG (high mobility group) box
328551	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
328572	KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun
328572	TAZ zinc finger. The TAZ2 domain of CBP binds to other transcription factors such as the p53 tumor suppressor protein, E1A oncoprotein, MyoD, and GATA-1
328572	KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun
328580	Spc97 / Spc98 family. The spindle pole body (SPB) functions as the microtubule-organising centre in yeast. Members of this family are spindle pole body (SBP) components such as Spc97 and Spc98 that form a complex with gamma-tubulin
328585	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328586	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
328595	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328595	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328595	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328595	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328599	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
328600	7 transmembrane receptor (rhodopsin family)
328616	7 transmembrane receptor (rhodopsin family)
328617	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
328621	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328625	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
328635	Ribosomal L29e protein family
328640	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
328648	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328648	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328648	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328672	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
328672	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
328689	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328689	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328689	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328689	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328691	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
328695	Sodium:sulfate symporter transmembrane region. There are also some members in this family that belong to the subfamily SODIT1
328695	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
328696	7 transmembrane receptor (rhodopsin family)
328697	7 transmembrane receptor (rhodopsin family)
328698	7 transmembrane receptor (rhodopsin family)
328699	pfam02932, Neur_chan_memb, Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel
328699	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
328701	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
328703	Actin
328707	Eukaryotic initiation factor 4E
328714	Ribosomal protein L19e
328715	Protein kinase domain
328730	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
328731	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
328743	Ribosomal protein S18
328749	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
328751	Enolase, N-terminal domain
328751	Enolase, C-terminal TIM barrel domain
328754	Proteasome A-type and B-type
328755	Polyprenyl synthetase
328758	7 transmembrane receptor (metabotropic glutamate family)
328759	7 transmembrane receptor (metabotropic glutamate family)
328759	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
328760	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328761	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
328764	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328778	ADP-ribosylation factor family
328778	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
328778	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
328779	Sulfotransferase protein
328780	Trypsin
328781	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
328781	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
328782	Trypsin
328790	Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of
328792	7 transmembrane receptor (rhodopsin family)
328795	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
328795	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
328803	Class II histocompatibility antigen, beta domain
328804	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
328805	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
328807	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
328809	Class I Histocompatibility antigen, domains alpha 1 and 2
328810	Ribosomal protein L6
328812	7 transmembrane receptor (rhodopsin family)
328814	Mitochondrial carrier protein
328815	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
328826	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
328831	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
328832	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328832	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328842	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328842	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328842	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328844	ENV polyprotein (coat polyprotein)
328846	Mitochondrial carrier protein
328847	Mitochondrial carrier protein
328849	Ribosomal protein L34e
328851	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
328854	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
328854	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
328860	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
328874	F-box domain
328877	WD domain, G-beta repeat
328878	Ribosomal L28e protein family
328879	Ribosomal protein L36e
328880	ENV polyprotein (coat polyprotein)
328880	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
328881	UDP-glucoronosyl and UDP-glucosyl transferase
328882	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
328883	UDP-glucoronosyl and UDP-glucosyl transferase
328884	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
328889	ENV polyprotein (coat polyprotein)
328890	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
328895	PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi
328895	TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s
328897	GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the do
328898	L1 transposable element
328899	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328899	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
328899	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328899	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
328899	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328901	Thrombospondin N-terminal -like domain
328902	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
328908	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
328909	Cadherin domain
328909	Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic
328911	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
328911	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
328911	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
328918	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
328927	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
328927	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
328940	Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family appears to be distantly related to pfam01678, including containing a weak internal
328948	HMG (high mobility group) box
328954	Ribosomal protein L10
328959	Ribosomal L29e protein family
328964	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
328967	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
328967	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
328967	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
328971	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
328973	WD domain, G-beta repeat
328975	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
328992	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
329000	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
329008	Homocysteine S-methyltransferase. This is a family of related homocysteine S-methyltransferases enzymes: 5-methyltetrahydrofolate--homocysteine S-methyltransferases also known EC:2.1.1.13,
329020	Ribosomal protein S7e
329023	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
329027	Uncharacterized ACR, YagE family COG1723
329028	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
329029	7 transmembrane receptor (rhodopsin family)
329030	7 transmembrane receptor (rhodopsin family)
329031	ENV polyprotein (coat polyprotein)
329032	Ribosomal protein S2
329037	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
329048	Ribosomal S17
329052	HMG (high mobility group) box
329055	ab-hydrolase associated lipase region
329055	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
329064	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
329064	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
329076	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
329080	Ribosomal L29e protein family
329081	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
329087	14-3-3 protein
329088	Ribosomal protein S15
329089	ENV polyprotein (coat polyprotein)
329089	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
329090	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329090	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329090	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329093	Zinc carboxypeptidase
329094	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329106	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
329111	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
329112	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
329112	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
329113	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329116	IMP4 family. This family includes IMP4, which is involved ribosomal RNA processing
329126	Ribosomal protein L13e
329133	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
329136	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
329136	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
329139	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
329141	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
329141	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329143	Mitochondrial carrier protein
329144	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
329146	Protein kinase domain
329147	ZIP Zinc transporter. The ZIP family consists of zinc transport proteins and many putative metal transporters. The main contribution to this family is from the Arabidopsis thaliana ZIP protein family these proteins are responsible for zinc uptake in the p
329149	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
329154	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
329161	Ribosomal protein L6e
329162	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329169	Cytochrome c oxidase subunit Vb
329179	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329179	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329179	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329179	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329194	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
329199	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
329216	Ribosomal L18ae protein family
329218	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
329219	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329219	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329220	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329221	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329221	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329221	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329222	F-actin capping protein alpha subunit
329226	Protein of unknown function DUF51. This family contains members in prokaryotes and eukaryotes. None of the members has a known function
329232	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
329234	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
329234	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
329236	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
329244	Interleukin 10
329248	Phosphatidylinositol 3- and 4-kinase
329249	C2 domain
329251	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
329259	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329259	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329260	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
329260	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
329262	ENV polyprotein (coat polyprotein)
329263	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329263	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329278	Fibronectin type III domain
329278	Fibronectin type III domain
329278	Fibrinogen beta and gamma chains, C-terminal globular domain
329281	GDP dissociation inhibitor
329289	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329289	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329289	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329289	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329293	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
329296	Ribosomal protein L6e
329299	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329299	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329302	Protein kinase domain
329303	CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations
329310	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329319	MYND finger
329320	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
329320	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329322	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
329324	C2 domain
329326	Ribosomal protein S5, C-terminal domain
329327	7 transmembrane receptor (metabotropic glutamate family)
329328	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329328	Natural resistance-associated macrophage protein. The natural resistance-associated macrophage protein (NRAMP) family consists of Nramp1, Nramp2, and yeast proteins Smf1 and Smf2. The NRAMP family is a novel family of functional related proteins defined b
329331	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329331	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329331	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329331	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329334	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329334	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329336	mbt repeat. The function of this repeat is unknown, but is found in a number of nuclear proteins such as drosophila sex comb on midleg protein. The repeat is found in up to four copies. The repeat contains a completely conserved glutamate at its amino ter
329337	ATP synthase
329339	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
329341	Nucleoside diphosphate kinase
329342	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
329344	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
329344	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329355	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329357	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
329357	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
329362	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
329362	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
329363	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
329364	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329364	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329378	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
329383	Ribosomal protein S2
329384	Peptidyl-tRNA hydrolase
329389	Ribosomal protein L36e
329390	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
329396	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
329398	Ribosomal L28e protein family
329399	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
329399	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329400	ENV polyprotein (coat polyprotein)
329403	Ribosomal S3Ae family
329416	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
329417	Low-density lipoprotein receptor domain class A
329419	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
329421	Myosin head (motor domain)
329424	Adenylate kinase
329432	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
329435	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
329440	Enolase, N-terminal domain
329440	Enolase, C-terminal TIM barrel domain
329440	Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain
329441	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329443	L1 transposable element
329443	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
329445	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
329445	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
329447	7 transmembrane receptor (rhodopsin family)
329448	7 transmembrane receptor (rhodopsin family)
329450	7 transmembrane receptor (rhodopsin family)
329454	7 transmembrane receptor (rhodopsin family)
329457	7 transmembrane receptor (rhodopsin family)
329458	7 transmembrane receptor (rhodopsin family)
329459	7 transmembrane receptor (rhodopsin family)
329460	7 transmembrane receptor (rhodopsin family)
329461	7 transmembrane receptor (rhodopsin family)
329462	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
329463	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329470	Aminotransferase class I and II
329472	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
329473	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
329474	Ribosomal L15
329477	Transport protein particle (TRAPP) component, Bet3. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is an 800kDa that contains at least 10 subunits
329482	Doublecortin
329482	Doublecortin
329485	Ribosomal protein L13e
329486	Cyclophilin type peptidyl-prolyl cis-trans isomerase
329489	7 transmembrane receptor (rhodopsin family)
329493	HMG (high mobility group) box
329502	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lyso
329502	Lysophospholipase catalytic domain. This family consists of Lysophospholipase / phospholipase B EC:3.1.1.5 and cytosolic phospholipase A2 EC:3.1.4 which also has a C2 domain pfam00168. Phospholipase B enzymes catalyse the release of fatty acids from lysop
329504	Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.- . This family may need divides a the full alignment contains a significantly shorter mouse sequence
329509	Protein kinase domain
329512	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329516	Protein kinase domain
329520	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
329521	Eukaryotic ribosomal protein L18
329522	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329522	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329522	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329522	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329524	Thrombospondin type 1 domain
329524	AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges
329528	Spumavirus gag protein
329532	Hsp90 protein
329533	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
329539	Endomembrane protein 70
329547	LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
329547	pfam02886, LBP_BPI_CETP_C, LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar
329553	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329553	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329553	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329553	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329555	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
329555	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329560	Cyclophilin type peptidyl-prolyl cis-trans isomerase
329567	Helicase. This family consists of Helicases from the Herpes viruses. Helicases are responsible for the unwinding of DNA and are essential for replication and completion of the viral life cycle
329573	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
329574	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
329579	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
329581	Inhibitor of Apoptosis domain. BIR stands for 'Baculovirus Inhibitor of apoptosis protein Repeat' Also known as IAP repeat
329585	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329586	Ribosomal protein L15
329589	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329589	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329592	ENV polyprotein (coat polyprotein)
329592	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
329598	ENV polyprotein (coat polyprotein)
329602	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
329603	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
329635	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
329652	Carboxylesterase
329653	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
329653	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
329655	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
329656	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329656	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329660	7 transmembrane receptor (metabotropic glutamate family)
329660	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
329662	ENV polyprotein (coat polyprotein)
329662	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
329662	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
329662	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
329666	7 transmembrane receptor (rhodopsin family)
329667	Zinc finger, C3HC4 type (RING finger)
329669	Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase
329671	Class I Histocompatibility antigen, domains alpha 1 and 2
329674	Intermediate filament protein
329675	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
329675	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329681	Ribosomal protein S5, C-terminal domain
329686	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329686	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329692	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
329693	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
329694	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
329714	Core histone H2A/H2B/H3/H4
329715	Core histone H2A/H2B/H3/H4
329720	HMG (high mobility group) box
329723	Translationally controlled tumor protein
329725	Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran
329727	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
329727	uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain
329727	DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways
329732	Glycosyl hydrolases family 18
329736	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329736	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329736	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329736	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329749	Ribosomal protein S6e
329757	Ribosomal protein L11, RNA binding domain
329757	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
329761	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329761	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329764	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329777	Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this
329777	Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this
329777	GHMP kinases putative ATP-binding protein
329777	Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this f
329778	Eukaryotic porin
329779	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
329783	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329786	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
329786	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
329787	ENV polyprotein (coat polyprotein)
329787	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
329787	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
329787	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
329787	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
329788	L1 transposable element
329788	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
329790	ENV polyprotein (coat polyprotein)
329791	Ribosomal protein S12
329792	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
329799	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329801	Oxidoreductase FAD-binding domain
329801	Oxidoreductase NAD-binding domain. Xanthine dehydrogenases, that also bind FAD/NAD, have essentially no similarity
329804	Ribosomal protein S24e
329808	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329808	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329810	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
329818	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329821	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329823	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329823	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329823	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329824	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
329828	Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases
329841	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
329846	Translationally controlled tumor protein
329855	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
329855	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329857	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329857	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329863	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
329864	HMG (high mobility group) box
329865	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
329867	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329867	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329867	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329869	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
329874	Ribosomal protein L34e
329879	Interferon alpha/beta domain
329881	Interferon alpha/beta domain
329882	Interferon alpha/beta domain
329885	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329889	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
329889	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
329890	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329893	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
329894	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
329920	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329920	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329921	ENV polyprotein (coat polyprotein)
329921	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
329921	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
329921	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
329924	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
329925	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
329926	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
329927	EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains
329928	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
329931	CAP protein
329932	Core histone H2A/H2B/H3/H4
329934	Fork head domain
329936	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
329937	Cyclin-dependent kinase regulatory subunit
329941	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
329948	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
329948	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
329948	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
329948	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
329951	C2 domain. Phosphoinositide 3-kinase region postulated to contain a C2 domain. Outlier of pfam00168 family
329953	Ribosomal protein S8
329954	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
329954	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
329957	Ribosomal protein S27
329959	Transcription factor S-II (TFIIS)
329962	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
329963	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
329971	Ribosomal protein S5, N-terminal domain
329971	Ribosomal protein S5, C-terminal domain
329972	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
329976	Mitochondrial carrier protein
329977	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
329977	FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif
329983	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
329987	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
329987	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
329991	HMG (high mobility group) box
329993	Carboxylesterase
329994	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
329996	HMG (high mobility group) box
330011	Agrin NtA domain. Agrin is a multidomain heparan sulphate proteoglycan, that is a key organizer for the induction of postsynaptic specializations at the neuromuscular junction. Binding of agrin to basement membranes requires the amino terminal (NtA) domai
330012	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
330014	Urea transporter. Members of this family transport urea across membranes. The family includes a bacterial homologue
330017	Palmitoyl protein thioesterase
330018	Palmitoyl protein thioesterase
330018	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330020	HMG (high mobility group) box
330021	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
330022	ENV polyprotein (coat polyprotein)
330027	Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzym
330028	Ribosomal protein S27
330032	Ribosomal protein L31e
330034	ENV polyprotein (coat polyprotein)
330034	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
330034	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
330037	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330037	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330037	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330042	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
330046	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
330051	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
330057	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330057	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330062	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
330064	Sodium:solute symporter family
330067	Zinc finger, C3HC4 type (RING finger)
330079	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330079	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330089	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330093	ADP-ribosylation factor family
330093	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
330094	Ribosomal protein L13e
330119	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
330119	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
330122	Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds
330124	Ribosomal protein L13e
330125	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330125	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330125	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330129	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
330129	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
330130	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330130	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330136	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330136	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330136	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330149	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
330155	Ribosomal protein L34e
330158	7 transmembrane receptor (metabotropic glutamate family)
330159	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
330160	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
330161	SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation
330167	Protein kinase domain
330167	PH domain. PH stands for pleckstrin homology
330171	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly
330193	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
330200	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
330206	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330206	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330206	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330206	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330207	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
330213	WD domain, G-beta repeat
330216	Metallo-beta-lactamase superfamily
330216	Metallo-beta-lactamase superfamily
330222	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
330223	Fibronectin type III domain
330230	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
330236	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330242	DNA directed RNA polymerase, 7 kDa subunit
330250	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330251	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330251	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330251	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330257	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
330258	L1 transposable element
330259	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
330260	Strictosidine synthase. Strictosidine synthase (E.C. 4.3.3.2) is a key enzyme in alkaloid biosynthesis. It catalyses the condensation of tryptamine with secologanin to form strictosidine
330260	Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis
330260	Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyses organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. H
330264	Actin
330268	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
330269	Ribosomal protein L19e
330272	HMG (high mobility group) box
330273	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
330275	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
330287	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
330294	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
330301	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
330306	Cadherin domain
330308	Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydroge
330309	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
330326	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
330327	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
330328	ENV polyprotein (coat polyprotein)
330328	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
330330	ENV polyprotein (coat polyprotein)
330333	ENV polyprotein (coat polyprotein)
330333	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
330334	ENV polyprotein (coat polyprotein)
330340	Ribosomal protein S7e
330344	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330344	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330344	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330344	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330348	Ribosomal protein L34e
330356	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
330357	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
330357	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
330358	Cyclophilin type peptidyl-prolyl cis-trans isomerase
330359	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330359	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330364	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
330366	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
330370	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330370	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330380	Sec7 domain. The Sec7 domain is a guanine-nucleotide-exchange-factor (GEF) for the pfam00025 family
330387	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
330388	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
330396	MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function
330396	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
330406	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
330411	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330412	ENV polyprotein (coat polyprotein)
330412	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
330412	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
330423	Ribosomal protein S5, C-terminal domain
330423	Ribosomal protein S5, N-terminal domain
330424	HMG (high mobility group) box
330429	Lectin C-type domain. This family includes both long and short form C-type
330430	Protein kinase domain
330434	Putative esterase. This family contains Esterase D. However it is not clear if all members of the family have the same function. This family is possibly related to the pfam00135 family
330436	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
330440	Sulfotransferase protein
330441	HMG (high mobility group) box
330443	S25 ribosomal protein
330444	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
330444	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
330445	MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a small domain that may have a ligand transferase function
330445	RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3
330449	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330449	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330449	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330450	Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included
330450	Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included
330453	Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes
330454	7 transmembrane receptor (rhodopsin family)
330454	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
330455	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
330456	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
330457	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
330459	Mo25 protein family
330463	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
330463	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
330465	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330465	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330465	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330465	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330466	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
330467	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
330470	Fibronectin type II domain
330471	Fibronectin type II domain
330474	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
330475	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
330476	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
330477	eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors
330478	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330478	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330488	Zinc finger, C3HC4 type (RING finger)
330492	ENV polyprotein (coat polyprotein)
330495	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330505	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
330506	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
330507	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
330509	Ribosomal family S4e
330511	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
330515	PPIC-type PPIASE domain. Rotamases increase the rate of protein folding by catalysing the interconversion of cis-proline and trans-proline
330517	ENV polyprotein (coat polyprotein)
330517	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
330519	Trypsin
330525	Eukaryotic ribosomal protein L18
330527	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
330531	7 transmembrane receptor (rhodopsin family)
330532	ENV polyprotein (coat polyprotein)
330532	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
330532	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
330534	7 transmembrane receptor (rhodopsin family)
330535	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
330548	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
330558	Tetraspanin family
330560	Actin
330565	NUDIX domain
330568	Ribosomal protein S28e
330570	Ribosomal protein L31e
330574	ENV polyprotein (coat polyprotein)
330581	7 transmembrane receptor (metabotropic glutamate family)
330581	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
330582	7 transmembrane receptor (metabotropic glutamate family)
330583	7 transmembrane receptor (metabotropic glutamate family)
330584	7 transmembrane receptor (metabotropic glutamate family)
330585	7 transmembrane receptor (rhodopsin family)
330586	7 transmembrane receptor (rhodopsin family)
330587	7 transmembrane receptor (rhodopsin family)
330588	7 transmembrane receptor (rhodopsin family)
330589	7 transmembrane receptor (rhodopsin family)
330590	7 transmembrane receptor (metabotropic glutamate family)
330591	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
330605	Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes
330609	Conserved nucleoporin domain
330613	7 transmembrane receptor (rhodopsin family)
330614	7 transmembrane receptor (rhodopsin family)
330615	7 transmembrane receptor (rhodopsin family)
330616	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
330617	Ubiquitin carboxyl-terminal hydrolase family 2
330621	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
330622	7 transmembrane receptor (rhodopsin family)
330623	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
330626	7 transmembrane receptor (rhodopsin family)
330627	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
330627	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
330634	ENV polyprotein (coat polyprotein)
330639	Nucleoside diphosphate kinase
330643	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330646	Sulfotransferase protein
330647	Ubiquitin carboxyl-terminal hydrolase family 2
330654	Protein kinase domain
330656	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
330657	Trypsin
330662	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
330666	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
330675	von Willebrand factor type D domain
330679	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330679	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330683	7 transmembrane receptor (rhodopsin family)
330683	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
330684	7 transmembrane receptor (rhodopsin family)
330684	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
330685	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
330686	Ribosomal protein L23
330688	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
330688	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
330689	Ribosomal protein L23
330693	Exportin-t. Exportin-t is a specific mediator of tRNA export. RanGTP-binding, importin beta-related factor with predominantly nuclear localization. It shuttles rapidly between nucleus and between nucleus and cytoplasm and interacts with nuclear pore compl
330721	Protein kinase domain
330723	Trypsin
330723	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
330724	PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown
330724	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
330727	S25 ribosomal protein
330731	Zinc-binding dehydrogenase
330731	Conserved hypothetical protein 95
330731	Putative RNA methylase family UPF0020. This domain is probably a methylase. It is associated with the THUMP domain that also occurs with RNA modification domains
330731	Met-10+ like-protein. The methionine-10 mutant allele of N. crassa codes for a protein of unknown function. However, homologous proteins have been found in yeast suggesting this protein may be involved in methionine biosynthesis, transport and/or utilizat
330735	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
330735	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
330743	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330743	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330743	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330749	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330753	Cadherin domain
330757	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
330758	Integral membrane protein DUF106. This archaebacterial protein family has no known function. Members are predicted to be integral membrane proteins
330758	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
330759	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330759	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330760	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
330761	HMG (high mobility group) box
330765	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
330770	Disintegrin
330770	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
330770	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
330772	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
330777	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330777	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330777	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330778	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
330783	Ribosomal S3Ae family
330785	Ribosomal protein L35Ae
330786	ENV polyprotein (coat polyprotein)
330786	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
330788	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
330788	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
330790	Extracellular link domain
330790	Extracellular link domain
330803	Phenazine biosynthesis-like protein. PhzC/PhzF is involved in dimerization of two 2,3-dihydro-3-oxo-anthranilic acid molecules to create PCA by P. fluorescens. This family appears to be distantly related to pfam01678, including containing a weak internal
330805	S25 ribosomal protein
330808	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330808	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330810	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
330812	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
330812	Zinc finger, C3HC4 type (RING finger)
330812	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
330813	Galactose binding lectin domain
330814	7 transmembrane receptor (Secretin family)
330814	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
330814	Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein coup
330817	Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine s
330825	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330825	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330836	Amino acid permease
330836	Tryptophan/tyrosine permease family
330836	Amino acid permease
330844	Lectin C-type domain. This family includes both long and short form C-type
330862	PMP-22/EMP/MP20/Claudin family
330870	Exportin-t. Exportin-t is a specific mediator of tRNA export. RanGTP-binding, importin beta-related factor with predominantly nuclear localization. It shuttles rapidly between nucleus and between nucleus and cytoplasm and interacts with nuclear pore compl
330873	Ribosomal protein S6e
330880	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
330885	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
330885	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
330886	ENV polyprotein (coat polyprotein)
330887	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
330889	Ribosomal protein S5, N-terminal domain
330889	Ribosomal protein S5, C-terminal domain
330889	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
330890	Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is unknown
330891	HMG (high mobility group) box
330898	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
330898	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
330899	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
330900	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
330900	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
330905	Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermid
330914	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
330918	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
330926	7 transmembrane receptor (rhodopsin family)
330928	7 transmembrane receptor (rhodopsin family)
330936	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
330938	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
330948	Ribosomal protein S8e
330957	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
330960	Fibronectin type III domain
330960	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
330961	GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins
330975	Ribosomal protein S8
330982	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
330983	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
330984	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
330984	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
330986	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
330988	Ribosomal S3Ae family
330991	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
330998	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
330999	Ribosomal L29e protein family
331002	Ribosomal protein S5, C-terminal domain
331004	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminu
331004	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
331012	Translationally controlled tumor protein
331014	HMG (high mobility group) box
331026	Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars
331027	F-box domain
331039	14-3-3 protein
331046	Transglutaminase family
331046	Transglutaminase family, C-terminal ig like domain
331046	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
331046	Transglutaminase family
331046	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
331047	SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145
331049	LysM domain. The LysM (lysin motif) domain is about 40 residues long. It is found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. The structure of this domain is known
331050	Tropomodulin. Tropomodulin is a novel tropomyosin regulatory protein that binds to the end of erythrocyte tropomyosin and blocks head-to-tail association of tropomyosin along actin filaments. Limited proteolysis shows this protein is composed of two domai
331051	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
331052	Protein kinase domain
331056	Somatotropin hormone family
331062	Adenylate and Guanylate cyclase catalytic domain
331065	L1 transposable element
331066	Helix-loop-helix DNA-binding domain
331068	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331069	Cadherin domain
331071	ENV polyprotein (coat polyprotein)
331072	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
331072	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
331074	Apoptosis regulator proteins, Bcl-2 family
331080	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331081	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
331085	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
331085	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
331085	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
331089	Defensin propeptide
331091	Defensin propeptide
331094	L1 transposable element
331101	Protein kinase domain
331102	14-3-3 protein
331103	7 transmembrane receptor (rhodopsin family)
331104	Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast member of this family
331104	pfam02935, COX7C, Cytochrome c oxidase subunit VIIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIIc. The yeast mem
331126	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
331133	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
331134	7 transmembrane receptor (metabotropic glutamate family)
331138	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
331142	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
331145	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
331146	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
331148	Protein kinase domain
331149	ENV polyprotein (coat polyprotein)
331149	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
331153	7 transmembrane receptor (metabotropic glutamate family)
331161	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
331163	7 transmembrane receptor (metabotropic glutamate family)
331175	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331176	7 transmembrane receptor (metabotropic glutamate family)
331177	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
331178	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
331182	ENV polyprotein (coat polyprotein)
331182	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
331189	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
331192	7 transmembrane receptor (rhodopsin family)
331198	7 transmembrane receptor (rhodopsin family)
331205	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
331205	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
331209	Somatotropin hormone family
331210	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
331212	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
331215	Somatotropin hormone family
331221	7 transmembrane receptor (metabotropic glutamate family)
331224	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
331226	Cytochrome C and Quinol oxidase polypeptide I
331236	7 transmembrane receptor (rhodopsin family)
331239	7 transmembrane receptor (rhodopsin family)
331241	7 transmembrane receptor (rhodopsin family)
331242	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
331244	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
331247	5-formyltetrahydrofolate cyclo-ligase family. 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyses the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF i
331252	7 transmembrane receptor (metabotropic glutamate family)
331256	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
331262	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
331264	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
331267	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
331267	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
331274	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
331276	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
331285	ENV polyprotein (coat polyprotein)
331288	Protein kinase domain
331297	Protein kinase domain
331301	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
331306	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
331307	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
331310	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331323	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
331330	Adenylate and Guanylate cyclase catalytic domain
331331	Histone deacetylase family. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related
331335	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
331345	7 transmembrane receptor (metabotropic glutamate family)
331348	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
331348	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
331348	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
331355	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331357	Ribosomal L18ae protein family
331358	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
331360	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
331360	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
331362	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
331365	L1 transposable element
331376	NUDIX domain
331378	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
331381	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
331382	Eukaryotic porin
331383	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
331389	ENV polyprotein (coat polyprotein)
331389	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
331389	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
331393	Ribosomal protein L6
331394	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
331398	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
331399	Ribosomal protein S27
331400	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
331403	Cellulose synthase. Cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues, is the major component of wood and thus paper, and is synthesized by plants, most algae, some bacteria and fungi, and even some animals. The genes that
331403	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
331404	Ham1 family. This family consists of the HAM1 protein and hypothetical archaeal bacterial and C. elegans proteins. HAM1 controls 6-N-hydroxylaminopurine (HAP) sensitivity and mutagenesis in S. cerevisiae. The HAM1 protein protects the cell from HAP, eithe
331409	7 transmembrane receptor (rhodopsin family)
331410	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
331411	ENV polyprotein (coat polyprotein)
331413	ATP synthase
331414	Pou domain - N-terminal to homeobox domain
331415	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
331417	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
331420	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
331428	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
331429	Ribosomal L10
331435	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
331439	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
331439	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
331439	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
331440	Spin/Ssty Family. Spindlin (Spin) is a novel maternal transcript present in the unfertilized egg and early embryo. The Y-linked spermiogenesis -specific transcript (Ssty) is also expressed during gametogenesis and forms part of this Pfam family. Members o
331443	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
331443	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
331443	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
331445	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
331446	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
331448	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
331449	Papain family cysteine protease
331451	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
331454	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
331454	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
331457	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
331457	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
331458	ENV polyprotein (coat polyprotein)
331459	Actin
331462	Translationally controlled tumor protein
331462	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
331463	Ribosomal protein L21e
331466	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
331467	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
331468	Glutamine amidotransferase class-I
331469	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
331469	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
331471	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
331474	Retrotransposon gag protein. Gag or Capsid-like proteins from LTR retrotransposons. There is a central motif QGXXEXXXXXFXXLXXH that is common to Retroviridae gag-proteins, but is poorly conserved
331476	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
331480	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
331482	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
331483	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
331488	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
331489	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
331490	Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran
331491	Fibronectin type III domain
331493	7 transmembrane receptor (rhodopsin family)
331494	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
331494	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
331496	Kinesin motor domain
331498	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
331498	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
331498	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
331500	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
331502	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
331504	ENV polyprotein (coat polyprotein)
331504	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
331505	HMG (high mobility group) box
331506	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
331507	Low molecular weight phosphotyrosine protein phosphatase
331510	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
331512	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
331514	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
331515	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
331515	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
331522	Hsp90 protein
331523	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
331526	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
331526	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
331535	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
331535	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
331536	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
331539	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
331542	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
331543	HMG (high mobility group) box
331546	Sodium:neurotransmitter symporter family
331551	SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop
331552	Actin
331553	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
331557	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
331559	ATP-grasp domain. This family does not contain all known ATP-grasp domain members. This family includes a diverse set of enzymes that possess ATP-dependent carboxylate-amine ligase activity
331559	CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP
331559	Phosphoribosylglycinamide synthetase, B domain. Phosphoribosylglycinamide synthetase catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by Phosphoribosylglycinamide synthetase is the ATP- dependent addition of 5-phosph
331560	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
331560	Integrase Zinc binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. This domain is the amino-terminal domain zinc binding domain. The central domain is the catal
331560	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
331560	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
331562	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
331564	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
331566	Connexin
331570	Ribosomal protein S28e
331571	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
331571	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
331573	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
331582	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
331585	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
331587	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
331589	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
331599	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
331611	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
331614	Ribosomal protein S6e
331615	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
331616	ENV polyprotein (coat polyprotein)
331616	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
331619	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
331626	pfam02878, PGM_PMM_I, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain I
331646	7 transmembrane receptor (rhodopsin family)
331650	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
331653	ENV polyprotein (coat polyprotein)
331657	Ribosomal protein S2
331665	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
331666	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
331676	Ribosomal protein L19e
331698	7 transmembrane receptor (rhodopsin family)
331703	RNA polymerase Rpb4
331705	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
331708	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
331714	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
331715	metallopeptidase family M24
331722	L1 transposable element
331724	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
331727	Ribosomal protein L6
331740	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
331740	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
331746	Protein kinase domain
331750	Tropomyosin
331750	Myosin head (motor domain)
331750	Intermediate filament protein
331750	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
331750	Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown
331750	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
331752	Anaphase-promoting complex, subunit 10 (APC10)
331755	7 transmembrane receptor (rhodopsin family)
331756	7 transmembrane receptor (rhodopsin family)
331757	7 transmembrane receptor (rhodopsin family)
331758	7 transmembrane receptor (rhodopsin family)
331762	Ribosomal L15
331807	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
331816	Ribosomal protein L11, RNA binding domain
331824	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
331826	Death effector domain
331830	Ribosomal L29e protein family
331844	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
331852	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
331857	Ubiquinol-cytochrome C reductase hinge protein. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 'hinge' protein of the complex which is thought to mediate formatio
331860	Cyclophilin type peptidyl-prolyl cis-trans isomerase
331861	Ribosomal protein L3
331868	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331871	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331873	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331875	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
331875	Zinc knuckle. The zinc knuckle is a zinc binding motif composed of the the following CX2CX4HX4C where X can be any amino acid. The motifs are mostly from retroviral gag proteins (nucleocapsid). Prototype structure is from HIV. Also contains members involv
331876	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331878	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331879	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331880	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331881	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331882	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331883	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331884	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331885	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
331887	Eukaryotic initiation factor 1A
331888	SKIP/SNW domain. This domain is found in chromatin proteins
331890	7 transmembrane receptor (rhodopsin family)
331891	Protein kinase domain
331893	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
331893	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
331897	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
331900	Endoplasmic Reticulum Oxidoreductin 1 (ERO1). Members of this family are required for the formation of disulphide bonds in the ER
331901	Core histone H2A/H2B/H3/H4
331904	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
331909	Core histone H2A/H2B/H3/H4
331909	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
331910	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
331910	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
331921	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
331921	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
331923	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
331924	Cyclophilin type peptidyl-prolyl cis-trans isomerase
331926	Transglutaminase family, C-terminal ig like domain
331934	Uncharacterized protein family UPF0024
331935	Ribosomal protein S6e
331950	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
331950	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
331955	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
331955	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
331984	Protein kinase domain
331985	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
331986	Ribosomal L29e protein family
331987	Kinesin motor domain
331994	Choline/Carnitine o-acyltransferase
331999	GTP cyclohydrolase I. This family includes GTP cyclohydrolase enzymes and a family of related bacterial proteins
332004	Protein kinase domain
332005	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
332011	Translation initiation factor SUI1
332012	SecE/Sec61-gamma subunits of protein translocation complex. SecE is part of the SecYEG complex in bacteria which translocates proteins from the cytoplasm. In eukaryotes the complex, made from Sec61-gamma and Sec61-alpha translocates protein from the cytop
332018	Phosphatidylethanolamine-binding protein
332024	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
332052	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
332060	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
332061	HMG (high mobility group) box
332062	Ribosomal L22e protein family
332072	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
332073	PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi
332076	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
332077	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
332078	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
332082	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
332091	ENV polyprotein (coat polyprotein)
332091	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
332094	Ribosomal L15
332095	Fibrillarin
332098	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
332100	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
332100	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
332102	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
332102	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
332109	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
332110	Protein kinase domain
332112	Thymosin beta-4 family
332116	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
332118	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
332125	Ribosomal L29e protein family
332131	Tropomyosin
332131	Uncharacterized ACR, COG1579
332131	Intermediate filament protein
332131	Not1 N-terminal domain, CCR4-Not complex component
332131	ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit
332131	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
332131	Tektin family. Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers
332131	ATP synthase B/B' CF(0). Part of the CF(0) (base unit) of the ATP synthase. The base unit is thought to translocate protons through membrane (inner membrane in mitochondria, thylakoid membrane in plants, cytoplasmic membrane in bacteria). The B subunits a
332131	IncA protein. Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation
332131	Outer membrane efflux protein. The OEP family (Outer membrane efflux protein) form trimeric channels that allow export of a variety of substrates in Gram negative bacteria. Each member of this family is composed of two repeats. The trimeric channel is com
332131	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
332131	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
332131	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
332143	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
332147	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
332148	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
332158	E1-E2 ATPase
332158	Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
332160	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
332163	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
332175	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
332189	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
332189	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
332198	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
332203	Ribosomal protein S5, C-terminal domain
332215	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
332217	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
332221	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
332222	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
332224	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
332229	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
332233	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
332235	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
332242	HMG14 and HMG17
332244	KOW motif. This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG
332245	ADP-ribosylation factor family
332245	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
332256	Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene
332257	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
332257	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
332261	Ribosomal protein L31e
332261	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
332268	Ribosomal protein L36e
332273	UDP-glucoronosyl and UDP-glucosyl transferase
332274	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
332275	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
332279	Ribosomal protein S5, C-terminal domain
332279	Ribosomal protein S5, N-terminal domain
332280	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
332281	WD domain, G-beta repeat
332283	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
332288	Matrix protein (MA), p15. The matrix protein, p15, is encoded by the gag gene. MA is involved in pathogenicity
332288	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
332297	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
332304	L1 transposable element
332304	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
332310	Ribosomal protein S5, C-terminal domain
332317	ENV polyprotein (coat polyprotein)
332317	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
332333	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
332333	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
332342	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
332359	CENP-B N-terminal DNA-binding domain. Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere. Within the N-terminal 125 residues, there is a DNA-binding region, which binds to a corresponding 17bp CENP-B box sequence. CENP-B di
332359	DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which
332361	Domain of Unknown Function (DUF408)
332365	7 transmembrane receptor (rhodopsin family)
332378	Cyclophilin type peptidyl-prolyl cis-trans isomerase
332396	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
332402	ENV polyprotein (coat polyprotein)
332404	Ribosomal protein S5, C-terminal domain
332404	Ribosomal protein S5, N-terminal domain
332427	Transglycosylase SLT domain. This family is distantly related to pfam00062
332431	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
332450	7 transmembrane receptor (Secretin family)
332450	Hormone receptor domain. This extracellular domain contains four conserved cysteines that probably for disulphide bridges. The domain is found in a variety of hormone receptors. It may be a ligand binding domain
332462	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
332474	Trypsin
332474	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
332482	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi
332482	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
332488	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
332494	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
332499	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
332511	dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be sim
332512	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
332537	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
332537	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
332548	Ribosomal protein L31e
332555	Ribosomal L29e protein family
332560	Protein kinase domain
332568	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
332570	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
332588	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
332588	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
332590	Ribosomal protein L36e
332596	Domain of unknown function DUF59. This family includes prokaryotic proteins of unknown function. The family also includes PhaH from Pseudomonas putida. PhaH forms a complex with PhaF, PhaG and PhaI, which hydroxylates phenylacetic acid to 2-hydroxyphenyla
332597	S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea
332604	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
332604	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
332627	Ribosomal L15
332630	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
332630	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
332631	MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains
332631	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
332645	Ribosomal protein L21e
332652	HMG (high mobility group) box
332664	Nucleoside diphosphate kinase
332671	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
332672	HMG (high mobility group) box
332677	ENV polyprotein (coat polyprotein)
332677	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
332683	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
332693	ATP synthase subunit C
332708	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
332709	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
332714	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
332716	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
332718	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
332719	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
332720	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
332721	Ribosomal protein S24e
332722	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
332723	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
332724	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
332724	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
332726	Eukaryotic-type carbonic anhydrase
332730	ENV polyprotein (coat polyprotein)
332737	ENV polyprotein (coat polyprotein)
332737	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
332738	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved trypt
332741	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
332742	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
332750	7 transmembrane receptor (rhodopsin family)
332764	Cyclophilin type peptidyl-prolyl cis-trans isomerase
332770	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
332775	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
332784	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
332786	Bacterial Ig-like domain (group 2). This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial and phage surface proteins such as intimins
332787	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
332790	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
332790	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
332793	O-methyltransferase. Members of this family are O-methyltransferases. The family includes catechol o-methyltransferase, caffeoyl-CoA O-methyltransferase and a family of bacterial O-methyltransferases that may be involved in antibiotic production
332800	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
332801	Ribosomal protein L6e
332822	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
332844	Protein kinase domain
332844	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
332860	ENV polyprotein (coat polyprotein)
332860	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
332860	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
332860	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
332862	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
332862	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
332866	S25 ribosomal protein
332876	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
332878	ENV polyprotein (coat polyprotein)
332880	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
332880	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
332880	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
332881	L1 transposable element
332897	L1 transposable element
332904	CTF/NF-I family
332934	MYND finger
332937	Transcription factor AP-2
332939	Ribosomal protein S19
332958	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
332971	Amiloride-sensitive sodium channel
332974	Protein kinase domain
332974	ENV polyprotein (coat polyprotein)
332979	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
332987	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
332993	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
332999	Fibronectin type III domain
332999	Protein-tyrosine phosphatase
333010	Enhancer of rudimentary. Enhancer of rudimentary is a protein of unknown function that is highly conserved in plants and animals. This protein is found to be an enhancer of the rudimentary gene
333018	ENV polyprotein (coat polyprotein)
333022	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
333027	Ribosomal L29e protein family
333046	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
333046	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
333067	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
333067	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
333071	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
333074	'chromo' (CHRromatin Organization MOdifier) domain
333074	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
333105	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
333113	Translation initiation factor SUI1
333114	Ribosomal protein L14p/L23e
333116	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333117	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333118	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333119	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333120	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333130	Protein kinase domain
333131	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
333146	Ribosomal L29e protein family
333147	Core histone H2A/H2B/H3/H4
333153	Ribosomal protein L21e
333154	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
333157	Triosephosphate isomerase
333160	Lectin C-type domain. This family includes both long and short form C-type
333162	ENV polyprotein (coat polyprotein)
333162	Poly-adenylate binding protein, unique domain
333163	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
333163	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
333166	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
333177	Ribosomal L18ae protein family
333180	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
333182	Cytochrome oxidase c subunit VIb. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the potentially heme-binding subunit IVb of the oxidase
333185	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
333187	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
333194	Ribosomal protein L23
333200	7 transmembrane receptor (metabotropic glutamate family)
333201	7 transmembrane receptor (metabotropic glutamate family)
333201	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
333202	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
333202	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
333205	7 transmembrane receptor (rhodopsin family)
333206	7 transmembrane receptor (rhodopsin family)
333227	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
333229	ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum m
333238	Ribosomal protein L5
333238	ribosomal L5P family C-terminus. This region is found associated with pfam00281
333253	Protein kinase domain
333253	Sulfotransferase protein
333254	Ribosomal protein L23
333254	Uteroglobin family. Uteroglobin is a homodimer of two identical 70 amino acid polypeptides linked by two disulphide bridges. The precise role of uteroglobin has still to be elucidated
333254	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
333256	7 transmembrane receptor (metabotropic glutamate family)
333256	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
333258	Zinc finger, C3HC4 type (RING finger)
333258	ATP-dependent protease La (LON) domain
333260	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
333261	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
333270	Myosin head (motor domain)
333280	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
333288	Ribosomal protein S5, C-terminal domain
333291	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
333291	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
333296	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
333307	Zinc finger, C3HC4 type (RING finger)
333307	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
333309	Translation initiation factor SUI1
333314	OTU-like cysteine protease. This family is comprised of a group of predicted cysteine proteases, homologous to the Ovarian Tumour (OTU) gene in Drosophila. Members include proteins from eukaryotes, viruses and pathogenic bacterium. The conserved cysteine
333315	Calx-beta domain
333320	Intermediate filament protein
333329	Cyclic nucleotide-binding domain
333329	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
333331	Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold
333331	Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle
333343	Zinc-binding dehydrogenase
333351	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
333355	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
333356	Lectin C-type domain. This family includes both long and short form C-type
333359	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
333366	Cyclin-dependent kinase regulatory subunit
333368	7 transmembrane receptor (rhodopsin family)
333369	7 transmembrane receptor (rhodopsin family)
333380	7 transmembrane receptor (rhodopsin family)
333384	Fork head domain
333395	Ribosomal protein S5, C-terminal domain
333401	Ribosomal protein S27
333412	Ribosomal protein S8
333412	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
333426	ENV polyprotein (coat polyprotein)
333433	NAD-dependent glycerol-3-phosphate dehydrogenase
333439	Ribosomal protein L19e
333443	Ribosomal protein L23
333443	Ribosomal protein L23, N-terminal domain. The N-terminal domain appears to be specific to the eukaryotic ribosomal proteins L25, L23, and L23a
333444	7 transmembrane receptor (rhodopsin family)
333446	7 transmembrane receptor (rhodopsin family)
333447	Protein kinase domain
333452	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
333455	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
333456	Homeobox domain
333460	Protein kinase domain
333461	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333462	SRP19 protein. The signal recognition particle (SRP) binds to the signal peptide of proteins as they are being translated. The binding of the SRP halts translation and the complex is then transported to the endoplasmic reticulum's cytoplasmic surface. The
333466	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333469	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333473	Zinc finger C-x8-C-x5-C-x3-H type (and similar)
333474	7 transmembrane receptor (metabotropic glutamate family)
333475	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
333477	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
333483	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333488	Ribosomal L22e protein family
333491	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
333492	Eukaryotic porin
333492	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
333497	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
333502	Repeat in ubiquitin-activating (UBA) protein
333506	Ribosomal L29e protein family
333506	ENV polyprotein (coat polyprotein)
333511	R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA
333511	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
333515	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
333516	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
333522	ENV polyprotein (coat polyprotein)
333522	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
333533	ENV polyprotein (coat polyprotein)
333534	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
333536	ENV polyprotein (coat polyprotein)
333539	Ribosomal protein L6
333543	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
333543	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
333550	HMG (high mobility group) box
333555	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
333556	Ribosomal protein L13e
333556	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
333557	Ribosomal protein S5, C-terminal domain
333560	Transcription initiation factor IID, 31kD subunit. This family represents the N-terminus of the 31kD subunit (42kD in drosophila) of transcription initiation factor IID (TAFII31). TAFII31 binds to p53, and is an essential requirement for p53 mediated tran
333561	Phosphoglycerate kinase
333563	Fibronectin type III domain
333564	Fibronectin type III domain
333566	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
333567	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
333567	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
333569	ENV polyprotein (coat polyprotein)
333572	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
333573	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
333577	Fructose-bisphosphate aldolase class-I
333582	Translin family. Members of this family include Translin that interacts with DNA and forms a ring around the DNA. This family also includes a protein that was found to interact with translin by yeast two-hybrid screen
333586	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
333586	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
333595	Ribosomal protein L10
333596	Ribosomal protein L6e
333599	Ribosomal protein L23
333608	ENV polyprotein (coat polyprotein)
333614	Death domain
333620	L1 transposable element
333621	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
333626	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
333628	Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor
333629	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
333633	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
333641	L1 transposable element
333649	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
333654	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
333654	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
333660	Translationally controlled tumor protein
333664	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
333664	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
333665	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
333669	Sodium:solute symporter family
333672	7 transmembrane receptor (metabotropic glutamate family)
333676	L1 transposable element
333678	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
333679	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
333680	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
333687	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333688	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
333688	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333689	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333691	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333692	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333694	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333695	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333697	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333698	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333699	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333708	WD domain, G-beta repeat
333710	Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to for
333711	Class I Histocompatibility antigen, domains alpha 1 and 2
333712	Class I Histocompatibility antigen, domains alpha 1 and 2
333714	Class I Histocompatibility antigen, domains alpha 1 and 2
333715	Class I Histocompatibility antigen, domains alpha 1 and 2
333715	Class I Histocompatibility antigen, domains alpha 1 and 2
333715	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333716	Protein kinase domain
333718	Ribosomal protein L6
333719	Homeobox domain
333727	ENV polyprotein (coat polyprotein)
333727	Class I Histocompatibility antigen, domains alpha 1 and 2
333728	Class I Histocompatibility antigen, domains alpha 1 and 2
333730	B-box zinc finger
333732	Class I Histocompatibility antigen, domains alpha 1 and 2
333735	Class II histocompatibility antigen, beta domain
333737	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
333738	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
333739	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
333741	Class I Histocompatibility antigen, domains alpha 1 and 2
333742	Class I Histocompatibility antigen, domains alpha 1 and 2
333742	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333743	Clathrin adaptor complex small chain
333743	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
333745	Class I Histocompatibility antigen, domains alpha 1 and 2
333746	Class I Histocompatibility antigen, domains alpha 1 and 2
333746	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333747	Class I Histocompatibility antigen, domains alpha 1 and 2
333749	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
333749	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
333750	Class I Histocompatibility antigen, domains alpha 1 and 2
333752	L1 transposable element
333753	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
333755	L1 transposable element
333756	MCM2/3/5 family
333758	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
333759	pfam02902, Peptidase_C48, Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn
333760	von Hippel-Lindau disease tumor suppressor protein. VHL forms a ternary complex with the elonginB and elonginC proteins. This complex binds Cul2, which then is involved in regulation of vascular endothelial growth factor mRNA
333761	7 transmembrane receptor (rhodopsin family)
333763	Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway
333763	Phosphotransferase enzyme family. This family consists of bacterial antibiotic resistance proteins, which confer resistance to various aminoglycosides they include:- aminoglycoside 3'-phosphotransferase or kanamycin kinase / neomycin-kanamycin phosphotran
333764	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
333767	Doublecortin
333769	e3 binding domain. This family represents a small domain of the E2 subunit of 2-oxo-acid dehydrogenases responsible for the binding of the E3 subunit
333769	Biotin-requiring enzyme. This family covers two subfamilies, the conserved lysine residue binds biotin in one group and lipoic acid in the other. Note that the model may not currently recognise the Glycine cleavage system H proteins
333773	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
333773	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
333775	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
333775	gag gene protein p24 (core nucleocapsid protein). p24 forms inner protein layer of the nucleocapsid. ELISA tests for p24 is the most commonly used method to demonstrate virus replication both in vivo and in vitro
333778	von Willebrand factor type D domain
333778	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
333778	TILa domain. This cysteine rich domain occurs along side the TIL pfam01826 domain and is likely to be a distantly related relative. This domain is found five to twenty-five times in zonadhesins. The TILa domain is also found twice in an IGG FC binding pro
333783	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
333784	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
333788	ADP-ribosylation factor family
333788	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
333791	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333797	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
333806	KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels
333812	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
333813	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
333818	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
333818	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
333822	Ribosomal protein L3
333827	PUA domain. The PUA domain named after Pseudouridine synthase and Archaeosine transglycosylase, was detected in archaeal and eukaryotic pseudouridine synthases, archaeal archaeosine synthases, a family of predicted ATPases that may be involved in RNA modi
333827	TruB family pseudouridylate synthase (N terminal domain). Members of this family are involved in modifying bases in RNA molecules. They carry out the conversion of uracil bases to pseudouridine. This family includes TruB, a pseudouridylate synthase that s
333830	Ribosomal protein S7e
333831	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
333831	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
333834	jmjN domain
333834	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
333836	Sodium:solute symporter family
333842	Ribosomal protein S5, C-terminal domain
333842	Ribosomal protein S5, N-terminal domain
333845	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
333846	HIT family
333846	Ribosomal protein S6e
333849	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333850	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333851	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333854	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333855	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333856	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333858	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333859	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333866	Core histone H2A/H2B/H3/H4
333868	ENV polyprotein (coat polyprotein)
333869	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
333872	Zinc finger, C3HC4 type (RING finger)
333873	L1 transposable element
333873	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
333873	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
333875	Class I Histocompatibility antigen, domains alpha 1 and 2
333877	L1 transposable element
333879	7 transmembrane receptor (rhodopsin family)
333883	u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087
333886	Adenosine-deaminase (editase) domain
333886	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
333888	ENV polyprotein (coat polyprotein)
333889	dUTPase. dUTPase hydrolyses dUTP to dUMP and pyrophosphate
333896	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333897	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333898	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333899	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333903	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
333906	Integrase DNA binding domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain. The central domain is the catalytic domain
333907	Mitochondrial carrier protein
333908	Ribosomal protein L21e
333910	ENV polyprotein (coat polyprotein)
333913	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
333915	L1 transposable element
333916	L1 transposable element
333924	Env gp36 protein (HERV/MMTV type). This family includes the GP36 protein from retroviruses such as mouse mammary tumour virus (MMTV) and human endogenous retroviruses (HERVs). The GP36 protein is an envelope protein that has a predicted transmembrane heli
337867	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections t
337868	Helix-loop-helix DNA-binding domain
337868	Myogenic Basic domain. This basic domain is found in the MyoD family of muscle specific proteins that control muscle development. The bHLH region of the MyoD family includes the basic domain and the Helix-loop-helix (HLH) motif. The bHLH region mediates
337876	Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys
337924	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
338322	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
338323	PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain
338324	S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins
338337	Sec34-like family. Sec34 and Sec35 form a sub-complex, in a seven protein complex that includes Dor1 (PFAM:PF04124). This complex is thought to be important for tether vesicles to the Golgi
338345	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
338346	7 transmembrane receptor (rhodopsin family)
338355	Uncharacterized ACR, COG1579
338355	FKBP-type peptidyl-prolyl cis-trans isomerase
338355	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
338355	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
338364	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
338365	Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples ha
338370	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
338375	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
338376	Interferon alpha/beta domain
338401	LIM domain. This family represents two copies of the LIM structural domain
338442	7 transmembrane receptor (rhodopsin family)
338442	7 transmembrane receptor (rhodopsin family)
338443	7 transmembrane receptor (rhodopsin family)
338521	Fatty acid hydroxylase
338521	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anc
338521	Fatty acid hydroxylase
338521	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
338523	jmjC domain
338523	Lysosome-associated membrane glycoprotein (Lamp)
338540	Ribosomal protein L13
338557	7 transmembrane receptor (rhodopsin family)
338557	7 transmembrane receptor (rhodopsin family)
338559	Cyclophilin type peptidyl-prolyl cis-trans isomerase
338561	Ribosomal protein L23
338563	Protein kinase domain
338567	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
338569	Repair protein Rad1/Rec1/Rad17
338570	NAC domain
338575	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
338590	PRP38 family. Members of this family are related to the pre mRNA splicing factor PRP38 from yeast. Therefore all the members of this family could be involved in splicing. This conserved region could be involved in RNA binding. The putative domain is about
338596	Flavivirus glycoprotein, central and dimerisation domains
338596	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
338599	Cyclophilin type peptidyl-prolyl cis-trans isomerase
338599	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
338601	Eukaryotic initiation factor 5A hypusine, DNA-binding OB fold
338605	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
338606	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
338611	Ribosomal protein S26e
338614	Helix-loop-helix DNA-binding domain
338619	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
338623	Fork head domain
338626	Mitochondrial carrier protein
338628	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
338629	Fibronectin type III domain
338637	7 transmembrane receptor (rhodopsin family)
338639	7 transmembrane receptor (rhodopsin family)
338642	Olfactomedin-like domain
338642	7 transmembrane receptor (rhodopsin family)
338644	Gag P30 core shell protein. According to Swiss-Prot annotation this protein is the viral core shell protein. P30 is essential for viral assembly
338646	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
338649	Ribosomal protein L11, RNA binding domain
338649	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
338652	Actin
338656	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
338660	Homeobox domain
338662	7 transmembrane receptor (rhodopsin family)
338674	7 transmembrane receptor (rhodopsin family)
338675	7 transmembrane receptor (rhodopsin family)
338676	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
338682	C2 domain
338692	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
338699	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
338718	7 transmembrane receptor (rhodopsin family)
338733	Oxysterol-binding protein
338736	Matrix metalloprotease, N-terminal domain. This family is found N-terminal to the catalytic domain of matrixin
338736	Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis
338736	Hemopexin. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metallopeptidases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metallopeptidases
338742	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
338746	Sec61beta family. This family consists of homologues of Sec61beta - a component of the Sec61/SecYEG protein secretory system. The domain is found in eukaryotes and archaea and is possibly homologous to the bacterial SecG
338747	7 transmembrane receptor (rhodopsin family)
338751	7 transmembrane receptor (rhodopsin family)
338755	7 transmembrane receptor (rhodopsin family)
338757	Fibronectin type III domain
338761	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
338766	Ribosomal protein S26e
338783	Intermediate filament protein
338785	Intermediate filament protein
338785	Intermediate filament protein
338801	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
338806	ADP-ribosylation factor family
338813	Alpha-2-macroglobulin family N-terminal region. This family includes the N-terminal region of the alpha-2-macroglobulin family
338815	Lectin C-type domain. This family includes both long and short form C-type
338819	metallopeptidase family M24
338821	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
338829	Antitermination protein
338829	Bacteriophage lysis protein. This protein is involved in host lysis. This family is not considered to be a peptidase according to the MEROPs database
338830	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
338831	NLP/P60 family. The function of this domain is unknown. It is found in several lipoproteins
338832	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p
338834	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
338834	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
338835	Protein kinase domain
338865	Translocon-associated protein (TRAP), alpha subunit. The alpha-subunit of the TRAP complex (TRAP alpha) is a single-spanning membrane protein of the endoplasmic reticulum (ER) which is found in proximity of nascent polypeptide chains translocating across
338869	Ribosomal protein S2
338870	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
338871	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
338872	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
338872	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
338872	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
338878	7 transmembrane receptor (rhodopsin family)
338879	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
338880	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
338902	TAP42-like family. The TOR signalling pathway activates a cell-growth program in response to nutrients. TIP41 (PFAM:PF04176) interacts with TAP42 and negatively regulates the TOR signaling pathway
338905	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
338912	PH domain. PH stands for pleckstrin homology
338913	Disintegrin
338913	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
338915	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
338919	Ribosomal protein S26e
338922	Ribosomal protein L21e
338952	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
338952	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
338954	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
338958	Eukaryotic ribosomal protein L18
338972	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
338972	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
338973	Cyclic nucleotide-binding domain
338973	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
338978	Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi
339000	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
339000	Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi
339010	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
339011	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
339026	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
339027	Ribosomal protein L31e
339039	WSC domain. This domain may be involved in carbohydrate binding
339039	Lectin C-type domain. This family includes both long and short form C-type
339039	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
339039	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
339039	REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six co
339044	PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold
339070	Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are repr
339086	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t
339087	Eukaryotic-type carbonic anhydrase
339088	BolA-like protein. This family consist of the morpho-protein BolA from E. coli and its various homologs. In E. coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems
339094	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
339097	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
339102	Trypsin
339105	Trypsin
339115	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
339120	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
339122	ADP-ribosylation factor family
339122	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
339123	jmjC domain
339133	Disintegrin
339133	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
339133	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
339162	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
339173	Protein kinase domain
339175	ubiE/COQ5 methyltransferase family
339208	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
339212	Protein kinase domain
339221	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cle
339221	Sulfatase
339221	Metalloenzyme superfamily. This family includes phosphopentomutase and 2,3-bisphosphoglycerate-independent phosphoglycerate mutase. This family is also related to pfam00245. The alignment contains the most conserved residues that are probably involved in
339221	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
339231	ADP-ribosylation factor family
339241	Intermediate filament protein
339242	Intermediate filament protein
339243	Intermediate filament protein
339244	Intermediate filament protein
339246	MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA a
339261	Ribosomal protein L21e
339272	Ribosomal L10
339284	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
339295	RNA dependent RNA polymerase. This family may represent an RNA dependent RNA polymerase. The family also contains the following proteins: 2A protein from bromoviruses putative RNA dependent RNA polymerase from tobamoviruses Non structural polyprotein from
339295	Reverse transcriptase (RNA-dependent DNA polymerase). A reverse transcriptase gene is usually indicative of a mobile element such as a retrotransposon or retrovirus. Reverse transcriptases occur in a variety of mobile elements, including retrotransposons,
339309	Ribosomal protein S8e
339311	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
339319	von Willebrand factor type D domain
339319	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
339320	von Willebrand factor type D domain
339320	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
339321	Ets-domain
339327	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
339347	Fibronectin type II domain
339354	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
339365	Ribosomal protein S19
339371	Ribosomal protein L24e
339371	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
339377	Lectin C-type domain. This family includes both long and short form C-type
339389	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
339390	Lectin C-type domain. This family includes both long and short form C-type
339395	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
339396	Intermediate filament protein
339403	7 transmembrane receptor (rhodopsin family)
339403	7 transmembrane receptor (rhodopsin family)
339408	Cytochrome C and Quinol oxidase polypeptide I
339408	Cytochrome C oxidase subunit II, periplasmic domain
339409	Eukaryotic protein of unknown function, DUF279
339414	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
339416	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
339425	Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6
339451	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
339452	Laminin G domain
339452	Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6
339452	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
339452	Agrin NtA domain. Agrin is a multidomain heparan sulphate proteoglycan, that is a key organizer for the induction of postsynaptic specializations at the neuromuscular junction. Binding of agrin to basement membranes requires the amino terminal (NtA) domai
339452	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
339463	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
339479	Giardia variant-specific surface protein
339479	MAC/Perforin domain. The membrane-attack complex (MAC) of the complement system forms transmembrane channels. These channels disrupt the phospholipid bilayer of target cells, leading to cell lysis and death. A number of proteins participate in the assembl
339487	Protein of unknown function DUF101. The members of this family are uncharacterised. The alignment of these proteins contains several conserved polar residues that might be potential catalytic residues
339487	Protein of unknown function DUF101. The members of this family are uncharacterised. The alignment of these proteins contains several conserved polar residues that might be potential catalytic residues
339488	Transcription factor AP-2
339488	Transcription factor AP-2
339489	Protein kinase domain
339492	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
339501	Trypsin
339509	Repair protein Rad1/Rec1/Rad17
339511	7 transmembrane receptor (rhodopsin family)
339521	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
339530	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
339536	Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but func
339542	Ribosomal protein S17
339544	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
339559	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
339562	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
339565	7 transmembrane receptor (rhodopsin family)
339575	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
339581	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
339587	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
339589	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
339604	Cystatin domain. Very diverse family. Attempts to define separate sub-families failed. Typically, either the N-terminal or C-terminal end is very divergent. But splitting into two domains would make very short families. pfam00666 are related to this famil
339605	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
339608	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
339618	Lipase
339628	WD domain, G-beta repeat
339635	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
339639	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
339656	Sodium:solute symporter family
339661	Gamma-glutamyltranspeptidase
339672	Ribosomal protein S4/S9 N-terminal domain. This family includes small ribosomal subunit S9 from prokaryotes and S16 from metazoans. This domain is predicted to bind to ribosomal RNA. This domain is composed of four helices in the known structure. However
339672	S4 domain. The S4 domain is a small domain consisting of 60-65 amino acid residues that was detected in the bacterial ribosomal protein S4, eukaryotic ribosomal S9, two families of pseudouridine synthases, a novel family of predicted RNA methylases, a yea
339692	Gaa1-like, GPI transamidase component. GPI (glycosyl phosphatidyl inositol) transamidase is a multi-protein complex. Gpi16, Gpi8 and Gaa1 for a sub-complex of the GPI transamidase. GPI transamidase that adds glycosylphosphatidylinositols (GPIs) to newly s
339709	Ribosomal protein L24e
339710	F-actin capping protein, beta subunit
339714	CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP
339715	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
339716	Ribosomal protein S5, C-terminal domain
339716	Ribosomal protein S5, N-terminal domain
339718	Ribosomal protein S2
339721	Spumavirus gag protein
339721	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
339721	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p
339736	Intermediate filament protein
339739	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
339744	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
339745	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
339749	RanBP1 domain
339749	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
339752	Homeobox domain
339755	Protein of unknown function DUF101. The members of this family are uncharacterised. The alignment of these proteins contains several conserved polar residues that might be potential catalytic residues
339756	Ribosomal protein L21e
339761	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
339763	Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included
339772	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
339776	Ubiquinol-cytochrome C reductase hinge protein. The ubiquinol-cytochrome C reductase complex (cytochrome bc1 complex) is a respiratory multienzyme complex. This Pfam family represents the 'hinge' protein of the complex which is thought to mediate formatio
339780	Ribosomal protein L21e
339781	Intermediate filament protein
339798	Actin
339799	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
339805	Ribosomal protein L31e
339806	Ribosomal protein S2
339814	Intermediate filament protein
339821	Sulfotransferase protein
339828	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
339829	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
339839	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
339843	C2 domain
339843	HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus
339843	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
339844	LEM3 (ligand-effect modulator 3) family / CDC50 family. Members of this family have been predicted to contain transmembrane helices. The family member LEM3 is a ligand-effect modulator, mutation of which increases glucocorticoid receptor activity in respo
339845	Arginosuccinate synthase. This family contains a PP-loop motif
339855	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
339857	Cytochrome C and Quinol oxidase polypeptide I
339858	Ctr copper transporter family. The redox active metal copper is an essential cofactor in critical biological processes such as respiration, iron transport, oxidative stress protection, hormone production, and pigmentation. A widely conserved family of hig
339869	Ribosomal protein L21e
339871	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
339879	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
339881	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
339890	Ribosomal protein L21e
339892	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
339896	Aminotransferase class-V
339896	Pyridoxal-dependent decarboxylase conserved domain
339900	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
339903	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
339905	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
339906	Trypsin
339909	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
339909	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
339939	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
339943	Protein kinase domain
339948	Uncharacterised protein family (UPF0170). This family contains uncharacterised eukaryotic proteins of around 250 amino acids
339951	Cadherin domain
339959	Arginosuccinate synthase. This family contains a PP-loop motif
339965	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
339967	Trypsin
339968	Trypsin
339976	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
339976	Zinc finger, C3HC4 type (RING finger)
339976	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
339983	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
339983	Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions
339991	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
339991	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
340014	UDP-glucoronosyl and UDP-glucosyl transferase
340024	Sodium:neurotransmitter symporter family
340026	pfam02910, succ_DH_flav_C, Fumarate reductase/succinate dehydrogenase flavoprotein C-terminal domain. This family contains fumarate reductases, succinate dehydrogenases and L-aspartate oxidases
340029	Ribosomal protein L19e
340042	Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif
340052	7 transmembrane receptor (rhodopsin family)
340057	Ribosomal L39 protein
340066	HMG14 and HMG17
340068	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
340075	Sulfatase
340077	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
340081	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
340096	Core histone H2A/H2B/H3/H4
340116	NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane
340119	Ribosomal protein S5, C-terminal domain
340119	Ribosomal protein S5, N-terminal domain
340120	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
340132	Ribosomal protein L31e
340133	PH domain. PH stands for pleckstrin homology
340135	Ribosomal protein L6e
340147	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
340151	Ribosomal protein S2
340156	Protein kinase domain
340159	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
340167	HORMA domain. The HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is
340179	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
340183	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
340189	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
340207	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
340215	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
340216	HMG (high mobility group) box
340216	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
340217	Homeobox domain
340221	BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction
340223	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their o
340224	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
340232	Exportin-t. Exportin-t is a specific mediator of tRNA export. RanGTP-binding, importin beta-related factor with predominantly nuclear localization. It shuttles rapidly between nucleus and between nucleus and cytoplasm and interacts with nuclear pore compl
340236	Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase acti
340237	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
340243	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their o
340245	Mitochondrial carrier protein
340248	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
340248	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
340250	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
340252	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
340260	Homeobox domain
340265	Ribosomal protein L31e
340265	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
340273	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
340273	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
340274	Arginosuccinate synthase. This family contains a PP-loop motif
340274	tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs
340284	GMP synthase C terminal domain. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. This family is the C-terminal domain specific to the GMP synthases EC:6.3.5.2. In prokaryotes this domain mediates dimerisation. E
340289	Ets-domain
340289	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
340307	Intermediate filament protein
340311	Intermediate filament protein
340312	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
340326	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
340330	Cyclophilin type peptidyl-prolyl cis-trans isomerase
340331	Ribosomal protein L31e
340333	Class I Histocompatibility antigen, domains alpha 1 and 2
340336	Ribosomal protein L19e
340342	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
340343	Phospholipase/Carboxylesterase. This family consists of both phospholipases and carboxylesterases with broad substrate specificity, and is structurally related to alpha/beta hydrolases pfam00561
340348	Tetraspanin family
340348	Tetraspanin family
340354	Doublecortin
340358	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
340359	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
340359	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
340361	Homeobox domain
340361	Pou domain - N-terminal to homeobox domain
340368	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
340371	Protein kinase domain
340376	Ribosomal protein L23
340377	Ribosomal protein L23
340379	Ribosomal protein L23
340386	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
340406	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
340413	Scavenger receptor cysteine-rich domain. These domains are disulphide rich extracellular domains. These domains are found in several extracellular receptors and may be involved in protein-protein interactions
340416	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
340416	Transcription initiation factor IIA, gamma subunit, beta-barrel domain. Accurate transcription in vivo requires at least six general transcription initiation factors, in addition to RNA polymerase II. Transcription initiation factor IIA (TFIIA) is a multi
340417	Ribosomal L10
340417	Thymosin beta-4 family
340418	PCRF domain. This domain is found in peptide chain release factors
340418	Peptidyl-tRNA hydrolase domain. This domain is found in peptide chain release factors such as RF-1 and RF-2, and a number of smaller proteins of unknown function. This domain contains the peptidyl-tRNA hydrolase activity. The domain contains a highly cons
340421	Eukaryotic ribosomal protein L18
340426	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
340433	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
340441	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
340443	Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes
340448	Oxidoreductase molybdopterin binding domain. This domain is found in a variety of oxidoreductases. This domain binds to a molybdopterin cofactor. Xanthine dehydrogenases, that also bind molybdopterin, have essentially no similarity
340449	Exonuclease. This family includes a variety of exonuclease proteins, such as ribonuclease T and the epsilon subunit of DNA polymerase III
340452	Transforming growth factor beta like domain
340458	Homeobox domain
340459	Cytochrome C and Quinol oxidase polypeptide I
340459	NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane
340460	Intermediate filament protein
340472	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
340480	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
340481	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
340487	Interferon alpha/beta domain
340493	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
340499	FMN-dependent dehydrogenase
340499	2-nitropropane dioxygenase. Members of this family catalyse the denitrification of a number of nitroalkanes using either FAD or FMN as a cofactor
340499	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh
340499	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
340501	Transcription factor S-II (TFIIS)
340503	7 transmembrane receptor (metabotropic glutamate family)
340509	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
340509	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
340510	NADH-ubiquinone/plastoquinone oxidoreductase, chain 3
340518	Lipoxygenase
340518	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
340523	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
340529	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
340530	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
340535	7 transmembrane receptor (rhodopsin family)
340536	Eukaryotic protein of unknown function, DUF279
340539	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
340546	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
340548	Sterol desaturase. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain many conserved histidine residues. M
340551	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
340555	7 transmembrane receptor (rhodopsin family)
340561	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
340564	Platelet activating factor acetylhydrolase. Platelet activating factor acetylhydrolase (PAF-AH) is a subfamily of phospholipases A2, responsible for inactivation of platelet-activating factor through cleavage of an acetyl group. At least three intracellul
340565	Ribosomal protein S2
340565	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
340567	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
340567	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
340567	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
340571	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
340571	Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results
340573	HMG (high mobility group) box
340574	Core histone H2A/H2B/H3/H4
340576	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
340576	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
340578	WD domain, G-beta repeat
340591	Eukaryotic-type carbonic anhydrase
340593	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
340598	Intermediate filament protein
340600	bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region
340613	Arginosuccinate synthase. This family contains a PP-loop motif
340614	Ribosomal protein S24e
340623	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
340628	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
340644	Zinc finger, C3HC4 type (RING finger)
340647	C2 domain
340649	Ribosomal protein S26e
340654	ab-hydrolase associated lipase region
340654	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
340663	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
340663	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
340665	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
340671	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
340680	HMG (high mobility group) box
340697	HMG (high mobility group) box
340699	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
340706	Protein kinase domain
340706	von Willebrand factor type A domain
340706	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
340725	Ribosomal protein L19e
340729	Ribosomal protein S8
340733	Glu/Leu/Phe/Val dehydrogenase, dimerisation domain
340733	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
340735	'Cold-shock' DNA-binding domain
340739	tRNA synthetase B5 domain. This domain is found in phenylalanine-tRNA synthetase beta subunits
340742	Amiloride-sensitive sodium channel
340745	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
340749	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
340752	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
340753	CUB domain
340757	CAP protein
340765	Homeobox domain
340767	Ribosomal protein S19e
340768	CUB domain
340769	CUB domain
340780	CBS domain. CBS domains are small intracellular modules of unknown function. They are mostly found in 2 or four copies within a protein. Pairs of CBS domains dimerise to form a stable globular domain. Two CBS domains are found in inosine-monophosphate deh
340780	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
340783	HMG14 and HMG17
340784	Homeobox domain
340786	Ribosomal protein S26e
340808	ENV polyprotein (coat polyprotein)
340810	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
340811	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
340821	Ribosomal protein L13e
340835	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
340840	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
340844	S-adenosyl-L-homocysteine hydrolase
340844	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
340846	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
340852	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
340859	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
340874	ADP-ribosylation factor family
340874	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
340888	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
340896	MAM domain. An extracellular domain found in many receptors
340897	NADH dehydrogenase
340901	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
340913	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
340914	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
340917	Ribosomal protein S2
340931	Ribosomal family S4e
340933	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
340966	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
340967	Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organization, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isoform
340968	Protein kinase domain
340970	Zinc finger, C3HC4 type (RING finger)
340970	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
340973	7 transmembrane receptor (rhodopsin family)
340977	FERM domain (Band 4.1 family). This domain has been renamed the FERM domain, which stands for F for 4.1, E for Ezrin, R for radixin and M for moesin
340977	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
340978	7 transmembrane receptor (rhodopsin family)
340980	7 transmembrane receptor (rhodopsin family)
340996	HMG (high mobility group) box
341002	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
341009	'chromo' (CHRromatin Organization MOdifier) domain
341009	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
341019	Doublecortin
341035	Ribosomal protein L23
341044	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
341056	ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family
341057	7 transmembrane receptor (rhodopsin family)
341058	7 transmembrane receptor (rhodopsin family)
341061	7 transmembrane receptor (rhodopsin family)
341062	7 transmembrane receptor (rhodopsin family)
341065	7 transmembrane receptor (rhodopsin family)
341094	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
341098	Transcription factor IIA, alpha/beta subunit. Transcription initiation factor IIA (TFIIA) is a heterotrimer, the three subunits being known as alpha, beta, and gamma, in order of molecular weight. The N and C-terminal domains of the gamma subunit are repr
341102	7 transmembrane receptor (rhodopsin family)
341105	7 transmembrane receptor (rhodopsin family)
341112	tRNA synthetases class I (W and Y)
341113	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
341115	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
341116	CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a
341119	Inner centromere protein, ARK binding region. This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregatio
341123	Sugar (and other) transporter
341128	7 transmembrane receptor (rhodopsin family)
341130	7 transmembrane receptor (rhodopsin family)
341135	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling.
341138	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
341140	Actin
341141	7 transmembrane receptor (rhodopsin family)
341142	7 transmembrane receptor (rhodopsin family)
341145	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
341146	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anch
341147	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
341149	Biotin/lipoate A/B protein ligase family. This family includes biotin protein ligase, lipoate-protein ligase A and B. Biotin is covalently attached at the active site of certain enzymes that transfer carbon dioxide from bicarbonate to organic acids to for
341151	7 transmembrane receptor (rhodopsin family)
341152	7 transmembrane receptor (rhodopsin family)
341155	MAS20 protein import receptor
341168	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
341170	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
341172	HMG (high mobility group) box
341180	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
341198	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
341200	Ribosomal S3Ae family
341201	Cadherin domain
341202	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
341202	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
341208	Multicopper oxidase. Many of the proteins in this family contain multiple similar copies of this plastocyanin-like domain
341218	Cyclophilin type peptidyl-prolyl cis-trans isomerase
341225	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
341226	Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold
341228	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
341230	Arginosuccinate synthase. This family contains a PP-loop motif
341245	Eukaryotic ribosomal protein L18
341254	Aminotransferase class I and II
341256	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
341276	7 transmembrane receptor (rhodopsin family)
341277	Trypsin
341277	CUB domain
341278	7 transmembrane receptor (rhodopsin family)
341279	7 transmembrane receptor (rhodopsin family)
341287	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
341290	Ribosomal protein L6e
341301	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
341305	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
341306	Ribosomal L15
341308	Eukaryotic porin
341313	Cytochrome c oxidase subunit Vb
341315	Poly-adenylate binding protein, unique domain
341315	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
341316	Ribosomal protein L44
341321	Ribosomal protein L21e
341341	Retroviral aspartyl protease. Single domain aspartyl proteases from retroviruses, retrotransposons, and badnaviruses (plant dsDNA viruses). These proteases are generally part of a larger polyprotein
341341	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
341346	tRNA synthetases class I (R). Other tRNA synthetase sub-families are too dissimilar to be included. This family includes only arginyl tRNA synthetase
341350	Trypsin
341350	CUB domain
341356	Ribosomal protein L31e
341359	C2 domain
341376	Dynein heavy chain. This family represents the C-terminal region of dynein heavy chain. The chain also contains ATPase activity and microtubule binding ability and acts as a motor for the movement of organelles and vesicles along microtubules. Dynein is a
341383	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
341392	AMP-binding enzyme
341399	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
341404	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
341405	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
341407	Intermediate filament protein
341408	Intermediate filament protein
341409	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
341409	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
341412	Ribosomal protein L31e
341414	Eukaryotic porin
341415	HSF-type DNA-binding
341416	7 transmembrane receptor (rhodopsin family)
341417	7 transmembrane receptor (rhodopsin family)
341418	7 transmembrane receptor (rhodopsin family)
341444	NAC domain
341448	Intermediate filament protein
341449	Intermediate filament protein
341457	Cyclophilin type peptidyl-prolyl cis-trans isomerase
341459	Chaperonin 10 Kd subunit
341465	Ribosomal protein S6e
341470	POT family. The POT (proton-dependent oligopeptide transport) family all appear to be proton dependent transporters
341472	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
341473	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
341479	Signal recognition particle 14kD protein. The signal recognition particle (SRP) is a multimeric protein involved in targeting secretory proteins to the rough endoplasmic reticulum membrane. SRP14 and SRP9 form a complex essential for SRP RNA binding
341483	Chaperonin 10 Kd subunit
341511	Ribosomal protein L23
341512	Ribosomal protein L13
341513	Tetraspanin family
341515	Ribosomal protein S26e
341525	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
341549	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
341555	Ribosomal protein L21e
341556	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
341556	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
341556	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
341558	Homeobox domain
341560	ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain
341565	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
341568	7 transmembrane receptor (rhodopsin family)
341578	Glypican
341583	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
341592	Tub family
341592	Hsp90 protein
341599	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
341604	Ribosomal L29 protein
341616	Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc f
341638	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
341643	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
341651	Lyase
341669	Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex bi
341669	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
341674	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
341676	Protein kinase domain
341680	Cadherin domain
341700	Peptidase C13 family. This family of peptidases is known as the hemoglobinase family because it contains a globin degrading enzyme from blood parasites. However relatives are found in plants and other organisms that have other functions. Members of this f
341704	Ribosomal protein S2
341706	DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions
341715	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
341720	MmgE/PrpD family. This family includes 2-methylcitrate dehydratase EC:4.2.1.79 (PrpD) that is required for propionate catabolism. It catalyses the third step of the 2-methylcitric acid cycle
341722	Double-stranded RNA binding motif. Putative motif shared by proteins that bind to dsRNA. At least some DSRM proteins seem to bind to specific RNA targets. Exemplified by Staufen, which is involved in localization of at least five different mRNAs in the ea
341728	Ribosomal protein L31e
341732	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
341752	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
341753	PXA domain. This domain is associated with PX domains pfam00787
341760	Cyclophilin type peptidyl-prolyl cis-trans isomerase
341761	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
341776	Ribosomal protein L23
341784	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
341795	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
341796	7 transmembrane receptor (rhodopsin family)
341799	7 transmembrane receptor (rhodopsin family)
341803	Pancreatic ribonuclease. Ribonucleases. Members include pancreatic RNAase A and angiogenins. Structure is an alpha+beta fold -- long curved beta sheet and three helices
341816	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
341831	Ribosomal L27e protein family. The N-terminal region of the eukaryotic ribosomal L27 has the KOW motif. C-terminal region is represented by this family
341834	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
341835	Amidase
341835	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
341845	14-3-3 protein
341848	Ribosomal protein S8
341856	7 transmembrane receptor (rhodopsin family)
341858	GGL domain. G-protein gamma like domains (GGL) are found in the gamma subunit of the heterotrimeric G protein complex and in regulators of G protein signaling (RGS) proteins
341862	ATP synthase
341864	HMG14 and HMG17
341864	Ribosomal protein L21e
341864	Transcription factor TFIID (or TATA-binding protein, TBP)
341868	Ribosomal protein L13
341893	Aminotransferase class I and II
341894	Domain found in IF2B/IF5. This family includes the N terminus of eIF-5, and the C terminus of eIF-2 beta. This region corresponds to the whole of the archaebacterial eIF-2 beta homolog. The region contains a putative zinc binding C4 finger
341897	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
341899	HMG (high mobility group) box
341902	Disintegrin
341902	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endop
341902	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
341905	PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins
341910	Ribosomal protein S5, C-terminal domain
341910	Ribosomal protein S5, N-terminal domain
341915	60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2
341922	Dynein light chain type 1
341925	Ribosomal protein S26e
341927	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
341949	Ribosomal L15
341963	Ribosomal protein L3
341965	Eukaryotic porin
341973	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
341973	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
341974	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
341974	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
342013	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
342015	Ribosomal protein L6
342032	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
342035	Olfactomedin-like domain
342035	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G
342035	Olfactomedin-like domain
342035	Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-
342037	Leo1-like protein. Members of this family are part of the Paf1/RNA polymerase II complex. The Paf1 complex probably functions during the elongation phase of transcription
342041	Ribosomal protein S2
342042	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
342042	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
342042	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
342052	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
342052	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
342053	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
342072	'Cold-shock' DNA-binding domain
342080	SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and
342092	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
342101	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
342103	Cyclophilin type peptidyl-prolyl cis-trans isomerase
342113	Uncharacterized ACR, COG1579
342113	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
342113	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
342116	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
342119	Transcription factor E2F/dimerisation partner (TDP). This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in associa
342138	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
342146	AIR synthase related protein, N-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The N-terminal domain of AIR synthase forms the d
342155	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
342155	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
342156	Glypican
342163	Dynamin GTPase effector domain
342183	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
342206	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
342209	Glypican
342209	ABC transporter transmembrane region. This family represents a unit of six transmembrane helices. Many members of the ABC transporter family (pfam00005) have two such regions
342209	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
342234	F-actin capping protein alpha subunit
342253	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
342261	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
342289	Lectin C-type domain. This family includes both long and short form C-type
342293	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
342315	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
342316	Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi
342331	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
342341	Cytochrome C and Quinol oxidase polypeptide I
342357	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
342357	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342357	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
342358	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
342370	Ribosomal L10
342372	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
342372	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
342372	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
342372	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
342372	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
342372	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
342374	Intermediate filament protein
342405	Cyclophilin type peptidyl-prolyl cis-trans isomerase
342407	Ribosomal protein L21e
342409	Homeobox domain
342419	Intermediate filament protein
342419	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
342423	Filamin/ABP280 repeat
342424	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
342427	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
342429	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
342432	Homeobox domain
342443	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
342443	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
342444	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
342452	Ribosomal protein L23
342460	Sulfotransferase protein
342509	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
342527	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
342531	Lipoxygenase
342531	PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this
342538	NAC domain
342541	Cyclophilin type peptidyl-prolyl cis-trans isomerase
342543	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
342550	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
342565	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342573	Intermediate filament protein
342574	Intermediate filament protein
342574	Intermediate filament protein
342575	Intermediate filament protein
342575	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
342578	HMG (high mobility group) box
342594	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
342599	Anticodon binding domain. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases it is probably the anticodon binding domain
342614	Uncharacterised protein family (UPF0185). This family contains a number of small uncharacterised proteins including BM-002
342615	Adenylosuccinate synthetase
342626	Protein kinase domain
342636	Trypsin
342658	Flavodoxin
342660	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
342661	14-3-3 protein
342662	C2 domain
342667	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
342667	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
342684	Uncharacterized ACR, COG1354
342684	Uncharacterized ACR, COG1579
342684	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
342684	Autophagy protein Apg6. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg6/Vps30p has two distinct functions in the autophagic process, either assoc
342695	Ubiquitin fusion degradation protein UFD1. Post-translational ubiquitin-protein conjugates are recognized for degradation by the ubiquitin fusion degradation (UFD) pathway. Several proteins involved in this pathway have been identified. This family includ
342697	metallopeptidase family M24
342705	NAD binding domain of 6-phosphogluconate dehydrogenase. The NAD binding domain of 6-phosphogluconate dehydrogenase adopts a Rossman fold
342705	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
342705	6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each
342732	Intermediate filament protein
342741	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
342779	Ribosomal protein L6
342780	Acyltransferase family. This family includes a range of acyltransferase enzymes. This domain is found in many as yet uncharacterised C. elegans proteins and it is approximately 300 amino acids long
342784	BSD domain. This domain contains a distinctive -FW- motif. It is found in a family of eukaryotic transcription factors as well as a set of proteins of unknown function
342808	Ribosomal protein S5, C-terminal domain
342808	Ribosomal protein S5, N-terminal domain
342827	PXA domain. This domain is associated with PX domains pfam00787
342839	Ribosomal L10
342842	Sodium:solute symporter family
342851	Sodium:solute symporter family
342854	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
342883	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
342885	Glycosyltransferase family 6
342886	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
342886	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
342886	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
342888	Protein kinase domain
342892	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342894	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342897	F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. One member is involved in binding to N-glycosylated
342900	Homeobox domain
342905	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
342908	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342911	Ribosomal protein S28e
342917	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342923	Ribosomal L39 protein
342923	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342926	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342931	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
342933	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
342933	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
342934	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
342934	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
342938	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342939	Homeobox domain
342943	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
342945	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
342958	Perilipin family. The perilipin family includes lipid droplet-associated protein (perilipin) and adipose differentiation-related protein (adipophilin)
342965	7 transmembrane receptor (rhodopsin family)
342966	7 transmembrane receptor (rhodopsin family)
342967	Actin
342969	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342970	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342972	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342982	Lectin C-type domain. This family includes both long and short form C-type
342984	DNA-dependent RNA polymerase. This is a family of single chain RNA polymerases
342989	HesB-like domain. This family includes HesB which may be involved in nitrogen fixation
342989	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342990	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342990	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
342991	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
342994	Ribosomal protein L34e
343002	Ribosomal protein L34e
343011	Chitin binding Peritrophin-A domain. This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruse
343015	Ribosomal protein S19
343018	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
343019	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
343024	Sushi domain (SCR repeat)
343033	S25 ribosomal protein
343042	Ribosomal protein S28e
343045	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
343047	Intermediate filament protein
343059	Ets-domain
343066	Carboxylesterase
343069	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
343082	Amidase
343088	Sushi domain (SCR repeat)
343094	B-box zinc finger
343095	Fibronectin type III domain
343095	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
343113	Uricase
343114	Uricase
343119	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
343122	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
343128	ADP-ribosylation factor family
343128	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
343133	7 transmembrane receptor (rhodopsin family)
343141	Mitochondrial carrier protein
343142	Signal recognition particle 14kD protein. The signal recognition particle (SRP) is a multimeric protein involved in targeting secretory proteins to the rough endoplasmic reticulum membrane. SRP14 and SRP9 form a complex essential for SRP RNA binding
343145	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
343146	Ribosomal protein S19
343151	Aminotransferase class I and II
343161	HMG (high mobility group) box
343162	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
343164	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
343165	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
343169	7 transmembrane receptor (rhodopsin family)
343170	7 transmembrane receptor (rhodopsin family)
343171	7 transmembrane receptor (rhodopsin family)
343172	7 transmembrane receptor (rhodopsin family)
343173	7 transmembrane receptor (rhodopsin family)
343176	Core histone H2A/H2B/H3/H4
343181	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
343184	Ribosomal protein S5, C-terminal domain
343184	Ribosomal protein S5, N-terminal domain
343186	Cofilin/tropomyosin-type actin-binding protein. Severs actin filaments and binds to actin monomers
343187	Calreticulin family
343190	CUB domain
343205	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
343230	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
343230	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
343233	Zinc-binding dehydrogenase
343259	Hydantoinase/oxoprolinase. This family includes the enzymes hydantoinase and oxoprolinase EC:3.5.2.9. Both reactions involve the hydrolysis of 5-membered rings via hydrolysis of their internal imide bonds
343259	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
343263	Fibronectin type III domain
343290	ICE-like protease (caspase) p20 domain
343295	Ribosomal protein L14p/L23e
343295	SKIP/SNW domain. This domain is found in chromatin proteins
343296	Zinc-binding dehydrogenase
343297	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
343300	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
343301	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
343311	Ribosomal protein S14p/S29e. This family includes both ribosomal S14 from prokaryotes and S29 from eukaryotes
343320	Yippee putative zinc-binding protein
343324	Translation initiation factor SUI1
343326	Intermediate filament protein
343332	Polyprenyl synthetase
343337	Ribosomal protein L21e
343338	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
343338	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
343342	CUB domain
343342	Sushi domain (SCR repeat)
343353	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
343354	Actin
343363	NAC domain
343381	CobN/Magnesium Chelatase. This family contains a domain common to the cobN protein and to magnesium protoporphyrin chelatase. CobN is implicated in the conversion of precorrin-2 to cobyrinic acid. Magnesium protoporphyrin chelatase is involved in chloroph
343384	Cyclophilin type peptidyl-prolyl cis-trans isomerase
343388	Ribosomal protein L31e
343406	7 transmembrane receptor (rhodopsin family)
343407	Hsp90 protein
343408	HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758
343409	7 transmembrane receptor (rhodopsin family)
343429	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
343429	Ribosomal protein L4/L1 family. This family includes Ribosomal L4/L1 from eukaryotes and archaebacteria and L4 from eubacteria. L4 from yeast has been shown to bind rRNA
343433	Amidase
343448	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
343448	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
343448	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
343450	Calcium-activated BK potassium channel alpha subunit
343450	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
343472	Homeobox domain
343476	Ribosomal L39 protein
343477	Hsp90 protein
343479	Ribosomal protein L23
343479	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
343479	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
343489	Kinesin motor domain
343495	Ribosomal protein L6e
343498	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
343498	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
343499	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
343499	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
343500	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
343502	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
343503	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
343503	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
343508	Aconitase family (aconitate hydratase)
343508	Aconitase C-terminal domain. Members of this family usually also match to pfam00330. This domain undergoes conformational change in the enzyme mechanism
343510	Intermediate filament protein
343515	ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit
343521	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
343523	Amidase
343529	Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes
343552	Fork head domain
343563	7 transmembrane receptor (rhodopsin family)
343578	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
343585	Intermediate filament protein
343587	SCP-like extracellular protein. This domain is also found in prokaryotes
343588	Ribosomal L37ae protein family. This ribosomal protein is found in archaebacteria and eukaryotes. It contains four conserved cysteine residues that may bind to zinc
343591	Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure
343593	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
343602	Ribosomal protein S2
343602	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
343638	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
343641	Transglutaminase family
343641	Transglutaminase family, C-terminal ig like domain
343641	Transglutaminase-like superfamily. This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that center around conserved cysteine, histidine,
343651	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
343654	Cadherin domain
343667	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
343671	Cyclophilin type peptidyl-prolyl cis-trans isomerase
343683	Ribosomal protein L24e
343701	Interferon regulatory factor transcription factor. This family of transcription factors are important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. Three of the five conserved trypt
343705	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
343708	HMG (high mobility group) box
343718	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
343720	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
343729	Eukaryotic porin
343729	Ribosomal protein S26e
343744	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
343744	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
343745	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
343745	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
343749	Glutathione peroxidase
343750	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
343762	7 transmembrane receptor (rhodopsin family)
343807	Cytochrome C and Quinol oxidase polypeptide I
343822	Actin
343822	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
343824	Phosphoinositide 3-kinase family, accessory domain (PIK domain). PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation
343830	Eukaryotic-type carbonic anhydrase
343831	Protein kinase domain
343839	Sodium/hydrogen exchanger family. Na/H antiporters are key transporters in maintaining the pH of actively metobolizing cells. The molecular mechanisms of antiport are unclear. These antiporters contain 10-12 transmembrane regions (M) at the amino-terminus
343851	Carboxylesterase
343854	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
343854	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
343859	Ribosomal protein S28e
343868	Calponin family repeat
343868	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
343870	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
343875	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
343876	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
343924	7 transmembrane receptor (rhodopsin family)
343930	Helix-loop-helix DNA-binding domain
343943	WD domain, G-beta repeat
343946	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
343948	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
343954	NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane
343957	GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue
343958	7 transmembrane receptor (rhodopsin family)
343974	Doublecortin
343981	Glutamine synthetase, beta-Grasp domain
343981	Glutamine synthetase, catalytic domain
343995	RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer
344016	BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer
344017	Mitochondrial carrier protein
344018	Helix-loop-helix DNA-binding domain
344022	Homeobox domain
344026	Ribosomal protein S28e
344056	DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh
344056	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
344065	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
344085	Rad52/22 family double-strand break repair protein
344108	Domain of unknown function DUF130. This family has no known function, it consists of C. elegans proteins and is present as a repeat in some members. The aligned region has 4 conserved cysteine residues and is a maximum of 175 residues long
344138	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
344152	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
344160	NADH dehydrogenase
344160	Cytochrome C and Quinol oxidase polypeptide I
344163	Apolipoprotein A1/A4/E family. These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E
344167	Fork head domain
344171	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
344172	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
344177	Ribosomal protein L31e
344178	Cyclophilin type peptidyl-prolyl cis-trans isomerase
344179	Hsp90 protein
344181	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
344190	Thymosin beta-4 family
344191	Homeobox domain
344195	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
344222	Cyclophilin type peptidyl-prolyl cis-trans isomerase
344227	Actin
344228	Uncharacterised protein family (UPF0120)
344228	Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport
344231	Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included
344254	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
344262	7 transmembrane receptor (rhodopsin family)
344263	Ribosomal S3Ae family
344284	Homeobox domain
344286	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
344287	Putative zinc finger in N-recognin
344292	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
344299	Cyclophilin type peptidyl-prolyl cis-trans isomerase
344318	Intermediate filament protein
344320	Intermediate filament protein
344320	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
344328	TPR Domain
344357	Cyclophilin type peptidyl-prolyl cis-trans isomerase
344371	Mitochondrial carrier protein
344382	WD domain, G-beta repeat
344387	Protein kinase domain
344394	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
344401	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes.
344403	HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localize to membranes.
344405	YbaK / prolyl-tRNA synthetases associated domain. This domain of unknown function is found in numerous prokaryote organisms. The structure of YbaK shows a novel fold. This domain also occurs in a number of prolyl-tRNA synthetases (proRS) from prokaryotes.
344423	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
344424	Ribosomal protein S24e
344424	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
344427	Ribosomal protein L24e
344431	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
344454	CO dehydrogenase flavoprotein C-terminal domain
344454	FAD binding domain in molybdopterin dehydrogenase
344454	Aldehyde oxidase and xanthine dehydrogenase, a/b hammerhead domain
344454	Aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding domain
344458	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
344462	Intermediate filament protein
344462	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
344471	Ribosomal protein L11, RNA binding domain
344471	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
344473	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
344474	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
344494	E1-E2 ATPase
344498	IQ calmodulin-binding motif. Calmodulin-binding motif
344510	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
344516	Ubiquitin carboxyl-terminal hydrolase family 2
344543	Ribosomal L22e protein family
344557	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
344558	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
344559	Ribosomal L22e protein family
344561	7 transmembrane receptor (rhodopsin family)
344565	SH2 domain
344568	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
344570	Transferrin
344572	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain
344572	pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
344574	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
344593	SH2 domain
344593	Protein-tyrosine phosphatase
344596	Citrate synthase
344620	WD domain, G-beta repeat
344622	Beta/Gamma crystallin. The alignment comprises two Greek key motifs since the similarity between them is very low
344653	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
344658	SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as
344666	Ribosomal Proteins L2, C-terminal domain
344673	Cytochrome c oxidase subunit III
344673	NADH-ubiquinone/plastoquinone oxidoreductase, chain 3
344682	Elongation factor 1 gamma, conserved domain
344682	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
344682	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
344690	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
344694	HMG (high mobility group) box
344702	Homeobox domain
344707	PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15)
344715	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
344716	7 transmembrane receptor (rhodopsin family)
344717	7 transmembrane receptor (rhodopsin family)
344718	7 transmembrane receptor (rhodopsin family)
344719	Ribosomal protein L23
344726	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
344728	7 transmembrane receptor (rhodopsin family)
344737	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
344741	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
344744	FKBP-type peptidyl-prolyl cis-trans isomerase
344752	Carboxylesterase
344758	7 transmembrane receptor (rhodopsin family)
344758	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
344758	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
344760	7 transmembrane receptor (metabotropic glutamate family)
344760	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
344762	Intermediate filament protein
344765	Ribosomal protein L35Ae
344767	7 transmembrane receptor (rhodopsin family)
344775	Aminotransferase class-V
344787	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
344788	Ribosomal protein S2
344797	Cyclophilin type peptidyl-prolyl cis-trans isomerase
344805	Trypsin
344805	CUB domain
344811	'Cold-shock' DNA-binding domain
344813	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
344813	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
344819	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
344819	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
344820	SH2 domain
344821	Cyclophilin type peptidyl-prolyl cis-trans isomerase
344829	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
344838	Uncharacterised protein family (Hly-III / UPF0073). Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It ha
344866	Intermediate filament protein
344875	von Willebrand factor type A domain
344880	Eukaryotic initiation factor 4E
344890	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
344905	E1-E2 ATPase
344921	GTP1/OBG family
344936	Cadherin domain
344946	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
344951	7 transmembrane receptor (rhodopsin family)
344953	Cyclin-dependent kinase 5 activator protein
344967	Cytosolic long-chain acyl-CoA thioester hydrolase. This family consist of various cytosolic long-chain acyl-CoA thioester hydrolases including human and rat. The aligned region is repeated with in the sequence of human and rat cytosolic long-chain acyl-Co
344973	Sulfatase
344978	Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectri
344988	ADP-ribosylation factor family
344998	Cyclophilin type peptidyl-prolyl cis-trans isomerase
345001	3-oxo-5-alpha-steroid 4-dehydrogenase. This family consists of 3-oxo-5-alpha-steroid 4-dehydrogenases, EC:1.3.99.5 Also known as Steroid 5-alpha-reductase, the reaction catalysed by this enzyme is: 3-oxo-5-alpha-steroid + acceptor <=> 3-oxo-delta(4)-stero
345012	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
345019	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
345019	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
345019	Elongation factor Tu C-terminal domain. Elongation factor Tu consists of three structural domains, this is the third domain. This domain adopts a beta barrel structure. This the third domain is involved in binding to both charged tRNA and binding to EF-Ts
345028	Hsp90 protein
345028	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
345035	PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family
345041	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
345051	Reeler domain
345062	Trypsin
345076	Chaperonin 10 Kd subunit
345079	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
345083	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
345095	Ribosomal protein S12
345097	Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29
345100	Trypsin
345100	C2 domain
345101	Trypsin
345101	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
345121	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
345131	Helix-loop-helix DNA-binding domain
345132	Ribosomal L22e protein family
345136	Mitochondrial carrier protein
345157	Hsp90 protein
345157	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
345174	Ribosomal protein L21e
345192	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
345203	7 transmembrane receptor (rhodopsin family)
345212	7 transmembrane receptor (rhodopsin family)
345217	Sas10/Utp3 family. This family contains Sas10 which hash been identified as a regulator of chromatin silencing. The family also contains Utp3 a component of the U3 ribonucleoprotein complex. The exact molecular function of this family is unknown
345217	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
345217	V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryo
345248	Hsp90 protein
345249	Glycosyl transferase. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids
345255	Cytochrome c oxidase subunit Vb
345258	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
345261	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
345262	Ribosomal protein S6e
345273	Zinc-binding dehydrogenase
345274	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa
345275	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
345275	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
345288	Serum albumin family
345302	Ribosomal protein S5, C-terminal domain
345302	Ribosomal protein S5, N-terminal domain
345305	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
345310	ADP-ribosylation factor family
345310	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
345315	GMP synthase C terminal domain. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. This family is the C-terminal domain specific to the GMP synthases EC:6.3.5.2. In prokaryotes this domain mediates dimerisation. E
345319	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
345348	Skp1 family, dimerisation domain
345348	Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play
345355	alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes
345358	Tctex-1 family. Tctex-1 is a dynein light chain. It has been shown that Tctex-1 can bind to the cytoplasmic tail of rhodopsin. C-terminal rhodopsin mutations responsible for retinitis pigmentosa inhibit this interaction
345363	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
345369	Family 4 glycosyl hydrolase
345372	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
345375	Herpesvirus transcription activation factor (transactivator). This family includes EBV BRLF1 and similar ORF 50 proteins from other herpesviruses
345378	UDP-glucoronosyl and UDP-glucosyl transferase
345379	UDP-glucoronosyl and UDP-glucosyl transferase
345381	UDP-glucoronosyl and UDP-glucosyl transferase
345381	Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contai
345392	Ribosomal protein L6e
345396	Intermediate filament protein
345412	Chaperonin 10 Kd subunit
345420	Xanthomonas avirulence protein, Avr/PthA
345430	Intermediate filament protein
345443	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
345456	Profilin
345462	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
345463	'Cold-shock' DNA-binding domain
345466	6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP. T
345487	Ribosomal protein L21e
345505	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
345507	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
345513	Intermediate filament protein
345528	Kinesin motor domain
345537	TRAF-type zinc finger
345537	MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode C. elegans
345540	Arginosuccinate synthase. This family contains a PP-loop motif
345547	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
345557	Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit
345561	Cyclophilin type peptidyl-prolyl cis-trans isomerase
345571	RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signaling pathways
345571	Reovirus viral attachment protein sigma 1. This family consists of the reovirus sigma 1 hemagglutinin, cell attachment protein. This glycoprotein is a minor capsid protein and also determins the serotype-specific humoral immune response. Sigma 1 consist o
345571	FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positi
345573	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
345587	Vinculin family
345589	Ribosomal S17
345621	Cyclophilin type peptidyl-prolyl cis-trans isomerase
345630	Fibrillarin
345637	Arginosuccinate synthase. This family contains a PP-loop motif
345637	tRNA methyl transferase. This family represents tRNA(5-methylaminomethyl-2-thiouridine)-methyltransferase which is involved in the biosynthesis of the modified nucleoside 5-methylaminomethyl-2-thiouridine present in the wobble position of some tRNAs
345645	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
345651	Actin
345654	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
345659	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
345663	Actin
345678	'Cold-shock' DNA-binding domain
345687	Ribosomal L39 protein
345688	Ribosomal protein L15
345711	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
345716	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
345720	Actin
345751	Intermediate filament protein
345754	ATP synthase A chain
345754	Cytochrome c oxidase subunit III
345754	Cytochrome C and Quinol oxidase polypeptide I
345754	NADH-ubiquinone/plastoquinone oxidoreductase, chain 3
345754	Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
345754	Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
345761	PH domain. PH stands for pleckstrin homology
345768	Actin
345774	Ribosomal S3Ae family
345774	HMG (high mobility group) box
345775	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
345779	Protein of unknown function (DUF343). This family of short proteins have no known function. The bacterial members are about 60-70 amino acids in length and the eukaryotic examples are about 120 amino acids in length. The C terminus contains the strongest
345781	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
345783	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
345783	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
345799	Ribosomal L10
345799	Ribosomal S3Ae family
345818	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
345820	Cullin family
345829	NAC domain
345841	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
345850	Non-repetitive/WGA-negative nucleoporin. This is a family of nucleoporin proteins. Nucleoporins are the main components of the nuclear pore complex in eukaryotic cells, and mediate bidirectional nucleocytoplasmic transport, especially of mRNA and proteins
345864	'Cold-shock' DNA-binding domain
345874	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
345882	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
345884	Intermediate filament protein
345884	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
345885	metallopeptidase family M24
345893	'chromo' (CHRromatin Organization MOdifier) domain
345893	Chromo shadow domain. This domain is distantly related to pfam00385. This domain is always found in association with a chromo domain
345922	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
345922	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
345923	HMG (high mobility group) box
345925	NAC domain
345930	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
345947	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
345982	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
345988	pfam02887, PK_C, Pyruvate kinase, alpha/beta domain
345988	Pyruvate kinase, barrel domain. This domain of the is actually a small beta-barrel domain nested within a larger TIM barrel. The active site is found in a cleft between the two domains
345993	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
345994	Isocitrate/isopropylmalate dehydrogenase
346004	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
346005	EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains.
346010	Intermediate filament protein
346012	Thymosin beta-4 family
346024	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
346024	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
346033	HMG (high mobility group) box
346036	Intermediate filament protein
346036	Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex
346036	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
346037	Translationally controlled tumor protein
346042	Connexin
346072	Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta
346085	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
346085	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
346096	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
346113	P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function
346123	Ribosomal protein L21e
346129	Intermediate filament protein
346130	Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines
346149	Core histone H2A/H2B/H3/H4
346149	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
346162	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
346167	7 transmembrane receptor (rhodopsin family)
346168	7 transmembrane receptor (rhodopsin family)
346169	7 transmembrane receptor (rhodopsin family)
346170	7 transmembrane receptor (rhodopsin family)
346171	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
346171	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
346173	CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP
346177	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
346189	Cytosol aminopeptidase family, N-terminal domain
346189	Cytosol aminopeptidase family, catalytic domain. The two associated zinc ions and the active site are entirely enclosed within the C-terminal catalytic domain in leucine aminopeptidase
346190	ADP-ribosylation factor family
346190	7 transmembrane receptor (rhodopsin family)
346190	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
346191	Ribosomal protein L11, RNA binding domain
346191	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
346207	Ribosomal L29e protein family
346217	Patched family. The transmembrane protein Patched is a receptor for the morphogene Sonic Hedgehog. This protein associates with the smoothened protein to transduce hedgehog signals
346225	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
346230	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
346232	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
346233	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
346234	Ribosomal protein L31e
346239	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
346248	NADH-ubiquinone/plastoquinone oxidoreductase chain 6
346248	Glycosyl hydrolases family 2, sugar binding domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities and has a jelly-roll fold
346249	NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane
346250	NADH-ubiquinone/plastoquinone oxidoreductase chain 6
346250	Glycosyl hydrolases family 2, sugar binding domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities and has a jelly-roll fold
346252	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
346284	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
346286	ADP-ribosylation factor family
346286	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
346288	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
346294	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
346295	14-3-3 protein
346317	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
346319	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
346321	Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic do
346322	Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown
346322	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
346323	Ribosomal protein L24e
346325	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
346327	Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown
346327	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
346329	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
346355	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
346359	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
346362	7 transmembrane receptor (rhodopsin family)
346363	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
346379	Uncharacterized protein family UPF0007
346402	Ribosomal protein L11, RNA binding domain
346404	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
346412	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
346416	HMG (high mobility group) box
346423	Cyclophilin type peptidyl-prolyl cis-trans isomerase
346446	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
346449	7 transmembrane receptor (rhodopsin family)
346453	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
346454	Protein kinase domain
346472	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
346476	ADP-ribosylation factor family
346476	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
346517	7 transmembrane receptor (rhodopsin family)
346521	HIT family
346522	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
346522	AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this
346524	7 transmembrane receptor (rhodopsin family)
346525	7 transmembrane receptor (rhodopsin family)
346528	7 transmembrane receptor (rhodopsin family)
346542	Protein kinase domain
346546	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
346559	Ribosomal L10
346562	ADP-ribosylation factor family
346562	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
346564	Eukaryotic porin
346565	Cadherin domain
346565	Eukaryotic initiation factor 4E
346566	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain
346566	pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
346600	Clathrin adaptor complex small chain
346606	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
346606	Ribosomal protein S2
346606	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
346643	Ribosomal protein S5, C-terminal domain
346643	Ribosomal protein S5, N-terminal domain
346644	Cyclophilin type peptidyl-prolyl cis-trans isomerase
346668	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
346671	IBR domain. The IBR (In Between Ring fingers) domain is found to occur between pairs of ring fingers (pfam00097). The function of this domain is unknown. This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain
346674	Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predict
346682	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
346702	Trypsin
346706	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
346707	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
346707	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
346708	7 transmembrane receptor (rhodopsin family)
346711	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
346722	C2 domain
346723	ADP-ribosylation factor family
346730	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
346741	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuron
346762	Protein kinase domain
346762	Ribosomal protein L21e
346802	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
346848	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
346853	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
346856	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
346867	G10 protein
346884	Ribosomal protein S6e
346884	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
346887	Sugar (and other) transporter
346897	Protein of unknown function (DUF425). Family member HYNA is the product of a novel gene expressed in human liver cancer tissue
346904	CUB domain
346905	CUB domain
346905	Sushi domain (SCR repeat)
346931	Adenylate kinase
346931	Ribosomal protein S26e
346950	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
346971	tRNA synthetases class II (D, K and N). Other tRNA synthetase sub-families are too dissimilar to be included
346986	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
346988	BAG domain. Domain present in Hsp70 regulators
346989	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
346990	Core histone H2A/H2B/H3/H4
346991	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
346998	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
347013	Ribosomal protein L14p/L23e
347028	Peptidase family M41
347028	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
347031	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
347038	TMS membrane protein/tumour differentially expressed protein (TDE)
347046	HMG (high mobility group) box
347048	Inositol monophosphatase family
347048	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
347051	Sodium Bile acid symporter family. This family consists of Na+/bile acid co-transporters. These transmembrane proteins function in the liver in the uptake of bile acids from portal blood plasma a process mediated by the co-transport of Na+. Also in the fa
347055	Tropomyosin
347074	Ribosomal protein L34e
347085	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
347088	7 transmembrane receptor (Secretin family)
347088	7 transmembrane receptor (Secretin family)
347088	Pentaxin family. Pentaxins are also known as pentraxins
347088	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
347099	Adenylate kinase
347115	Fork head domain
347122	Cytochrome C and Quinol oxidase polypeptide I
347165	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
347166	Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltransferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcN
347168	7 transmembrane receptor (rhodopsin family)
347169	7 transmembrane receptor (rhodopsin family)
347177	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
347179	Ribosomal S3Ae family
347206	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
347207	Protein kinase domain
347218	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
347233	Mitochondrial carrier protein
347235	Arginosuccinate synthase. This family contains a PP-loop motif
347240	Kinesin motor domain
347252	Kazal-type serine protease inhibitor domain. Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tande
347263	Microtubule associated protein (MAP65/ASE1 family)
347263	IncA protein. Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation
347263	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
347264	Intermediate filament protein
347265	Intermediate filament protein
347265	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
347287	Cyclophilin type peptidyl-prolyl cis-trans isomerase
347288	Ribosomal protein L44
347292	Ribosomal protein L36e
347295	'Cold-shock' DNA-binding domain
347305	Protein kinase domain
347314	7 transmembrane receptor (rhodopsin family)
347316	7 transmembrane receptor (rhodopsin family)
347318	Ribosomal protein L21e
347319	Protein-tyrosine phosphatase
347333	Intermediate filament protein
347336	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
347337	Actin
347344	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
347345	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
347346	Aminotransferase class-III
347347	Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily
347348	Hsp20/alpha crystallin family
347358	YDG/SRA domain. The function of this domain is unknown, it contains a conserved motif YDG after which it has been named
347362	Raf-like Ras-binding domain
347362	Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain
347363	Kinesin motor domain
347365	von Willebrand factor type A domain
347376	Core histone H2A/H2B/H3/H4
347381	Thiolase, C-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
347381	Thiolase, N-terminal domain. Thiolase is reported to be structurally related to beta-ketoacyl synthase (pfam00109), and also chalcone synthase
347381	Beta-ketoacyl synthase, N-terminal domain. The structure of beta-ketoacyl synthase is similar to that of the thiolase family (pfam00108) and also chalcone sythase. The active site of beta-ketoacyl synthase is located between the N and C-terminal domains.
347386	NADH dehydrogenase
347393	Ets-domain
347393	Core histone H2A/H2B/H3/H4
347395	Core histone H2A/H2B/H3/H4
347400	DnaJ C terminal region. This family consists of the C terminal region form the DnaJ protein. Although the function of this region is unknown, it is always found associated with pfam00226 and pfam00684. DnaJ is a chaperone associated with the Hsp70 heat-sh
347400	DnaJ central domain (4 repeats). The central cysteine-rich (CR) domain of DnaJ proteins contains four repeats of the motif CXXCXGXG where X is any amino acid. The isolated cysteine rich domain folds in zinc dependent fashion. Each set of two repeats binds
347403	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
347411	Uncharacterised protein family (UPF0041)
347418	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
347419	GrpE
347420	K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerization domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly r
347421	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
347422	N2,N2-dimethylguanosine tRNA methyltransferase. This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA. The TRM1 gene of Saccharomyces cerevisiae is necessary for the N2,N2-dimethylguanosine modification of both mitochondrial and cytoplasmic tRNAs. The e
347428	TPR Domain
347441	Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins
347442	WD domain, G-beta repeat
347442	Microtubule associated protein 1A/1B, light chain 3. Light chain 3 is proposed to function primarily as a subunit of microtubule associated proteins 1A and 1B and that its expression may regulate microtubule binding activity
347449	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
347452	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
347454	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
347458	metallopeptidase family M24
347468	7 transmembrane receptor (rhodopsin family)
347480	PWI domain
347492	Rhabdovirus Non-virion protein. Infectious hematopoietic necrosis virus (IHNV) is a member of the family Rhabdoviridae. The non-virion protein (NV) is coded for by one of the six genes of the IHNV genome, but is absent in vesiculovirus -like rhabdovirus
347504	Proteasome A-type and B-type
347512	HSF-type DNA-binding
347516	Diacylglycerol acyltransferase. The terminal step of triacylglycerol (TAG) formation is catalysed by the enzyme diacylglycerol acyltransferase (DAGAT)
347517	ADP-ribosylation factor family
347517	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
347527	Sulfatase
347530	pfam02883, Alpha_adaptinC2, Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
347530	Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pi
347541	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
347542	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
347544	Ribosomal L18ae protein family
347556	Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14
347572	WAP-type (Whey Acidic Protein) 'four-disulfide core'
347573	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
347582	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
347583	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
347598	Trypsin
347598	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
347604	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
347617	ADP-ribosylation factor family
347617	7 transmembrane receptor (rhodopsin family)
347624	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
347628	Ribosomal protein S2
347632	7 transmembrane receptor (rhodopsin family)
347634	7 transmembrane receptor (rhodopsin family)
347635	7 transmembrane receptor (rhodopsin family)
347636	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
347636	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
347639	CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP
347649	Class I Histocompatibility antigen, domains alpha 1 and 2
347650	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
347653	Ribosomal protein S2
347655	7 transmembrane receptor (rhodopsin family)
347657	7 transmembrane receptor (rhodopsin family)
347659	7 transmembrane receptor (rhodopsin family)
347660	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
347660	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
347661	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
347663	Zinc finger, C3HC4 type (RING finger)
347665	CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP
347670	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
347722	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
347722	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
347722	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
347722	Ankyrin repeat. Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
347722	Putative GTP-ase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs
347722	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
347722	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
347731	Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
347732	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
347735	TMS membrane protein/tumour differentially expressed protein (TDE)
347736	Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active center disulfide bond. Some members with only the active site are not separated from the noise
347741	Protein of unknown function, DUF270
347741	Protein of unknown function, DUF270
347754	Homeobox domain
347755	Homeobox domain
347756	Homeobox domain
347757	Homeobox domain
347761	Ribosomal protein S7e
347765	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
347766	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
347773	CUB domain
347777	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
347779	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
347784	Homeobox domain
347785	Homeobox domain
347787	Homeobox domain
347788	Homeobox domain
347789	Homeobox domain
347790	Homeobox domain
347791	Homeobox domain
347792	Homeobox domain
347796	Intermediate filament protein
347797	Actin
347798	ATP synthase subunit C
347799	Cytochrome C and Quinol oxidase polypeptide I
347799	Cytochrome C oxidase subunit II, transmembrane domain. The N-terminal domain of cytochrome C oxidase contains two transmembrane alpha-helices
347800	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
347801	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
347808	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
347809	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
347812	Uncharacterised protein family (UPF0100). This family of proteins are periplasmic binding proteins that includes ModA the molybdate-binding protein. Some members of this family are in the UPF0100 family
347812	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
347813	Cell division protein. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP
347822	7 transmembrane receptor (rhodopsin family)
347827	7 transmembrane receptor (rhodopsin family)
347828	Eukaryotic porin
347835	7 transmembrane receptor (rhodopsin family)
347837	Intermediate filament protein
347840	Ribosomal protein S27
347844	7 transmembrane receptor (rhodopsin family)
347845	7 transmembrane receptor (rhodopsin family)
347849	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
347853	T-box. The T-box encodes a 180 amino acid domain that binds to DNA
347856	Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anch
347860	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
347862	DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. This family appears to be distantly related to Glutamine amidotransferases pfam00117
347867	Zinc finger, C3HC4 type (RING finger)
347868	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
347869	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
347870	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
347871	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
347873	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
347875	Ribosomal protein L44
347876	Serum amyloid A protein
347878	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
347885	7 transmembrane receptor (rhodopsin family)
347886	7 transmembrane receptor (rhodopsin family)
347890	7 transmembrane receptor (rhodopsin family)
347891	7 transmembrane receptor (rhodopsin family)
347892	7 transmembrane receptor (rhodopsin family)
347894	Ribosomal Proteins L2, C-terminal domain
347897	Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle
347898	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
347904	SPFH domain / Band 7 family. This family includes proteins with high blast scores to known Band 7 protein: HflC from E. coli, HflK from E. coli, Prohibitin family members
347904	SCO1/SenC. This family is involved in biogenesis of respiratory and photosynthetic systems. SCO1 is required for a post-translational step in the accumulation of subunits COXI and COXII of cytochrome c oxidase. SenC is required for optimal cytochrome c ox
347907	Intermediate filament protein
347908	Intermediate filament protein
347911	7 transmembrane receptor (rhodopsin family)
347912	7 transmembrane receptor (rhodopsin family)
347914	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
347915	14-3-3 protein
347921	Vacuolar protein sorting-associated protein 35. Vacuolar protein sorting-associated protein (Vps) 35 is one of around 50 proteins involved in protein trafficking. In particular, Vps35 assembles into a retromer complex with at least four other proteins Vps
347922	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
347925	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
347930	Helix-loop-helix DNA-binding domain
347931	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
347936	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
347936	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
347937	Intermediate filament protein
347937	BTG1 family. A novel family of anti-proliferative proteins
347937	Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region
347938	Alpha-2-macroglobulin family. This family includes the C-terminal region of the alpha-2-macroglobulin family
347941	Ribosomal protein L19e
347942	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
347944	Mitochondrial import inner membrane translocase subunit Tim17. The pre-protein translocase of the mitochondrial outer membrane (Tom) allows the import of pre-proteins from the cytoplasm. Tom forms a complex with a number of proteins, including Tim17. Tim1
347947	Domain of unknown function (DUF341)
347947	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
347947	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
347948	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
347960	Ribosomal protein L35Ae
347961	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
347962	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
347966	Galactoside-binding lectin
347967	Core histone H2A/H2B/H3/H4
347969	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
347970	DNA directed RNA polymerase, 7 kDa subunit
347972	Mitochondrial carrier protein
347974	ADP-ribosylation factor family
347975	Homeobox domain
347986	7 transmembrane receptor (rhodopsin family)
347987	7 transmembrane receptor (rhodopsin family)
347988	Ribosomal protein L13
347988	7 transmembrane receptor (rhodopsin family)
347992	Ribosomal L15
347994	ADP-ribosylation factor family
348000	Class II histocompatibility antigen, alpha domain
348001	7 transmembrane receptor (rhodopsin family)
348002	7 transmembrane receptor (rhodopsin family)
348003	Ribosomal protein L13
348003	7 transmembrane receptor (rhodopsin family)
348005	Zinc finger, C3HC4 type (RING finger)
348008	Ribosomal L15
348016	Gamma-glutamyltranspeptidase
348019	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
348026	Ribosomal protein S21e
348028	7 transmembrane receptor (rhodopsin family)
348029	7 transmembrane receptor (rhodopsin family)
348030	7 transmembrane receptor (rhodopsin family)
348031	7 transmembrane receptor (rhodopsin family)
348032	7 transmembrane receptor (rhodopsin family)
348033	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
348034	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
348036	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
348037	Intermediate filament protein
348038	Cyclophilin type peptidyl-prolyl cis-trans isomerase
348039	Eukaryotic ribosomal protein L18
348040	PH domain. PH stands for pleckstrin homology
348042	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
348045	Ribosomal protein S8
348048	Ribosomal protein L24e
348050	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
348052	Cytochrome b(N-terminal)/b6/petB
348053	Ubiquitin carboxyl-terminal hydrolase family 2
348054	Ribosomal protein L21e
348055	Ribosomal protein L21e
348071	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
348074	Mpp10 protein. This family includes proteins related to Mpp10 (M phase phosphoprotein 10). The U3 small nucleolar ribonucleoprotein (snoRNP) is required for three cleavage events that generate the mature 18S rRNA from the pre-rRNA. In Saccharomyces cerevi
348076	Actin
348077	Dynamin GTPase effector domain
348078	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
348078	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
348079	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
348080	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
348081	Ribosomal protein S19
348085	NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane
348087	S-adenosyl-L-homocysteine hydrolase
348092	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
348093	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
348094	Death domain
348094	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
348095	BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of p
348095	Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that one member is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet.
348100	Dynamin GTPase effector domain
348101	Intermediate filament protein
348104	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
348105	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
348111	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
348114	Disintegrin
348114	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not
348146	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
348148	Cyclic nucleotide-binding domain
348148	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
348150	ATP synthase A chain
348150	NADH-ubiquinone/plastoquinone oxidoreductase, chain 3
348158	AMP-binding enzyme
348159	AMP-binding enzyme
348163	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
348163	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
348163	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
348174	Lectin C-type domain. This family includes both long and short form C-type
348174	SCP-like extracellular protein. This domain is also found in prokaryotes
348178	Actin
348180	Uncharacterized protein family UPF0021
348186	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
348189	Kinesin motor domain
348191	Lectin C-type domain. This family includes both long and short form C-type
348200	Poly-adenylate binding protein, unique domain
348201	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
348205	C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system
348209	Sodium:neurotransmitter symporter family
348214	Helix-loop-helix DNA-binding domain
348215	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
348217	Zinc finger, C3HC4 type (RING finger)
348224	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
348229	Intermediate filament protein
348243	Galactoside-binding lectin
348245	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
348248	Ribosomal L39 protein
348250	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
348251	NADH-ubiquinone/plastoquinone oxidoreductase chain 6
348251	NADH-ubiquinone/plastoquinone oxidoreductase, chain 3
348257	TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases
348258	bZIP Maf transcription factor. Maf transcription factors contain a conserved basic region leucine zipper (bZIP) domain, which mediates their dimerization and DNA binding property. Thus, this family is probably related to pfam00170
348263	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
348267	Ribosomal protein L6e
348267	Ribosomal protein L6, N-terminal domain
348269	Eukaryotic phosphomannomutase. This enzyme EC:5.4.2.8 is involved in the synthesis of the GDP-mannose and dolichol-phosphate-mannose required for a number of critical mannosyl transfer reactions
348277	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
348278	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
348280	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
348282	Ribosomal protein S8
348283	MCM2/3/5 family
348285	Ribosomal protein L11, RNA binding domain
348286	Bacterial extracellular solute-binding protein, family 7. This family of proteins are involved in binding extracellular solutes for transport across the bacterial cytoplasmic membrane. This family includes a C4-dicarboxylate-binding protein
348289	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
348292	Ribosomal protein L10
348296	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
348297	Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity
348321	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
348324	Ribosomal protein L13e
348327	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
348327	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
348327	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
348330	Ribosomal protein S5, N-terminal domain
348331	Trypsin
348337	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
348339	7 transmembrane receptor (rhodopsin family)
348340	Ribosomal family S4e
348346	7 transmembrane receptor (rhodopsin family)
348347	Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase
348349	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
348350	YjeF-related protein N-terminus
348354	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
348360	P21-ARC (ARP2/3 complex 21 kDa subunit). The seven component ARP2/3 actin-organizing complex is involved in actin assembly and function
348362	Cyclophilin type peptidyl-prolyl cis-trans isomerase
348364	Ribosomal protein L21e
348372	Ribosomal protein S27a. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP). The S27a domain compromises the C-terminal half of the protein. The synthesis of
348375	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
348376	Ribosomal protein L10
348380	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
348385	Intermediate filament protein
348394	Ribosomal protein L13e
348395	Cytochrome c oxidase subunit Vb
348402	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
348402	Endonuclease/Exonuclease/phosphatase family. This large family of proteins includes magnesium dependent endonucleases and a large number of phosphatases involved in intracellular signalling. This family includes: AP endonuclease proteins EC:4.2.99.18, DNa
348415	Eukaryotic porin
348419	Hsp90 protein
348419	Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90
348420	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
348421	Ribosomal protein S12
348423	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
348424	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
348429	Guanylate-binding protein, C-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP
348434	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
348435	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
348435	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
348436	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
348440	Cytochrome C and Quinol oxidase polypeptide I
348441	Ribosomal protein S17
348447	Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical
348449	7 transmembrane receptor (rhodopsin family)
348451	Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This
348453	7 transmembrane receptor (metabotropic glutamate family)
348453	Cell cycle protein. This entry includes the following members
348453	Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure
348459	Dynein light chain type 1
348463	Actin
348464	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
348467	Cytochrome c oxidase subunit III
348471	Cyclophilin type peptidyl-prolyl cis-trans isomerase
348472	Cyclophilin type peptidyl-prolyl cis-trans isomerase
348477	Intermediate filament protein
348477	Intermediate filament protein
348478	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
348478	Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone
348479	Zinc finger, C3HC4 type (RING finger)
348512	Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), whic
348516	7 transmembrane receptor (rhodopsin family)
348517	7 transmembrane receptor (rhodopsin family)
348518	7 transmembrane receptor (rhodopsin family)
348519	7 transmembrane receptor (rhodopsin family)
348526	Octicosapeptide repeat. Short motif that may bind Ca2+
348539	Intermediate filament protein
348546	Ribosomal protein S16
348548	Ribosomal protein L19e
348550	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
348556	Ribosomal L15
348563	Ribosomal protein S5, C-terminal domain
348563	Ribosomal protein S5, N-terminal domain
348566	7 transmembrane receptor (rhodopsin family)
348570	Cyclophilin type peptidyl-prolyl cis-trans isomerase
348573	Translationally controlled tumor protein
348589	SEA domain. Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains
348590	Calcium-activated potassium channel, beta subunit
348598	Gamma-glutamyltranspeptidase
348600	Phosphatidylinositol 3- and 4-kinase
348602	Gamma-glutamyltranspeptidase
348607	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
348618	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
348619	ADP-ribosylation factor family
348619	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
348621	Ribosomal S3Ae family
348628	Cytochrome c oxidase subunit III
348633	Actin
348653	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
348662	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
348664	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
348668	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
348670	S-adenosyl-L-homocysteine hydrolase
348675	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
348677	Sulfotransferase protein
348678	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
348679	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
348679	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
348688	Cyclophilin type peptidyl-prolyl cis-trans isomerase
348691	Ribosomal protein L21e
348693	Citrate synthase
348695	Ribosomal protein S2
348698	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
348705	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
348720	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
348722	Poly-adenylate binding protein, unique domain
348726	metallopeptidase family M24
348729	Ribosomal protein L21e
348731	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
348732	NADH dehydrogenase
348733	NADH-ubiquinone/plastoquinone oxidoreductase chain 4L
348733	NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane
348737	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
348742	TT viral orf 1. TT virus (TTV), isolated initially from a Japanese patient with hepatitis of unknown aetiology, has since been found to infect both healthy and diseased individuals and numerous prevalence studies have raised questions about its role in un
348745	Ribosomal protein L21e
348746	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
348750	AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt
348752	Cytochrome b(C-terminal)/b6/petD
348752	Cytochrome C and Quinol oxidase polypeptide I
348753	Cyclophilin type peptidyl-prolyl cis-trans isomerase
348754	Ribosomal protein L6
348755	Ribosomal protein S27
348756	Ribosomal protein L31e
348757	HMG (high mobility group) box
348769	PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains
348772	Ribosomal protein L21e
348774	HMG (high mobility group) box
348774	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
348775	G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines
348778	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
348787	Cytochrome C and Quinol oxidase polypeptide I
348791	FAD binding domain. This family includes members that bind FAD. This family includes the flavoprotein subunits from succinate and fumarate dehydrogenase, aspartate oxidase and the alpha subunit of adenylylsulfate reductase
348793	WD domain, G-beta repeat
348794	Ribosomal protein L36e
348796	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
348800	7 transmembrane receptor (rhodopsin family)
348806	Ribosomal L10
348810	Translationally controlled tumor protein
348811	HMG (high mobility group) box
348813	Glycosyl transferases group 1. Mutations in this domain may lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharid
348814	Elongation factor 1 gamma, conserved domain
348814	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
348819	Insulinase (Peptidase family M16)
348822	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
348823	Ribosomal L39 protein
348826	Cytochrome c oxidase subunit III
348826	NADH-ubiquinone/plastoquinone oxidoreductase, chain 3
348828	Actin
348829	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
348832	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
348838	Cyclophilin type peptidyl-prolyl cis-trans isomerase
348841	Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacte
348841	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
348844	Gastrin/cholecystokinin family
348845	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
348847	Ribosomal protein L6e
348848	Tetraspanin family
348852	Ribosomal protein L6e
348852	Ribosomal protein L6, N-terminal domain
348856	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
348860	Transcription initiation factor TFIID 23-30kDa subunit
348863	Intermediate filament protein
348867	Ribosomal S3Ae family
348868	Ribosomal protein S11
348869	AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-
348870	NADH-ubiquinone/plastoquinone oxidoreductase, chain 3
348871	Cytochrome C and Quinol oxidase polypeptide I
348871	Cytochrome C oxidase subunit II, periplasmic domain
348872	HMG14 and HMG17
348873	NADH-ubiquinone/plastoquinone oxidoreductase chain 6
348879	Ribosomal protein L19e
348884	Ribosomal protein L6e
348885	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
348889	Ribosomal L22e protein family
348895	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
348901	Ubiquitin carboxyl-terminal hydrolase family 2
348901	Ubiquitin carboxyl-terminal hydrolases family 2
348902	Ubiquitin carboxyl-terminal hydrolase family 2
348902	Ubiquitin carboxyl-terminal hydrolases family 2
348903	Ubiquitin carboxyl-terminal hydrolase family 2
348903	Ubiquitin carboxyl-terminal hydrolases family 2
348904	Ubiquitin carboxyl-terminal hydrolase family 2
348904	Ubiquitin carboxyl-terminal hydrolases family 2
348905	Ubiquitin carboxyl-terminal hydrolase family 2
348905	Ubiquitin carboxyl-terminal hydrolases family 2
348906	Ubiquitin carboxyl-terminal hydrolase family 2
348906	Ubiquitin carboxyl-terminal hydrolases family 2
348907	Ubiquitin carboxyl-terminal hydrolases family 2
348908	Ubiquitin carboxyl-terminal hydrolase family 2
348908	Ubiquitin carboxyl-terminal hydrolases family 2
348909	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
348911	Herpes virus major capsid protein. This family represents the major capsid protein (MCP) of herpes viruses. The capsid shell consists of 150 MCP hexamers and 12 MCP pentamers. One pentamer is found at each of the 12 apices of the icosahedral shell, and th
348916	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
348917	Myo-inositol-1-phosphate synthase. This is a family of myo-inositol-1-phosphate synthases. Inositol-1-phosphate catalyses the conversion of glucose-6- phosphate to inositol-1-phosphate, which is then dephosphorylated to inositol. Inositol phosphates play
348920	UDP-glucoronosyl and UDP-glucosyl transferase
348921	UDP-glucoronosyl and UDP-glucosyl transferase
348922	UDP-glucoronosyl and UDP-glucosyl transferase
348923	UDP-glucoronosyl and UDP-glucosyl transferase
348932	Sodium:neurotransmitter symporter family
348934	Ribosomal protein S12
348948	Cytochrome c oxidase subunit VIIa. Cytochrome c oxidase, a 13 sub-unit complex, is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of the heart and liver isoforms of cytochrome c oxidase subunit VIIa
348949	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
348949	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
348951	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
348952	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
348953	HMG (high mobility group) box
348955	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
348956	Ribosomal protein L21e
348961	Mago nashi protein. This family was originally identified in Drosophila and called mago nashi, it is a strict maternal effect, grandchildless-like, gene. The human homologue has been shown to interact with an RNA binding protein. An RNAi knockout of the C
348963	Ribosomal protein S19e
348965	Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases
348966	Adenylate kinase
348967	Ribosomal protein L31e
348975	Core histone H2A/H2B/H3/H4
348978	CoA-ligase. This family includes the CoA ligases Succinyl-CoA synthetase alpha and beta chains, malate CoA ligase and ATP-citrate lyase. Some members of the family utilise ATP others use GTP
348980	Cyclic nucleotide-binding domain
348980	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
348980	Cyclic nucleotide-binding domain
348980	Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is
348982	Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase
348983	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
348984	Ribosomal L10
348987	Cyclophilin type peptidyl-prolyl cis-trans isomerase
348996	Cytochrome C oxidase subunit II, periplasmic domain
348996	Cytochrome C oxidase subunit II, transmembrane domain. The N-terminal domain of cytochrome C oxidase contains two transmembrane alpha-helices
349000	AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA)
349001	Ribosomal protein S2
349003	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
349004	Ribosomal protein L21e
349004	EF-1 guanine nucleotide exchange domain. This family is the guanine nucleotide exchange domain of EF-1 beta and EF-1 delta chains
349005	Ribosomal L15
349008	Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
349011	S25 ribosomal protein
349013	ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes
349014	Ribosomal protein S26e
349017	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
349018	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
349019	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
349020	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
349021	Ribosomal protein L21e
349025	Ribosomal protein S11
349029	Translationally controlled tumor protein
349029	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
349033	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
349033	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
349035	Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities
349036	Ribosomal L10
349046	Intermediate filament protein
349049	Ubiquitin carboxyl-terminal hydrolase family 2
349052	Ribosomal protein S2
349060	Bacterial regulatory helix-turn-helix protein, lysR family
349060	LysR substrate binding domain. The structure of this domain is known and is similar to the periplasmic binding proteins
349061	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
349065	Protein kinase domain
349066	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
349069	von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094
349073	EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typi
349075	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
349083	Ribosomal protein L21e
349084	Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catal
349088	Vomeronasal organ pheromone receptor family, V1R. This family represents one of two known vomeronasal organ receptor families, the V1R family (after)
349089	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
349105	Ribosomal protein L6e
349109	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
349121	NADH dehydrogenase
349136	WD domain, G-beta repeat
349139	Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryon
349141	UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to
349142	GrpE
349143	ADP-ribosylation factor family
349143	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
349146	Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats
349149	Connexin
349149	Connexin
349154	Peptidase family M3. This is the Thimet oligopeptidase family, large family of mammalian and bacterial oligopeptidases that cleave medium sized peptides. The group also contains mitochondrial intermediate peptidase which is encoded by nuclear DNA but func
349158	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
349173	Ribosomal protein L24e
349177	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
349179	Minor capsid protein VI. This minor capsid protein may act as a link between the external capsid and the internal DNA-protein core. The C-terminal 11 residues may function as a protease cofactor leading to enzyme activation
349182	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
349188	7 transmembrane receptor (rhodopsin family)
349189	Ubiquitin carboxyl-terminal hydrolase family 2
349197	Aminotransferase class I and II
349198	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
349201	Ribosomal protein L44
349203	Ribosomal protein S26e
349206	Pou domain - N-terminal to homeobox domain
349209	Ribosomal protein S2
349210	Eukaryotic ribosomal protein L18
349222	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
349223	Cyclophilin type peptidyl-prolyl cis-trans isomerase
349224	Ribosomal protein L13e
349227	Intermediate filament protein
349230	Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. Structure is an eight-stranded beta barrel
349231	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
349233	Cyclic nucleotide-binding domain
349237	Synaptogyrin. This family of proteins is distantly related to pfam01284
349246	Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins
349253	ATP synthase alpha/beta chain, C terminal domain
349253	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
349253	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
349255	7 transmembrane receptor (metabotropic glutamate family)
349258	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
349259	ATP synthase alpha/beta chain, C terminal domain
349259	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
349261	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling.
349264	ATP synthase alpha/beta chain, C terminal domain
349264	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
349264	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
349266	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
349267	ATP synthase alpha/beta chain, C terminal domain
349267	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
349280	Calponin family repeat
349280	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
349284	Calponin family repeat
349284	Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is no
349285	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
349288	7 transmembrane receptor (Secretin family)
349288	Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin
349289	Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. These growth factors cause dimerisation of their tyrosine kinase receptors leading to intracellular signaling.
349293	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
349294	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
349304	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
349311	pfam02879, PGM_PMM_II, Phosphoglucomutase/phosphomannomutase, alpha/beta/alpha domain II
349314	Major intrinsic protein. MIP (Major Intrinsic Protein) family proteins exhibit essentially two distinct types of channel properties: (1) specific water transport by the aquaporins, and (2) small neutral solutes transport, such as glycerol by the glycerol
349318	Fork head domain
349320	ATP synthase alpha/beta chain, C terminal domain
349320	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
349325	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
349326	Protein kinase domain
349326	pfkB family carbohydrate kinase. This family includes a variety of carbohydrate and pyrimidine kinases. The family includes phosphomethylpyrimidine kinase EC:2.7.4.7. This enzyme is part of the Thiamine pyrophosphate (TPP) synthesis pathway, TPP is an ess
349331	Protein of unknown function DUF82. This prokaryotic protein family has no known function. The protein contains four conserved cysteines that may be involved in metal binding or disulphide bridges
349334	Fork head domain
349335	Cobalamin synthesis protein/P47K. This family of proteins contains P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression, and the cobW gene product, which may be involved in cobalamin biosynthesis in Pseudomonas denitrificans
349345	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
349356	Actin
349363	ATP synthase alpha/beta family, nucleotide-binding domain. This family includes the ATP synthase alpha and beta subunits the ATP synthase associated with flagella
349368	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
349369	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
349370	Homeobox domain
349371	PMP-22/EMP/MP20/Claudin family
349372	Homeobox domain
349373	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
349373	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
349374	C-type lysozyme/alpha-lactalbumin family. Alpha-lactalbumin is the regulatory subunit of lactose synthase, changing the substrate specificity of galactosyltransferase from N-acetylglucosamine to glucose. C-type lysozymes are secreted bacteriolytic enzymes
349375	Pancreatic hormone peptide
349376	NADH dehydrogenase
349380	Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel
349380	Elongation factor Tu domain 2. Elongation factor Tu consists of three structural domains, this is the second domain. This domain adopts a beta barrel structure. This the second domain is involved in binding to charged tRNA. This domain is also found in ot
349384	Ribosomal L18ae protein family
349385	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
349386	Homeobox domain
349395	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
349397	Lysosome-associated membrane glycoprotein (Lamp)
349398	LIM domain. This family represents two copies of the LIM structural domain
349400	Ets-domain
349400	Sterile alpha motif (SAM)/Pointed domain
349404	Protein kinase domain
349405	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
349406	Ribosomal protein S2
349406	Mpv17 / PMP22 family. The 22-kDa peroxisomal membrane protein (PMP22) is a major component of peroxisomal membranes. PMP22 seems to be involved in pore forming activity and may contribute to the unspecific permeability of the organelle membrane. PMP22 is
349409	ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown
349411	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
349412	Ribosomal protein S7p/S5e. This family contains ribosomal protein S7 from prokaryotes and S5 from eukaryotes
349413	Cation-dependent mannose-6-phosphate receptor
349414	Cyclophilin type peptidyl-prolyl cis-trans isomerase
349415	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
349418	WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro
349419	7 transmembrane receptor (rhodopsin family)
349424	AIR synthase related protein, C-terminal domain. This family includes Hydrogen expression/formation protein HypE, AIR synthases EC:6.3.3.1, FGAM synthase EC:6.3.5.3 and selenide, water dikinase EC:2.7.9.3. The function of the C-terminal domain of AIR synt
349426	Glu/Leu/Phe/Val dehydrogenase, dimerisation domain
349426	Glutamate/Leucine/Phenylalanine/Valine dehydrogenase
349434	Mov34/MPN/PAD-1 family. Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain. It has b
349436	LIM domain. This family represents two copies of the LIM structural domain
349439	SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1) , DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin
349441	Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are significantly similar and contain identical active site residues
349443	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
349444	F-actin capping protein alpha subunit
349446	Sulfatase
349448	Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial
349450	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
349479	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
349509	Homeobox domain
349519	Glu/Leu/Phe/Val dehydrogenase, dimerisation domain
349519	Dual specificity phosphatase, catalytic domain. Ser/Thr and Tyr protein phosphatases. The enzyme's tertiary fold is highly similar to that of tyrosine-specific phosphatases, except for a "recognition" region
349550	7 transmembrane receptor (rhodopsin family)
349556	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
349557	CUB domain
349589	Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure
349599	Ribosomal protein L30p/L7e. This family includes prokaryotic L30 and eukaryotic L7
349603	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
349611	Cyclophilin type peptidyl-prolyl cis-trans isomerase
349624	ICE-like protease (caspase) p10 domain
349624	ICE-like protease (caspase) p20 domain
349625	ICE-like protease (caspase) p10 domain
349625	ICE-like protease (caspase) p20 domain
349632	Ribosomal S17
349636	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
349642	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
349642	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
349646	Glutamine synthetase, catalytic domain
349646	Glutamine synthetase, beta-Grasp domain
349648	Intermediate filament protein
349664	7 transmembrane receptor (rhodopsin family)
349666	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
349668	Ribosomal family S4e
349669	Presenilin. Mutations in presenilin-1 are a major cause of early onset Alzheimer's disease. It has been found that presenilin-1 binds to beta-catenin in-vivo. This family also contains SPE proteins from C.elegans
349673	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
349675	Ribosomal S3Ae family
349677	7 transmembrane receptor (rhodopsin family)
349698	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
349712	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
349730	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
349734	Ribosomal protein L44
349741	Ribosomal protein L34e
349758	RanBP1 domain
349772	Ribosomal protein L21e
349774	Anaphase-promoting complex, subunit 10 (APC10)
349781	Ribosomal protein L31e
349792	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
349796	Cyclophilin type peptidyl-prolyl cis-trans isomerase
349803	Ribosomal protein S17
349814	Homeobox domain
349815	NAD binding domain of 6-phosphogluconate dehydrogenase. The NAD binding domain of 6-phosphogluconate dehydrogenase adopts a Rossman fold
349815	6-phosphogluconate dehydrogenase, C-terminal domain. This family represents the C-terminal all-alpha domain of 6-phosphogluconate dehydrogenase. The domain contains two structural repeats of 5 helices each
349816	Ribosomal L10
349818	Nucleolar RNA-binding protein, Nop10p family. Nop10p is a nucleolar protein that is specifically associated with H/ACA snoRNAs. It is essential for normal 18S rRNA production and rRNA pseudouridylation by the ribonucleoprotein particles containing H/ACA s
349820	Ribosomal protein S6e
349824	Ribosomal protein S3, C-terminal domain. This family contains a central domain pfam00013, hence the amino and carboxyl terminal domains are stored separately. This is a minimal carboxyl-terminal domain. Some are much longer
349831	Ribosomal protein S6e
349842	HMG14 and HMG17
349842	G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase
349848	Ribosomal L39 protein
349861	Homeobox domain
349869	Chaperonin 10 Kd subunit
349875	Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase doma
349881	Ribosomal protein L11, RNA binding domain
349881	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
349884	GrpE
349886	Intermediate filament protein
349892	Hsp90 protein
349898	Ribosomal protein S6e
349899	7 transmembrane receptor (rhodopsin family)
349907	ATP synthase subunit C
349933	Eukaryotic porin
349948	pfam02936, COX4, Cytochrome c oxidase subunit IV. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit IV. The Dictyostelium
349954	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
349954	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
349957	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
349960	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
349963	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
349963	DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA
349974	Plectin/S10 domain. This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding
349996	Nucleoside diphosphate kinase
349997	NADH-ubiquinone/plastoquinone oxidoreductase, chain 3
350003	Protein-tyrosine phosphatase
350005	WD domain, G-beta repeat
350006	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
350009	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
350010	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
350012	Ribosomal L39 protein
350018	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
350042	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
350042	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
350048	Ribosomal protein L37e. This family includes ribosomal protein L37 from eukaryotes and archaebacteria. The family contains many conserved cysteines and histidines suggesting that this protein may bind to zinc
350052	Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-rib
350053	Ribosomal protein L21e
350070	Ribosomal L22e protein family
350077	Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacter
350082	Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich
350104	Ribosomal protein S2
350121	Dynamin GTPase effector domain
350125	Dynamin GTPase effector domain
350127	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
350131	Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins
350144	Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anch
350154	Ribosomal protein S15
350156	Ribosomal protein L21e
350156	SCAN domain. The SCAN domain (named after SRE-ZBP, CTfin51, AW-1 and Number 18 cDNA) is found in several pfam00096 proteins. The domain has been shown to be able to mediate homo- and hetero-oligomerisation
350178	HMG14 and HMG17
350203	Poly-adenylate binding protein, unique domain
350223	7 transmembrane receptor (rhodopsin family)
350244	NLI interacting factor. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase
350264	Phospholipase A2. Phospholipase A2 releases fatty acids from the second carbon group of glycerol. Perhaps the best known members are secreted snake venoms, but also found in secreted pancreatic and membrane-associated forms. Structure is all-alpha, with t
350292	Cyclophilin type peptidyl-prolyl cis-trans isomerase
350330	NADH-ubiquinone/plastoquinone oxidoreductase chain 6
350351	ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporte
350354	Protein kinase domain
350363	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
350378	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
350383	7 transmembrane receptor (rhodopsin family)
350383	7 transmembrane receptor (rhodopsin family)
350393	Dihydrouridine synthase (Dus). Members of this family catalyse the reduction of the 5,6-double bond of a uridine residue on tRNA. Dihydrouridine modification of tRNA is widely observed in prokaryotes and eukaryotes, and also in some archae. Most dihydrour
350426	SH2 domain
350431	Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine
350451	C2 domain
350455	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
350457	Inward rectifier potassium channel
350462	Lipoxygenase
350485	Translation initiation factor SUI1
350496	pfam02937, COX6C, Cytochrome c oxidase subunit VIc. Cytochrome c oxidase, a 13 sub-unit complex, EC:1.9.3.1 is the terminal oxidase in the mitochondrial electron transport chain. This family is composed of cytochrome c oxidase subunit VIc
350515	Helicase conserved C-terminal domain. This domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase
350543	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
350568	Ribosomal protein S2
350588	Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1
350601	D-isomer specific 2-hydroxyacid dehydrogenase, NAD binding domain. This domain is inserted into the catalytic domain, the large dehydrogenase and D-lactate dehydrogenase families in SCOP. N-terminal portion of which is represented by family pfam00389
350604	Galactoside-binding lectin
350614	7 transmembrane receptor (metabotropic glutamate family)
350616	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
350616	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
350617	7 transmembrane receptor (rhodopsin family)
350623	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
350628	7 transmembrane receptor (rhodopsin family)
350631	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
350632	Ribosomal L39 protein
350634	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
350636	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
350638	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
350639	Ribosomal L18p/L5e family. This family includes ribosomal proteins from the large subunit. This family includes L18 from bacteria and L5 from eukaryotes. It has been shown for one member that the amino terminal 93 amino acids are necessary and sufficient
350641	Retroviral GAG p10 protein. This family consists of various retroviral GAG (core) polyproteins and encompasses the p10 region producing the p10 protein upon proteolytic cleavage of GAG by retroviral protease. The p10 or matrix protein (MA) is associated w
350642	Ribosomal protein S19e
350652	RNase H. RNase H digests the RNA strand of an RNA/DNA hybrid. Important enzyme in retroviral replication cycle, and often found as a domain associated with reverse transcriptases. Structure is a mixed alpha+beta fold with three a/b/a layers
350671	Ribosomal protein S24e
350678	Ribosomal L39 protein
350686	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
350689	Cytochrome C and Quinol oxidase polypeptide I
350693	7 transmembrane receptor (rhodopsin family)
350694	Core histone H2A/H2B/H3/H4
350694	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
350695	Core histone H2A/H2B/H3/H4
350696	Core histone H2A/H2B/H3/H4
350701	Elongation factor 1 gamma, conserved domain
350704	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
350718	7 transmembrane receptor (rhodopsin family)
350721	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
350723	Cytochrome c oxidase subunit III
350744	Lectin C-type domain. This family includes both long and short form C-type
350746	Ribosomal protein S26e
350747	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
350766	Ribosomal protein S5, N-terminal domain
350767	CUB domain
350778	Ribosomal protein S13/S18. This family includes ribosomal protein S13 from prokaryotes and S18 from eukaryotes
350782	HesB-like domain. This family includes HesB which may be involved in nitrogen fixation
350792	7 transmembrane receptor (rhodopsin family)
350793	Pentaxin family. Pentaxins are also known as pentraxins
350799	Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase
350802	Amidase
350806	Ribosomal protein S5, C-terminal domain
350806	Ribosomal protein S5, N-terminal domain
350815	NAD dependent epimerase/dehydratase family. This family of proteins utilise NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions
350815	3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursor
350816	FKBP-type peptidyl-prolyl cis-trans isomerase
350818	Ribosomal L39 protein
350822	Ribosomal protein L6e
350843	7 transmembrane receptor (rhodopsin family)
350864	Calpain family cysteine protease
350864	Sm protein. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major splic
350868	Protein kinase domain
350871	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
350878	Ribosomal protein S7e
350893	7 transmembrane receptor (rhodopsin family)
350921	Ribosomal protein L13e
350955	Appr-1"-p processing enzyme family. This domain is found in a number of otherwise unrelated proteins. This domain is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA vi
350965	Homeobox domain
350982	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
350997	Ribosomal Proteins L2, C-terminal domain
351001	SET domain. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET doma
351022	Herpes virus major capsid protein. This family represents the major capsid protein (MCP) of herpes viruses. The capsid shell consists of 150 MCP hexamers and 12 MCP pentamers. One pentamer is found at each of the 12 apices of the icosahedral shell, and th
351040	LIM domain. This family represents two copies of the LIM structural domain
351047	Uncharacterised protein family (UPF0184)
351049	Syndecan domain. Syndecans are transmembrane heparin sulfate proteoglycans which are implicated in the binding of extracellular matrix components and growth factors
351076	RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases
351098	NADH dehydrogenase
351145	Ribosomal protein L21e
351149	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
351149	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
351150	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
351152	Ribosomal protein L11, RNA binding domain
351152	Ribosomal protein L11, N-terminal domain. The N-terminal domain of Ribosomal protein L11 adopts an alpha/beta fold and is followed by the RNA binding C-terminal domain
351178	ribosomal L5P family C-terminus. This region is found associated with pfam00281
351195	WD domain, G-beta repeat
351207	Glutamine amidotransferases class-II
351209	Zinc finger, C3HC4 type (RING finger)
351209	7 transmembrane receptor (rhodopsin family)
351222	Ribosomal protein S21e
351223	Ribosomal L22e protein family
351225	7 transmembrane receptor (rhodopsin family)
351227	Actin
351240	Ribosomal L15
351253	Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14
351259	Cytochrome C and Quinol oxidase polypeptide I
351259	Cytochrome C oxidase subunit II, periplasmic domain
351273	Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of
351281	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
351298	Ribosomal protein L23
351320	Ribosomal protein S2
351323	Cyclophilin type peptidyl-prolyl cis-trans isomerase
351332	IMP dehydrogenase / GMP reductase. This family is involved in biosynthesis of guanosine nucleotide biosynthesis. Members of this family contain a TIM barrel structure. The alignment does not contain the whole TIM barrel domain. The alignment is truncated
351348	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
351351	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
351383	7 transmembrane receptor (rhodopsin family)
351390	Ferritin-like domain. This family contains ferritins and other ferritin-like proteins such as members of the DPS family and bacterioferritins
351391	Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosylt
351421	Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate
351425	Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein
351458	7 transmembrane receptor (rhodopsin family)
351465	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
351471	Histone-like transcription factor (CBF/NF-Y) and archaeal histone. This family includes archaebacterial histones and histone like transcription factors from eukaryotes
351494	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
351507	ATP synthase subunit C
351508	Actin
351515	Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria
351525	Intermediate filament protein
351532	Ribosomal protein L23
351567	Fructose-bisphosphate aldolase class-I
351575	HIT family
351590	Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex bind
351593	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
351596	Protein kinase domain
351597	Furin-like cysteine rich region
351597	Giardia variant-specific surface protein
351600	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
351607	Ribosomal protein L31e
351638	Elongation factor 1 gamma, conserved domain
351638	Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
351638	Glutathione S-transferase, C-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation f
351642	Ribosomal protein S5, C-terminal domain
351642	Ribosomal protein S5, N-terminal domain
351655	Adenylate kinase
351672	Zinc finger, C3HC4 type (RING finger)
351672	SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues
351677	NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane
351682	Trypsin
351726	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
351732	Ribosomal S3Ae family
351746	Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the clea
351753	NADH-Ubiquinone/plastoquinone (complex I), various chains. This family is part of complex I which catalyses the transfer of two electrons from NADH to ubiquinone in a reaction that is associated with proton translocation across the membrane
351780	Glutamine amidotransferases class-II
351799	Cytochrome C and Quinol oxidase polypeptide I
351811	Cytochrome b(C-terminal)/b6/petD
351813	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
351814	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
351819	Actin
351835	Ribosomal L10
351835	Ribosomal protein S2
351853	NADH-ubiquinone/plastoquinone oxidoreductase chain 6
351864	BAR domain. The BAR domain is found in amphiphysin and clathrin binding protein. However the function of this domain is unknown
351864	RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains
351864	SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organization. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel
351870	Ribosomal protein L6e
351883	Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes
351890	Core histone H2A/H2B/H3/H4
351899	Ribosomal L28e protein family
351911	Ribosomal protein L1p/L10e family. This family includes prokaryotic L1 and eukaryotic L10
351942	Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-termin
351942	Glycine cleavage H-protein. This is a family of glycine cleavage H-proteins, part of the glycine cleavage multienzyme complex (GCV) found in bacteria and the mitochondria of eukaryotes. GCV catalyses the catabolism of glycine in eukaryotes. A lipoyl group
351943	Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde
351949	Ribosomal family S4e
351951	Intermediate filament protein
351953	Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain
351953	pfam02882, THF_DHG_CYH_C, Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain
351961	Actin
351967	Ribosomal protein L13
351969	Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14
351984	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
352006	Intermediate filament protein
352012	Mitochondrial carrier protein
352013	Ribosomal protein L23
352019	Ribosomal family S4e
352027	Ribosomal protein S2
352029	Ribosomal S3Ae family
352037	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
352038	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
352054	Ribosomal protein L6
352054	Ribosomal protein S7e
352097	Spumavirus gag protein
352097	Virulence determinant. The UK protein is an African swine fever virus (ASFV) protein that is highly conserved amongst strains, and is an important viral virulence determinant for domestic pigs
352103	Ribosomal family S4e
352108	Ribosomal protein L6e
352118	Core histone H2A/H2B/H3/H4
352119	Ribosomal Proteins L2, C-terminal domain
352124	Ribosomal L15
352145	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
352146	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
352152	Ribosomal protein L31e
352156	GDNF receptor family. This family consists of Glial-cell-line-derived neurotrophic factor (GDNF) and neurturin (NTN) these receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuron
352157	Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays
352158	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
352160	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
352161	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
352162	Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS)
352170	AMP-binding enzyme
352172	lactate/malate dehydrogenase, alpha/beta C-terminal domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also member
352172	lactate/malate dehydrogenase, NAD binding domain. L-lactate dehydrogenases are metabolic enzymes which catalyse the conversion of L-lactate to pyruvate, the last step in anaerobic glycolysis. L-2-hydroxyisocaproate dehydrogenases are also members of the f
352193	Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. The high cutoff is so high to avoid overlaps with related families
352196	KH domain. KH motifs can bind RNA in vitro . Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia
352230	Cytochrome C and Quinol oxidase polypeptide I
352230	Cytochrome C oxidase subunit II, periplasmic domain
352236	pfam02946, GTF2I, GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain
352266	Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from euka
352272	Ribosomal protein S27
352278	Mitochondrial carrier protein
352327	SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase
352327	AIR carboxylase. Members of this family catalyse the decarboxylation of 1-(5-phosphoribosyl)-5-amino-4-imidazole-carboxylate (AIR). This family catalyse the sixth step of de novo purine biosynthesis. Some members of this family contain two copies of this
352347	Dynein light chain type 1
352353	Ribosomal S3Ae family
352368	Arginosuccinate synthase. This family contains a PP-loop motif
352391	Ribosomal S17
352391	Prothymosin/parathymosin family. Prothymosin alpha and parathymosin are two ubiquitous small acidic nuclear proteins that are thought to be involved in cell cycle progression, proliferation, and cell differentiation
352392	Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes
352393	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
352400	DnaJ domain. DnaJ domains (J-domains) are associated with hsp70 heat-shock system and it is thought that this domain mediates the interaction. DnaJ-domain is therefore part of a chaperone (protein folding) system. The T-antigens are confirmed as DnaJ cont
352428	Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p
352436	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
352454	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
352455	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
352455	Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
352457	7 transmembrane receptor (rhodopsin family)
352459	Ribosomal L10
352462	Ubiquitin carboxyl-terminal hydrolase family 2
352462	Ubiquitin carboxyl-terminal hydrolases family 2
352463	Defensin propeptide
352464	Defensin propeptide
352464	Mammalian defensin
352465	Ribosomal S3Ae family
352466	7 transmembrane receptor (rhodopsin family)
352467	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
352468	TCP-1/cpn60 chaperonin family. This family includes members from the HSP60 chaperone family and the TCP-1 (T-complex protein) family
352473	Ribosomal protein L6e
352492	Glyceraldehyde 3-phosphate dehydrogenase, NAD binding domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. N-terminal domain is a Rossman NAD(P) binding fold
352492	Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain. GAPDH is a tetrameric NAD-binding enzyme involved in glycolysis and glyconeogenesis. C-terminal domain is a mixed alpha/antiparallel beta fold
352507	Core histone H2A/H2B/H3/H4
352524	Copper/zinc superoxide dismutase (SODC). superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the copper/zinc-binding family is one. Defects in
352559	Protein kinase domain
352582	Ribosomal protein L13
352617	Fructose-1-6-bisphosphatase
352618	Papain family cysteine protease
352618	Peptidase C1-like family. This family is closely related to the Peptidase_C1 family pfam00112, containing several prokaryotic and eukaryotic aminopeptidases and bleomycin hydrolases
352625	F5/8 type C domain. This domain is also known as the discoidin (DS) domain family. The bacterial examples are not yet included in the SEED alignment and are only found with low scores
352626	6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP. T
352628	FKBP-type peptidyl-prolyl cis-trans isomerase
352629	6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP. T
352631	Skp1 family, dimerisation domain
352631	Skp1 family, tetramerisation domain
352632	Skp1 family, dimerisation domain
352632	Skp1 family, tetramerisation domain
352633	FKBP-type peptidyl-prolyl cis-trans isomerase
352636	6-pyruvoyl tetrahydropterin synthase. 6-Pyruvoyl tetrahydrobiopterin synthase catalyses the conversion of dihydroneopterin triphosphate to 6-pyruvoyl tetrahydropterin, the second of three enzymatic steps in the synthesis of tetrahydrobiopterin from GTP. T
352637	FKBP-type peptidyl-prolyl cis-trans isomerase
352638	Skp1 family, dimerisation domain
352638	Skp1 family, tetramerisation domain
352640	FKBP-type peptidyl-prolyl cis-trans isomerase
352642	7 transmembrane receptor (metabotropic glutamate family)
352645	KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons.
352656	Zinc finger, C3HC4 type (RING finger)
352659	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
352662	Zinc finger, C3HC4 type (RING finger)
352662	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
352663	FKBP-type peptidyl-prolyl cis-trans isomerase
352664	7 transmembrane receptor (metabotropic glutamate family)
352669	Transcription factor S-II (TFIIS)
352670	Zinc finger, C3HC4 type (RING finger)
352670	Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involve
352671	GDA1/CD39 (nucleoside phosphatase) family
352671	3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of E. coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-termin
352689	Cyclophilin type peptidyl-prolyl cis-trans isomerase
352699	Interferon alpha/beta domain
352700	Interferon alpha/beta domain
352733	Nucleoside diphosphate kinase
352746	Ribosomal protein L31e
352788	Ribosomal protein L19e
352792	Aminotransferase class I and II
352793	MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumors but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of t
352807	Ribosomal protein L19e
352814	Ribosomal protein L35Ae
352832	Ribosomal S17
352837	Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks
352840	Cytochrome b(N-terminal)/b6/petB
352852	Cytochrome c. The cytochrome 556 and cytochrome c' families are not included
352861	Sodium:neurotransmitter symporter family
352864	ADP-ribosylation factor family
352871	D12 class N6 adenine-specific DNA methyltransferase
352879	Senescence marker protein-30 (SMP-30). SMP-30, also known as regucalcin, seems to play a critical role in the highly differentiated functions of the liver and kidney and to exert a major impact on Ca2+ homeostasis
352882	E1-E2 ATPase
352882	haloacid dehalogenase-like hydrolase. This family are structurally different from the alpha/ beta hydrolase family (pfam00561). This family includes L-2-haloacid dehalogenase, epoxide hydrolases and phosphatases. The structure of the family consists of tw
352893	Nucleosome assembly protein (NAP). It is thought that NAPs may be involve in regulating gene expression as a result of histone accessibility
352896	Ocular albinism type 1 protein
352897	RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternativ
353155	Connexin
353169	Sugar (and other) transporter
353172	WHEP-TRS domain
353174	pfam02931, Neur_chan_LBD, Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure
353187	Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors
353187	Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other
353188	Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endo
353188	Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is no
353189	Organic Anion Transporter Polypeptide (OATP) family, C-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
353189	Organic Anion Transporter Polypeptide (OATP) family, N-terminus. This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution
353190	YjeF-related protein N-terminus
353204	Fructose-bisphosphate aldolase class-I
353218	TNF(Tumor Necrosis Factor) family
353233	Beige/BEACH domain
353250	7 transmembrane receptor (rhodopsin family)
353252	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
353258	Low temperature viability protein
353281	Leucine rich repeat C-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is
353288	Intermediate filament protein
353307	Nucleoside transporter. This is a family of nucleoside transporters. In mammalian cells nucleoside transporters transport nucleoside across the plasma membrane and are essential for nucleotide synthesis via the salvage pathways for cells that lack their
353328	von Willebrand factor type D domain
353328	Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that
353344	7 transmembrane receptor (rhodopsin family)
353345	7 transmembrane receptor (rhodopsin family)
353346	7 transmembrane receptor (rhodopsin family)
353376	emp24/gp25L/p24 family. Members of this family are implicated in bringing cargo forward from the ER and binding to coat proteins by their cytoplasmic domains
353498	Cytochrome P450. Cytochrome P450s are involved in the oxidative degradation of various compounds. Particularly well known for their role in the degradation of environmental toxins and mutagens. Structure is mostly alpha, and binds a heme cofactor
353500	TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein
359725	short chain dehydrogenase. This family contains a wide variety of dehydrogenases
359726	Protein kinase domain
360034	Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas
360202	Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits
